News Score: Score the News, Sort the News, Rewrite the Headlines

Fast(er) regular expression engines in Ruby

Introduction With modern, overengineered, and over-obfuscated websites, we at SerpApi face increasing challenges with extracting data from them. Beside the usual HTML parsing, sometimes we're literally forced to fall back to good 'ol regular expressions, e.g. for extracting embedded JS data. And while regexps do the trick, they might come at a cost. Onigmo, the default regexp engine in Ruby, while substantially updated in Ruby 3.2, still has week points that may really upset in terms of scan tim...

Read more at serpapi.com

© News Score  score the news, sort the news, rewrite the headlines