Skip to main content

Fixed Bugs With Saving Locally, The Analyzer, And a New Recommendation Engine

Bug Fixes

Today's update takes care of a couple bugs:
  1. Saving locally to your computer with Regex Hero Professional would fail with certain special characters.
  2. The analyzer would fail in rare situations involving very complex character classes.

Recommendation Engine

And lastly, I've included a beta version of a new recommendation engine. This is available only to users of Regex Hero Professional. The recommendations that are produced are all related to performance. And the type of recommendations that are produced are limited at this point. Often you'll see the message, "No recommendations found." This is something I intend to continue to work on and improve in the upcoming months.

Here are the possible recommendations I've included so far:

  1. IgnoreCase is not needed and slows down processing. Please disable it. This one determines when the IgnoreCase flag isn't doing anything for you. For example, \w matches word characters and it's case insensitive, so adding the IgnoreCase just for that would be pointless and would slow down the regular expression.
  2. Redundant quantifiers may be slow. Please remove the first quantifier. This identifies situations such as x+x+. The '+' quantifier used back to back on the same character does nothing but make the regular expression much slower than it should be. This can be simplified to xx+.
  3. Alternations are slow. Please change to a character class. This identifies single character alternations such as a|b|c. It is slightly more efficient to use [abc] instead.
  4. Do not repeat 3 or more characters. Use a numbered quantifier instead. This is a minor one, but rather than \w\w\w you can use \w{3} and see slightly improved performance. The performance gains are greater the more characters you're dealing with.
  5. Do not perform case insensitive matching with a character class. Use the IgnoreCase option instead. In some old regex implementations, there was no IgnoreCase flag. So the workaround would be to explicitly include both cases, e.g. [Aa][Bb][Cc]. But there's no need for that anymore, and it's more efficient to just use the IgnoreCase option.
There will be more rules coming, as well as improvements to the intelligence and guidance behind these existing rules. But the big feature to come next is to actually allow you to simply click a button to fix the problem.



Comments

Popular posts from this blog

Regex Hero for Windows 10 is Underway

Awhile back I began working on an HTML5 / JavaScript version of Regex Hero . However, it was a huge undertaking essentially requiring a complete rewrite of the entire application. I have not had enough time to dedicate to this lately. So I've begun again, this time rewriting Regex Hero to work in WPF. It'll be usable in Windows 10 and downloadable from the Microsoft Store. This is a much easier task that also has the advantage of running the .NET regex library from the application itself. This will allow for the same speedy experience of testing your regular expressions and getting instant feedback that Regex Hero users have always enjoyed. I expect the first release to be ready in Q4 of 2019.

Optimizing Your Regular Expressions

Regular expressions will backtrack.  That's an unfortunate thing about them because backtracking can be slow.    And in certain (rare) cases the performance can become so awful that executing the regular expression against a relatively short string could take over a minute.  There's a good article about catastrophic backtracking over at regular-expressions.info . And today I created a video about all of this called  Regex Lesson 5: Optimization .  In the video I start with a very poorly written regular expression and make several improvements to it, using the benchmarking feature along the way.  By the end of the video I make the regular expression over 3 million times faster. In addition, today's update to Regex Hero provides a little message in the event that you encounter a regular expression that takes over 10 seconds to evaluate... And then last of all, I changed the benchmarking feature a bit.  In the past it would simply test your regular expression against

Silverlight 4 Coming in April, or Maybe Sooner

The exact release date has not been announced. But Visual Studio 2010 RTM is coming out in April and I think it's safe to assume that Silverlight 4 will be released no later than that. Each release of Silverlight has brought massive improvements over the previous version. And once again, Silverlight 4 does not disappoint. There is a long list of improvements but the ones that I think that will affect Regex Hero are as follows: RichTextBox My plan is to use this in place of all 4 major textboxes in Regex Hero. The new RichTextBox has built-in multiple undos & redos, so I can ditch my home-brewed code. It should be nice to use for syntax highlighting for the regular expressions I intend to create. It also has a built-in API to determine the pixel position of the text. I should be able to use this API and build a new highlighting scheme based off of it. This should do a couple things. First, I should be able to finally fix the problem I had with the ScrollViewer and