Refining Regex Recommendations

100% Accuracy

Today I built an automated test to verify that the new regex recommendations engine produces valid recommendations. In other words, I want to make sure that none of the recommended changes affect the text that your regular expression matches. The test runs the engine on about 2,000 regular expressions and target strings and checks its accuracy. It turned up a couple of mistakes the engine was making, and today's update fixes them, so we're up to 100% accuracy on that test set.
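To make the idea concrete, here is a minimal sketch of that kind of check. It is written in Python rather than .NET and is not the actual Regex Hero test; the helper names `same_matches` and `failing_cases` and the sample cases are hypothetical. It treats a recommendation as valid when the original and recommended patterns produce identical match positions on the target string.

```python
import re

def same_matches(original, recommended, target, flags=0):
    """Return True if both patterns match exactly the same spans in target."""
    before = [m.span() for m in re.finditer(original, target, flags)]
    after = [m.span() for m in re.finditer(recommended, target, flags)]
    return before == after

def failing_cases(cases):
    """Return every (original, recommended, target) case whose matches differ."""
    return [case for case in cases if not same_matches(*case)]

# Hypothetical sample cases: (original pattern, recommended pattern, target string)
cases = [
    (r"[0-9]+", r"\d+", "order 66, item 1024"),
    (r"(?:a)+", r"a+", "banana"),
    (r"a|b|c", r"[abc]", "cab"),
    (r"[a-z]+", r"[a-y]+", "fizz"),  # deliberately bad recommendation
]

bad = failing_cases(cases)
accuracy = 1 - len(bad) / len(cases)
```

A check like this only shows equivalence on the strings it is given, so its value depends on how varied the test set is.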

Other recommendations

I'm still working on making this feature better. As I continue to expand it, I also want to establish a concept of confidence within the recommendation engine. Thanks to today's test, I have 100% confidence in the recommendations that are in Regex Hero Professional today.

But there are other recommendations I want to create that I couldn't possibly have such confidence in. For instance, there's the issue of capturing groups vs. non-capturing groups. A very common mistake is a regular expression that uses capturing groups even though the captured values are never used for anything. Non-capturing groups are significantly faster and should be used instead in this scenario. But I have no way of knowing how you're going to use the regular expression, so I can't recommend non-capturing groups with 100% confidence. So I need to create a new type of recommendation that appears differently in Regex Hero to cover scenarios like this. Look for this and more recommendations coming soon.
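A small illustration of why this recommendation can't be fully automatic, sketched in Python with a made-up date pattern (the .NET syntax for non-capturing groups, `(?:...)`, is the same). Both versions match the same text, but the captured values disappear, which matters only if the calling code reads them.

```python
import re

# Hypothetical example pattern: a date with three capturing groups.
capturing = re.compile(r"(\d{4})-(\d{2})-(\d{2})")
# The same pattern with each group made non-capturing.
non_capturing = re.compile(r"(?:\d{4})-(?:\d{2})-(?:\d{2})")

text = "released 2019-10-01"
m1 = capturing.search(text)
m2 = non_capturing.search(text)

# The overall match is identical either way.
same_text = m1.group(0) == m2.group(0)

# The captured values are not: code that reads them would break.
groups_before = m1.groups()
groups_after = m2.groups()
```

Here `groups_before` holds the year, month, and day, while `groups_after` is empty. Whether that difference is harmless depends entirely on what the surrounding code does with the match.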

Popular posts from this blog

Regex Hero for Windows 10 is Underway

A while back I began working on an HTML5 / JavaScript version of Regex Hero. However, it was a huge undertaking, essentially requiring a complete rewrite of the entire application, and I have not had enough time to dedicate to it lately. So I've begun again, this time rewriting Regex Hero to work in WPF. It'll be usable in Windows 10 and downloadable from the Microsoft Store. This is a much easier task that also has the advantage of running the .NET regex library from the application itself. This will allow for the same speedy experience of testing your regular expressions and getting instant feedback that Regex Hero users have always enjoyed. I expect the first release to be ready in Q4 of 2019.

Optimizing Your Regular Expressions

Regular expressions will backtrack. That's an unfortunate thing about them because backtracking can be slow. And in certain (rare) cases the performance can become so awful that executing the regular expression against a relatively short string could take over a minute. There's a good article about catastrophic backtracking over at regular-expressions.info. And today I created a video about all of this called Regex Lesson 5: Optimization. In the video I start with a very poorly written regular expression and make several improvements to it, using the benchmarking feature along the way. By the end of the video I make the regular expression over 3 million times faster. In addition, today's update to Regex Hero provides a little message in the event that you encounter a regular expression that takes over 10 seconds to evaluate... And then last of all, I changed the benchmarking feature a bit. In the past it would simply test your regular expression against

Silverlight 4 Coming in April, or Maybe Sooner

The exact release date has not been announced. But Visual Studio 2010 RTM is coming out in April and I think it's safe to assume that Silverlight 4 will be released no later than that. Each release of Silverlight has brought massive improvements over the previous version. And once again, Silverlight 4 does not disappoint. There is a long list of improvements, but the ones that I think will affect Regex Hero are as follows. RichTextBox: My plan is to use this in place of all 4 major textboxes in Regex Hero. The new RichTextBox has built-in multiple undos & redos, so I can ditch my home-brewed code. It should be nice to use for syntax highlighting for the regular expressions I intend to create. It also has a built-in API to determine the pixel position of the text. I should be able to use this API and build a new highlighting scheme based off of it. This should do a couple things. First, I should be able to finally fix the problem I had with the ScrollViewer and