[TUHS] Re: regex early discussions

4 Mar 2024

On Mon, 4 Mar 2024, 08:27 Rob Pike, &lt;robpike(a)gmail.com&gt; wrote [to Larry]
Oh happy days. Hi Rob, loved the book.
If that's really true, that you learned from Spencer's library, then you
...
  didn't learn the most important thing about them,
which is the automata
 theory that guarantees their performance is always linear. Not to take
 anything away from Henry, who admitted at the time that it could be slow
 for bad expressions, but we're still paying the price for refusing to
 connect "regex" with the theory that created them, ignoring it in fact.

I once got into a bunfight with a Googler on the topic of coding interview
questions, on a related matter. He was promulgating a regular expression to
correctly match/parse-out legitimate dotted-quad IPv4 addresses, including
bounds-checking the octets to be in the range 0..255, and arguing that it
since it was going to be run through a DFA that it was a sunk cost for
efficiency and therefore perfect.
The result looked like line noise, and he was perturbed that I said I would
prefer to take a much simpler (NFA?) RE, parse out the ints and
bounds-check them, just to reduce cognitive load and increase
maintainability of code.
We didn't really come to an agreement.
-a

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

2001

2000

1999

1998

1997

1996

1995

1994

1993

1992

1991

1990

[TUHS] Re: regex early discussions