Zhaba Zhournal
Friday, October 10, 2003 
Editing geekery 
Okay, this is how much of a regular-expression geek I am: I have previously had dreams in which I did regular expression searches, but last night I had one that was so accurate I was able to use it, to good effect, at work this morning. (Warning: This next bit probably doesn't make much/any sense if you don't use regular expressions. But I'm pleased with myself, so I'm writing it anyway.)

<geek>

I'd been searching the files I'm editing for HTML character entities—&eacute; instead of é, for instance. So I searched for this:
&[^&;]+;
Except that kept bringing up every line that had an ampersand and a semicolon, including the lines with bizarre hidden Word coding.

(This, for instance: T]âÞ?CæÍ@¨Ž¡T0æ?}GÕ ÕÚ¡øEtó;ðoKôv(úÄÔÜÜÃÞ>DÇP/·æi¼l?Ds Å&ÑÍÚyÂ+Ç;D¦}ú‚ What the hell is that?)

And I wound up with the dismaying message "Found 109 occurrence(s) in 32 file(s)"—a pain in the neck to individually examine. Feh.

But in my dream, it occurred to me to limit the size of the string: &[^&;]{2,8}; I.e., to only catch strings with two to eight characters between the & and the ;. And I managed to remember it, and I tried it when I got to work this morning. And not only did it work, I was actually correct about the number of characters in a legitimate HTML entity. (In general, the longest ones have six characters, but in the Greek language set there's one with eight characters: &thetasym; I don't know if that will show up if I try to put it in here, though. ϑ—does that work on anyone's browser?)

Anyway, I ran that search, and got this: "Found 5 occurrence(s) in 5 file(s)". Hey, I like those numbers a lot better...

</geek>

And now, back to my regularly scheduled editing...

[ at 10:02 AM • by Abby • permalink  ]




Yes, that's me.
archives:
2003
AprMayJunJulAugSepOctNovDec

2004
JanFebMarAprMayJunJulAugSepOctNovDec

2005
JanFebhome (i.e. right now)

me-stuff:
about zhaba
what the heck is "zhaba"?
amazon wish list

feed me
  Subscribe with Bloglines

who's got the button?
sign my guestmap  
 
join us
Get Firefox   Get Thunderbird
countdown
days till the next U.S. presidential election

blogrolls:
blogs
>=recently updated
>Blogroll Me!<

amusements


news




even more buttons
Feedback by backBlog


© 2003–05 Zhaba Productions, so don't steal anything.