ToLower - A HTML Compatibility Tool

Environment: VC6 SP4, Win9x NT4 SP5, Win2k

In General How It Works Usage
Source Code Full Version Credits

ToLower was created when I tried to port a website from a Windows to an LINUX/UNIX platform and experienced some annoying compatibility problems.

This is due to the Windows file system's habit, not to distinguish between lower and upper case file names (it's "case insensitive") and that it allows characters (e. g. "?", "!" and "-") that have a special function in UNIX and are therefore reserved.

ToLower parses one or more HTML-files that are specified on its command-line for regular expressions (regex courtesy of Henry Spencer, some slight cosmetic changes to the code done by me) that contain <a href="filename.htm"> Text </a>, <img src="filename.htm"> or <body backgound="filename.htm"> Text </body> - tags, corrects the case and other extras that could affect compatibility.

After that, all referenced files (respective their names) are adapted according to the changes done in the HTML file.

Finally, all other files in the path of the specified HTML-file(s) are renamed to their lower-case (compatibility) file names to avoid ambiguities.

The ToLower source code contains some interesting and recyleable bits of code that I thought I should share with you:

  • CStrList Class for generating a list of strings. Has various insertion, appending, sorting, searching and deleting functions. Files: cstrlist.h, cstrlist.cpp
  • regex
  • A full featured regular expression searcher. Necessary files: cclass.h, cname.h, engine.c, regcomp.c, regerror.c, regex.h, regex2.h, regexec.c, regfree.c, utils.h. Usage is pretty self-explaining (take a look at the
    // search for regular expression that

    -section of tolower.cpp.

  • str2list
  • Converts a delimited string to a list of elements. Useful when parsing command lines. Files: str2list.h, str2list.cpp and the CStrList-class.
  • TempNamA
  • Creates unique temporary file name. Files: tempnama.h, tempnama.cpp.
The usage of ToLower is simple:

Just list every HTML-file that has to be checked/changed on the command line. To change all files in the path, we would do a

dir /s /b *.htm *.html > tolower.bat

and insert a

tolower

prior to each file name (don't forget the space char after "ToLower"). A call to

ToLower.bat

does the trick!

Of course, ToLower supports absolute and relative paths.

Credits: My credits go to Henry Spencer for the regular expression search engine, to Chris Windmüller for CStrList and all the other open source programmers that are never mentioned in any commercial projects.

If you have any comments, improvements or complaints - feel free to send me an email. Have fun!

Volker.

Downloads

Complete VC6 project: ToLowerSrc.zip (50 kBytes)
Compiled executable: ToLowerExe.zip (80 kBytes)


Comments

  • There are no comments yet. Be the first to comment!

Leave a Comment
  • Your email address will not be published. All fields are required.

Top White Papers and Webcasts

  • Managing your company's financials is the backbone of your business and is vital to the long-term health and viability of your company. To continue applying the necessary financial rigor to support rapid growth, the accounting department needs the right tools to most efficiently do their job. Read this white paper to understand the 10 essentials of a complete financial management system and how the right solution can help you keep up with the rapidly changing business world.

  • Live Event Date: August 20, 2014 @ 1:00 p.m. ET / 10:00 a.m. PT When you look at natural user interfaces as a developer, it isn't just fun and games. There are some very serious, real-world usage models of how things can help make the world a better place – things like Intel® RealSense™ technology. Check out this upcoming eSeminar and join the panel of experts, both from inside and outside of Intel, as they discuss how natural user interfaces will likely be getting adopted in a wide variety …

Most Popular Programming Stories

More for Developers

Latest Developer Headlines

RSS Feeds