This article is pretty useless and already confirms what we already know. However, I am looking for an actual solution in our google analytics to search/replace landing page urls that end with .html to without for consistency. Some people remember to remove the .html and others do not. As a result we are getting duplicate line items one ending in .html and one without, which skews the data.
Test thoroughly (use the "verify" feature).
Note GA regexen are not properly anchored to URL components (host, path, query) and I find they cannot perform most tasks without some risk of false positives.