Search & Replace; Converting META NAME Elements
Some times you need to transform each META NAME line into
another form. In following example, we take the header
information from a Microsoft Knowledge Base article and convert
it to XML. Note the conversion is not complete, but it is a
start. Be sure to correct "dangling" content. The regular
expression cannot convert elements in which the content is split
across lines.
FIND:
^[^"]+"\([^"]*\)"[^"]*"\([^"]*\)"*.*
REPLACE:
<\1>\2</\1><BR>
CHANGES:
<TITLE>Q10022 - OFF97: EPS File Is Not Printed or Is Printed
Incorrectly Under Windows NT</TITLE>
<META HTTP-EQUIV="CONTENT-Type" CONTENT="text/html;
charset=iso-8859-1" />
<META HTTP-EQUIV="PICS-Label" CONTENT="(PICS-1.1
"http://www.rsac.org/ratingsv01.html" l gen true comment "RSACi
North America Server" by "Inet@microsoft.com" for "http://support.microsoft.com"
on "1998.02.17T12:28-0800" r (n 0 s 0 v 0 l 0))" />
<META NAME="ms.locale" CONTENT="EN-US" />
<META NAME="Category" CONTENT="Support; KB Article" />
<META NAME="Premium" CONTENT="Support" />
<META NAME="KBArea" CONTENT="Support; KB; word97" />
<META NAME="KBID" CONTENT="Q10022" />
<META NAME="KBTitle" CONTENT="OFF97: EPS File Is Not Printed or
Is Printed Incorrectly Under Windows NT" />
<META NAME="Description" CONTENT="When you print an Encapsulated
PostScript (EPS) file, the file may fail to print or may be
printed incorrectly." />
<META NAME="Product" CONTENT="word97" />
<META NAME="KBCreate" CONTENT="April 6, 1984" />
<META NAME="KBModify" CONTENT="April 28, 1999" />
<META NAME="EditDate" CONTENT="January 22, 1999" />
<META NAME="Question" CONTENT="" />
<META NAME="Versions" CONTENT="WINDOWS:97; winnt:4.0" />
<META NAME="Component" CONTENT="" />
<META NAME="Technology" CONTENT="" />
<META NAME="Links" CONTENT="" />
<META NAME="Keywords" CONTENT="kbprint offwin offprint" />
<META NAME="Platform" CONTENT="WINDOWS winnt" />
<META NAME="Hardware" CONTENT="" />
<META NAME="SolutionType" CONTENT="kbfix" />
<META NAME="IssueType" CONTENT="kbbug" />
<META NAME="BoilerPlate" CONTENT="" />
<META NAME="ProducedView" CONTENT="" />
<META NAME="QUERYWORDS"
CONTENT="EPS">
<META NAME="QUERYWORDS"
CONTENT="filter">
<META NAME="QUERYWORDS"
CONTENT="print">
<META NAME="QUERYWORDS"
CONTENT="garbled">
<META NAME="QUERYWORDS"
CONTENT="does">
<META NAME="QUERYWORDS"
CONTENT="not">
<META NAME="QUERYWORDS"
CONTENT="is">
<META NAME="QUERYWORDS"
CONTENT="printed">
|
<META NAME="QUERYWORDS" CONTENT="does"> |
|
<META NAME="QUERYWORDS" CONTENT="EPS"> |
|
<META NAME="QUERYWORDS" CONTENT="filter"> |
|
<META NAME="QUERYWORDS" CONTENT="garbled"> |
|
<META NAME="QUERYWORDS" CONTENT="is"> |
|
<META NAME="QUERYWORDS" CONTENT="print"> |
|
<META NAME="QUERYWORDS" CONTENT="printed"> |