November 26th, 2002, 03:41 PM
Modify MS Word HTML output
Is there a way to use Visual Basic to modify the HTML docs that MS Word creates?
I am trying to find a way to get MS word to insert somekind of mark up for page breaks in the output HTML document.
November 27th, 2002, 10:11 AM
When MS Word creates a HTML document, it is just a text file. This is easy enough to open up and do what you want with it.
Open your_file for Input as #1...
You have lost me as far as mark up for page breaks.
There aren't page breaks on html pages. It is just one long page. If you want to have some form of page breaks, then you would probably want to seperate the large file into many seperate ones.
November 27th, 2002, 10:34 AM
I probably should have provided more info upfront.
I am working on converting a large legal document from MS Word into HTML (not pdf for this situation). The document is ~1000 pages.
Each chapter of the document is an individual word file. Some of the chapters are 50 pages long. I need to convert each chapter into a series of individual HTML pages rather than one long HTML page.
MS word will convert the chapter into one long HTML file. However, there seems to be no way to have it convert according to page breaks. for example break up a chapter into 5 individual html pages instead of one being 5 printed pages long.
I am hoping I can find a way for MS Word to insert some kind of mark up in the output HTML chapter indicating location of page breaks. Then use a perl script on the server to break the HTML chapter into individual pages.
I'm asking in this board because I assumed there is a way to alter the way Word outputs html using Visual Basic. (I wonder if there is a macro out there that already does this)
November 27th, 2002, 10:56 AM
I am not sure, but you might want to check the MSDN reference.