GoogleHTMLOutputHandler (PDFxStream API Reference)

java.lang.Object
- com.snowtide.pdf.OutputHandler
- - pdfts.examples.GoogleHTMLOutputHandler

```
public class GoogleHTMLOutputHandler
extends OutputHandler
```
This class is an example OutputHandler implementation that builds an XHTML document to mimic the HTML view that Google offers for indexed PDF documents.

Source for this class is included in every PDFxStream bundle.

Version:

©2004-2014 Snowtide

Constructor Summary

Constructors
Constructor and Description

GoogleHTMLOutputHandler()

Constructors
Constructor and Description
`GoogleHTMLOutputHandler()`

Method Summary

All Methods Static Methods Instance Methods Concrete Methods Deprecated Methods
Modifier and Type	Method and Description
`void`	`endPage(Page page)` Invoked when PDFxStream has finished processing a page
`org.w3c.dom.Document`	`getHTMLDocument()` Returns the XHTML document that is built up by this OutputHandler.
`static void`	`main(java.lang.String[] args)` Deprecated. Command-line usage of this class may be moved or removed in future PDFxStream releases.
`void`	`startPage(Page page)` Invoked when a page is about to be processed.
`void`	`startPDF(java.lang.String pdfName, java.io.File pdfFile)` Invoked when a new PDF is about to be processed.
`void`	`textUnit(TextUnit tu)` Invoked when a run of characters is to be outputted, as represented by the given `TextUnit` instance.

Methods inherited from class com.snowtide.pdf.OutputHandler
endBlock, endLine, endPDF, linebreaks, spaces, startBlock, startLine

- Constructor Detail
  - GoogleHTMLOutputHandler
```
public GoogleHTMLOutputHandler()
```
- Method Detail
  - main
```
public static void main(java.lang.String[] args)
```
    Deprecated. Command-line usage of this class may be moved or removed in future PDFxStream releases.
    Main method for command-line execution. Usage:
```
java GoogleHTMLOutputHandler [input_pdf_file] [output_html_path]
```
  - getHTMLDocument
```
public org.w3c.dom.Document getHTMLDocument()
```
    Returns the XHTML document that is built up by this OutputHandler.
  - startPage
```
public void startPage(Page page)
```
    Description copied from class: OutputHandler
    
    Invoked when a page is about to be processed.
    
    Overrides:
    
    startPage in class OutputHandler
    
    Parameters:
    
    page - a reference to the Page that is about to be processed
  - endPage
```
public void endPage(Page page)
```
    Description copied from class: OutputHandler
    
    Invoked when PDFxStream has finished processing a page
    
    Overrides:
    
    endPage in class OutputHandler
    
    Parameters:
    
    page - a reference to the Page that has been processed
  - startPDF
```
public void startPDF(java.lang.String pdfName,
                     java.io.File pdfFile)
```
    Description copied from class: OutputHandler
    
    Invoked when a new PDF is about to be processed.
    
    Overrides:
    
    startPDF in class OutputHandler
    
    Parameters:
    
    pdfName - the 'name' of the PDF document, as provided by Document.getName() }
    
    pdfFile - the file reference PDFxStream is about to begin processing. This reference may be null if the source Document is not reading from a File or InputStream.
  - textUnit
```
public void textUnit(TextUnit tu)
```
    Description copied from class: OutputHandler
    
    Invoked when a run of characters is to be outputted, as represented by the given TextUnit instance.
    
    Overrides:
    
    textUnit in class OutputHandler

Class GoogleHTMLOutputHandler

Constructor Summary

Method Summary

Methods inherited from class com.snowtide.pdf.OutputHandler

Constructor Detail

GoogleHTMLOutputHandler

Method Detail

main

getHTMLDocument

startPage

endPage

startPDF

textUnit