|
|||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||
AcroForm.interactive forms.Form implementation that represents the
contents of an interactive PDF form,
available via PDFTextStream.getFormData().AcroForm instances.interactive forms.Page.AcroButtonField.getButtonType() when a AcroButtonField represents
a checkbox.
AcroButtonField.getButtonType() when a AcroButtonField represents
a pushbutton.
AcroButtonField.getButtonType() when a AcroButtonField represents
a group of radio buttons.
AcroChoiceField.getWidgetType(), indicating that an AcroChoiceField
is representing a drop-down combo box.
AcroChoiceField.getWidgetType() indicating that an AcroChoiceField
is representing a list that can potentially allow multiple values to be selected.
com.snowtide.pdf.annot package contains interfaces and classes that PDFTextStream uses to represent
various types of annotations present in PDF documents.com.snowtide.pdf.forms package is home to a variety of classes that support PDFTextStream's form
extraction functionality.com.snowtide.pdf.lucene package provides a method of seamless integration between the
Apache Lucene full-text indexing and search engine available
for the Java environment.Strings, one for each Table found in the given
page, using the specified character as a cell delimiter and the specified String as a linebreak sequence.
convertTablesToCSV(Page, char, String) with the specified page, cell delimiter, and
a single linefeed (\n) as a linebreak sequence.
convertToCSV(Table, char, String) with the specified Table, cell delimiter,
and a single linefeed (\n) as a linebreak sequence.
Table, using
the specified character as a cell delimiter and the specified String as a linebreak
sequence.
Block instance that contains only the content held by
this Block instance that intersects the given "query" area.
Page instance that contains only the content held by
this Page instance that intersects the given "query" area.
PDFDocumentFactory class.OutputTarget.OutputTarget(StringBuffer)
Form.OutputTarget.OutputTarget(Writer) instead
Annotation implementation that represents
a PDF annotation of type FreeText.Table instances that are available in the given Page.
Annotation interface.
TextUnit.getCharCode().
PDFTextStreamConfig instance that this OutputTarget is currently using.
PDFTextStreamConfig instance provided to this page by its parent PDFTextStream
instance.
PDFTextStreamConfig instance that this PDFTextStream instance is using
to govern its operation.
Page.getPageHeight() and Page.getPageWidth().
PDFTextStream instances use by default (which is settable
via PDFTextStreamConfig.setDefaultConfig(PDFTextStreamConfig).
Bookmark.
java.util.List of the possible export values of this button field, as dictated by the
field dictionary's /Opt entry.
AcroFormField that has the given field name, as provided by the
AcroFormField.getFullName() function.
AcroFormField.getFullName())
of all of the form fields held by this form.
Enumeration of the field names of all of the fields held by this form.
Font that was in force when this textunit was outputted.
Form object
that represents that data.
Line at the given index.
OutputTarget (and its subclasses) output for each linebreak identified in
extracted PDF content.
Lines held by this Block.
Lines held by this Block.
PDFDocumentFactory class using this config object.
Table.
pdfName String that was provided if this instance was created
with an InputStream constructor.
java.io.Writer or java.lang.StringBuffer.
getValue()) as keys mapped to
display values (strings that are returned by AcroChoiceField.getDisplayValue(String), given
a valid export value).
RegionOutputTarget.addRegion(float, float, float, float, String).
i-th region that was registered with this RegionOutputTarget.
FreeTextAnnotation.getRichContent() function when that content is displayed.
BlockParent instance containing the Blocks that form the
indicated row number.
VisualOutputTarget.
PDFTextStream instance from which this Page was sourced.
TextUnit at the specified index.
TextUnits in this Line.
TextUnits held by this Line.
link's annotation action type is 'URI'.
AcroChoiceField.CHOICE_TYPE_COMBO (indicating that this choice widget is a dropdown,
single-selection field), or AcroChoiceField.CHOICE_TYPE_LIST, indicating that this choice widget is a
multiple-row, possibly multiple-selection field).
packetName
in this form.
Set of the packet names of the XFA data.
XMLOutputTarget as a String.
XMLOutputTarget is building.
AcroTextField.getRichTextValue() function.
PDFDocumentFactory using this config object will be indexed.
PDFDocumentFactory using this config object will be indexed.
VisualOutputTarget will include rotated TextUnits in its output
(true by default).
FormField instances held by this Form.
Block.linebreak String provided by
the current configuration) to the java.io.Writer or java.io.StringBuffer
object that this OutputTarget wraps.
Annotation implementation that represents
a PDF annotation of type Link.LogFactory implementation.LoggingRegistry class)
must implement this interface.OutputHandler implementation that provides a common output
interface for Writer and StringBuffer instances, allowing
PDFTextStream to easily redirect output to either type of object.java.io.Writer instance.
java.lang.StringBuffer instance.
java.util.Date objects.PDFTextStream gives your Java, .NET, and Python applications the ability to:
Extract text and metadata from PDF documents (including metadata like XMP data, bookmarks, and annotations)
Extract and update interactive AcroForm data
Merge PDF documents
Instances of this class can either access a PDF file directly, or process equivalent data
delivered via a java.io.InputStream or java.nio.ByteBuffer.PDFTextStream.getPDFVersion().OutputHandler.
OutputHandler.
OutputHandler implementation.
OutputHandler.
OutputHandler implemenation is used to selectively extract text from certain regions of each PDF page.VisualOutputTarget to lay out the text extracted for each region.
Log objects.
TextUnit at the specified index, and returns it.
Line at the given index from this Block, and returns it.
PDFTextStreamConfig instance this OutputTarget should use.
PDFTextStreamConfig instance that this PDFTextStream instance will
use in various contexts to govern its operation.
copyAllPDFAttrs attribute.
PDFTextStream instances use by default.
VisualOutputTarget will include rotated TextUnits in its output
(true by default).
OutputTarget (and its subclasses) output for each linebreak identified in
extracted PDF content.
Table.
VisualOutputTarget.
java.io.Writer or java.io.StringBuffer
object that this OutputTarget wraps.
PDFDocumentFactory using this config object will be stored.
PDFDocumentFactory using this config object will be stored.
Tables.Annotation implementation that represents
a PDF annotation of type Text.Line.TextUnit instance.
TextUnit instance to the java.io.Writer or java.io.StringBuffer
object that this OutputTarget wraps.
PDFDocumentFactory using this config object will be tokenized.
PDFDocumentFactory using this config object will be tokenized.
AcroForm.writeUpdatedDocument(OutputStream), except
the PDF document data is written to the path specified by the given File object.
via command line), and builds up an XML DOM Document instance containing the
extracted form data.OutputHandler implementation that builds up
a DOM XML model of extracted PDF content.XMLOutputTarget.
|
|||||||||
| PREV NEXT | FRAMES NO FRAMES | ||||||||