|
|||||||||
PREV NEXT | FRAMES NO FRAMES |
AcroForm
.interactive forms
.Form
implementation that represents the
contents of an interactive PDF form,
available via PDFTextStream.getFormData()
.AcroForm
instances.interactive forms
.Page
.AcroButtonField.getButtonType()
when a AcroButtonField
represents
a checkbox.
AcroButtonField.getButtonType()
when a AcroButtonField
represents
a pushbutton.
AcroButtonField.getButtonType()
when a AcroButtonField
represents
a group of radio buttons.
AcroChoiceField.getWidgetType()
, indicating that an AcroChoiceField
is representing a drop-down combo box.
AcroChoiceField.getWidgetType()
indicating that an AcroChoiceField
is representing a list that can potentially allow multiple values to be selected.
Page.addColumnPartition(int)
.
Page.addColumnPartition(int)
.
Page.addColumnPartition(int)
.
com.snowtide.pdf.annot
package contains interfaces and classes that PDFTextStream uses to represent
various types of annotations present in PDF documents.com.snowtide.pdf.forms
package is home to a variety of classes that support PDFTextStream's form
extraction functionality.com.snowtide.pdf.lucene
package provides a method of seamless integration between the
Apache Lucene full-text indexing and search engine available
for the Java environment.String
s, one for each Table
found in the given
page, using the specified character as a cell delimiter and the specified String as a linebreak sequence.
convertTablesToCSV(Page, char, String)
with the specified page, cell delimiter, and
a single linefeed (\n) as a linebreak sequence.
convertToCSV(Table, char, String)
with the specified Table
, cell delimiter,
and a single linefeed (\n) as a linebreak sequence.
Table
, using
the specified character as a cell delimiter and the specified String as a linebreak
sequence.
Block
instance that contains only the content held by
this Block
instance that intersects the given "query" area.
Page
instance that contains only the content held by
this Page
instance that intersects the given "query" area.
PDFDocumentFactory
class.OutputTarget.OutputTarget(Appendable)
Form
.OutputTarget.OutputTarget(Writer)
instead
Annotation
implementation that represents
a PDF annotation of type FreeText
.Table
instances that are available in the given Page
.
Annotation
interface.
TextUnit.getCharCode()
.
PDFTextStreamConfig
instance that this OutputTarget
is currently using.
PDFTextStreamConfig
instance provided to this page by its parent PDFTextStream
instance.
PDFTextStreamConfig
instance that this PDFTextStream
instance is using
to govern its operation.
Page.getPageHeight()
and Page.getPageWidth()
.
PDFTextStream
instances use by default (which is settable
via PDFTextStreamConfig.setDefaultConfig(PDFTextStreamConfig)
.
Bookmark
.
java.util.List
of the possible export values of this button field, as dictated by the
field dictionary's /Opt entry.
AcroFormField
that has the given field name, as provided by the
AcroFormField.getFullName()
function.
AcroFormField.getFullName()
)
of all of the form fields held by this form.
Enumeration
of the field names of all of the fields held by this form.
Font
that was in force when this textunit was outputted.
Form
object
that represents that data.
Line
at the given index.
OutputTarget
(and its subclasses) output for each linebreak identified in
extracted PDF content.
Line
s held by this Block.
Line
s held by this Block.
PDFDocumentFactory
class using this config object.
Table
.
pdfName
String that was provided if this instance was created
with an InputStream constructor.
java.io.Writer
or java.lang.Appendable
.
getValue()
) as keys mapped to
display values (strings that are returned by AcroChoiceField.getDisplayValue(String)
, given
a valid export value).
RegionOutputTarget.addRegion(float, float, float, float, String)
.
i-th
region that was registered with this RegionOutputTarget.
FreeTextAnnotation.getRichContent()
function when that content is displayed.
BlockParent
instance containing the Block
s that form the
indicated row number.
VisualOutputTarget
.
PDFTextStream
instance from which this Page was sourced.
TextUnit
at the specified index.
TextUnit
s in this Line.
TextUnit
s held by this Line.
link's annotation action type
is 'URI'.
AcroChoiceField.CHOICE_TYPE_COMBO
(indicating that this choice widget is a dropdown,
single-selection field), or AcroChoiceField.CHOICE_TYPE_LIST
, indicating that this choice widget is a
multiple-row, possibly multiple-selection field).
packetName
in this form.
Set
of the packet names of the XFA data.
XMLOutputTarget
as a String
.
XMLOutputTarget
is building.
AcroTextField.getRichTextValue()
function.
PDFDocumentFactory
using this config object will be indexed.
PDFDocumentFactory
using this config object will be indexed.
VisualOutputTarget
will include rotated TextUnit
s in its output
(true by default).
FormField
instances held by this Form.
Block
.linebreak String provided by
the current configuration
) to the java.io.Writer
or java.lang.Appendable
object that this OutputTarget wraps.
Annotation
implementation that represents
a PDF annotation of type Link
.LogFactory
implementation.LoggingRegistry
class)
must implement this interface.OutputHandler
implementation that provides a common output
interface for Writer
and Appendable
instances (such as
StringBuilder
s and StringBuffer
s), allowing
PDFTextStream to easily redirect output to either type of object.java.io.Writer
instance.
java.lang.Appendable
instance.
java.util.Date objects
.PDFTextStream
gives your Java, .NET, and Python applications the ability to:
Extract text and metadata from PDF documents (including metadata like XMP data, bookmarks, and annotations)
Extract and update interactive AcroForm data
Merge PDF documents
Instances of this class can either access a PDF file directly, or process equivalent data
delivered via a java.io.InputStream
or java.nio.ByteBuffer
.PDFTextStream.getPDFVersion()
.OutputHandler
.
OutputHandler
.
OutputHandler
implementation.
OutputHandler
.
OutputHandler
implemenation is used to selectively extract text from certain regions of each PDF page.VisualOutputTarget
to lay out the text extracted for each region.
Log
objects.
TextUnit
at the specified index, and returns it.
Line
at the given index from this Block, and returns it.
PDFTextStreamConfig
instance this OutputTarget
should use.
PDFTextStreamConfig
instance that this PDFTextStream
instance will
use in various contexts to govern its operation.
copyAllPDFAttrs
attribute.
PDFTextStream
instances use by default.
VisualOutputTarget
will include rotated TextUnit
s in its output
(true by default).
OutputTarget
(and its subclasses) output for each linebreak identified in
extracted PDF content.
Table
.
VisualOutputTarget
.
java.io.Writer
or java.lang.Appendable
object that this OutputTarget wraps.
PDFDocumentFactory
using this config object will be stored.
PDFDocumentFactory
using this config object will be stored.
Table
s.Annotation
implementation that represents
a PDF annotation of type Text
.Line
.TextUnit
instance.
TextUnit
instance to the java.io.Writer
or java.lang.Appendable
object that this OutputTarget wraps.
PDFDocumentFactory
using this config object will be tokenized.
PDFDocumentFactory
using this config object will be tokenized.
AcroForm.writeUpdatedDocument(OutputStream)
, except
the PDF document data is written to the path specified by the given File object.
via command line
), and builds up an XML DOM Document instance containing the
extracted form data.OutputHandler
implementation that builds up
a DOM XML model of extracted PDF content.XMLOutputTarget
.
|
|||||||||
PREV NEXT | FRAMES NO FRAMES |