Cisco Cisco Collaboration Server Dynamic Content Adapter
DCA 2.01 Administration and Configuration Guide
About the Parsing Engine 104
About the Parsing Engine
The DCA parser is actually a parsing engine comprised of a hierarchical collection of
one or more XML-based parsers. Within the hierarchy, individual parsers extend from
other parsers, inheriting and overriding behavior as needed. Parsers can be targeted
at content based on protocol, host name, port, document path, and content type.
When the DCA server is started, these files are processed through an XML parser
which combines them into a single parsing engine.
one or more XML-based parsers. Within the hierarchy, individual parsers extend from
other parsers, inheriting and overriding behavior as needed. Parsers can be targeted
at content based on protocol, host name, port, document path, and content type.
When the DCA server is started, these files are processed through an XML parser
which combines them into a single parsing engine.
By customizing the parser, you can control the handling of all documents served
through the DCA. Parser customization allows you to:
through the DCA. Parser customization allows you to:
• Specify how documents are parsed;
• Specify behaviors to reformat/modify that content before it is sent back from the
server.
Parser Structure
Each parser is made up of one
or more rules bound into a
single MetaMatch tagset. A
MetaMatch, therefore, is a
wrapper object that corresponds
to a single parser. You can
target a parser for use with a
specific content type, protocol,
host, etc. by setting its
MetaMatch attributes.
or more rules bound into a
single MetaMatch tagset. A
MetaMatch, therefore, is a
wrapper object that corresponds
to a single parser. You can
target a parser for use with a
specific content type, protocol,
host, etc. by setting its
MetaMatch attributes.
Rules are the core elements
that define a parser's behavior.
Each Rule consists of one or
Accept elements -- regular
expressions used to match text
in a parsed document, and one
or more Script elements --
which define actions to be taken
based on the matched text.
that define a parser's behavior.
Each Rule consists of one or
Accept elements -- regular
expressions used to match text
in a parsed document, and one
or more Script elements --
which define actions to be taken
based on the matched text.
The parser includes a published
Java API whose object classes
can be used to extend the script
environment.
Java API whose object classes
can be used to extend the script
environment.