XML SAX Parser for file endpoint
Use the XML SAX Parser to read large XML documents that the DOM-based XML Parser cannot handle because of memory constraints. The XML SAX Parser is based on the Apache Xerces library.
The XML SAX Parser extracts data that is enclosed within the Group tag that you specify in the configuration. It creates an entry with the attributes that are present in the data. To access the XML SAX Parser configuration parameters:
- Add a File endpoint.
- On the File endpoint configuration page, click Parser and select XML SAX Parser from the list.
- Expand the Parser section to view the parameters.
Parameters
- Group Tag
- Names of one or more XML group tags that enclose the entries. We can specify multiple tags by separating each tag name with a comma. If you do not specify a value, the root tag is used and the entire XML document is returned as a single entry.
- Comment
- Add your comments. The comment is not considered while parsing data.
- Detailed Log
- Select this check box to generate log messages with detailed debug information.
We can also configure the following advanced parameters. Under the Parser section, expand Advanced to view these parameters.
- Remove prefix
- Specify the prefix that you want to remove from the attribute names.
- Ignore Attributes
- Select this check box to ignore the attributes of the group tag and its child attributes.
- Character Encoding
- Character encoding to use for reading or writing. The default value is UTF-8.
- Document Validation
- Select this check box to request file validation that is based on the specified DTD or schema.
- Use XSD Validation
- Select this check box to use XSD instead of DTD to validate the XML file.
- Namespace Aware
- Select this check box to indicate the parser must request a namespace-aware parser.
- Read Timeout
- Number of seconds after which the parser stops if no data is received.
For detailed information about the XML SAX Parser and examples of its usage, go to the SDI documentation and search for XML SAX Parser.
Parent topic:
File parsers