This project has declared the following modules:
| Name | Description |
|---|---|
| jMatchParser-util | A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-util module contains the utility classes for parsing. |
| jMatchParser-charset | A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-charset module contains the character sets. |
| jMatchParser-jchardet | A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-jchardet module contains a charset provider for a character set that uses jchardet for automatically detecting the charset. |
| jMatchParser-juniversalchardet | A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-juniversalchardet module contains a charset provider for a character set that uses juniversalchardet for automatically detecting the charset. |
| jMatchParser-icu4j-chardet | A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-icu4j-chardet module contains the character sets. |
| jMatchParser-parser | A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-parser module contains the actual parser. |
| jMatchParser-all | A java-based parser for parsing/grabbing web sites and other text or XML documents, based on a nondeterministic parser language, creating XML output. Also contains a few utility classes for HTML, CSV and text parsing, and additional character sets. The jMatchParser-all module contains metadata for building aggregate packages. |