Relative Content

Tag Archive for parsing

How to support mixed grammars?

I am currently writing a grammar for a programming language (PowerBuilder) and I am also collecting documentation about parsing and compilers creation.

Can the csv format be defined by a regex?

A colleague and I have recently argued over whether a pure regex is capable of fully encapsulating the csv format, such that it is capable of parsing all files with any given escape char, quote char, and separator char.

Does JAXP natively parse HTML?

So, I whip up a quick test case in Java 7 to grab a couple of elements from random URIs, and see if the built-in parsing stuff will do what I need.

design for a parser to handle very large files

I have written a program which records protocol messages between an application and a hardware device which matches each application request with each hardware response. This is so that I can later remove the hardware, connect a ‘replay’ application to the main application and wait for an application request and reply with a matched copy of the requisite hardware reply message.

Persisting natural language processing parsed data

I’ve recently started experimenting with natural language processing (NLP) using Stanford’s CoreNLP, and I’m wondering what are some of the standard ways to store NLP parsed data for something like a text mining application?