How to support mixed grammars?
I am currently writing a grammar for a programming language (PowerBuilder) and I am also collecting documentation about parsing and compilers creation.
Programming Language Parser (in Java) – What would be a better design alternative for a special case?
Background I’m currently designing my own programming language as a research project. I have most of the grammar done and written down as context-free grammar, and it should be working as is. – Now I’m working on the actual compiler that should translate the language into x86 binary assembly code, more specifically, I am working […]
parsing terminology: comments+whitespaces vs actual code
In languages like c/c++ spacing and comments are ignored and only actual code gets into compiler.
I’m interested if there is accepted way of naming these two things?
How do I translate user input into a fictitious language?
For experimental reasons, I am trying to convert user input into a fictitious language. All of the translation can be 1:1.
Can the csv format be defined by a regex?
A colleague and I have recently argued over whether a pure regex is capable of fully encapsulating the csv format, such that it is capable of parsing all files with any given escape char, quote char, and separator char.
Does JAXP natively parse HTML?
So, I whip up a quick test case in Java 7 to grab a couple of elements from random URIs, and see if the built-in parsing stuff will do what I need.
design for a parser to handle very large files
I have written a program which records protocol messages between an application and a hardware device which matches each application request with each hardware response. This is so that I can later remove the hardware, connect a ‘replay’ application to the main application and wait for an application request and reply with a matched copy of the requisite hardware reply message.
Persisting natural language processing parsed data
I’ve recently started experimenting with natural language processing (NLP) using Stanford’s CoreNLP, and I’m wondering what are some of the standard ways to store NLP parsed data for something like a text mining application?
Generic rule parser for RPG board game rules – how to do it?
I want to build a generic rule parser for pen and paper style RPG systems. A rule can involve usually 1 to N entities 1 to N roles of a dice and calculating values based on multiple attributes of an entity.
What does it mean that StAX parsers aren’t complete or correct?
I’ve been browsing the net and found theses two slides on site of XOM (XML object model):