Separating words in a string
How do I separate words in a string?
Persisting natural language processing parsed data
I’ve recently started experimenting with natural language processing (NLP) using Stanford’s CoreNLP, and I’m wondering what are some of the standard ways to store NLP parsed data for something like a text mining application?
NLP – Queries using semantic wildcards in full text searching, maybe with Lucene?
Let’s say I have a big corpus (for example, in English or an arbitrary language), and I want to perform some semantic search on it. For example, I have the query:
Tasks incorrectly categorized with online text classifiers
Context: Finding company to do the job
How can I distinguish between one-worded expressions and two-worded expressions?
I was told to create two lists of the most frequently used words in a plain text (10 MB of arbitrary text): monograms (single-word expressions such as human, water, is) and bigrams (two-word expressions such as basketball team, united states, etc.)
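One possible way to build the two lists is to count single tokens and adjacent token pairs separately. The following is only a minimal sketch: the helper names `tokenize` and `top_ngrams`, the regex-based tokenizer, and the sample sentence are all assumptions made for illustration, not part of the original question. Note that it counts every adjacent word pair, so it does not by itself decide whether a pair is a meaningful expression.

```python
import re
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase the text and keep runs of
    # ASCII letters and apostrophes. Real text may need something richer.
    return re.findall(r"[a-z']+", text.lower())


def top_ngrams(tokens, n, k=10):
    # Slide a window of size n over the token list and count each window.
    # zip() stops at the shortest slice, so only full windows are counted.
    grams = zip(*(tokens[i:] for i in range(n)))
    return Counter(grams).most_common(k)


# Illustrative sample text (an assumption, not data from the question).
sample = "The basketball team met the other basketball team in the United States."
tokens = tokenize(sample)

unigrams = top_ngrams(tokens, 1, k=3)  # single-word counts
bigrams = top_ngrams(tokens, 2, k=3)   # adjacent two-word counts

print(unigrams)
print(bigrams)
```

For a 10 MB file the same functions could be applied to the file contents read in one go, since the token list should fit in memory at that size; filtering out pairs that are frequent only by chance would be a separate step.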
How to do simple natural language categorization [closed]
Closed 8 years ago.
Sentence Tree vs. Words List
I was recently tasked with building a Named Entity Recognizer as part of a project. The objective was to parse a given sentence and come up with all the possible combinations of the entities.