LUCENE: Sorting a tokenized string field ALMOST works perfectly as of version 10.0.0
My team has come across an error that only happens with a particular sort condition.
LUCENE: Sorting a tokenized string field ALMOST works perfectly as of version 10.0.0
My team has come across an error that only happens with a particular sort condition.
Good technique for search text tokenization
We are looking for a way to tokenize some text in the same or similar way as a search engine would do it.
NLP – Queries using semantic wildcards in full text searching, maybe with Lucene?
Let’s say I have a big corpus (for example in english or an arbitrary language), and I want to perform some semantic search on it. For example I have the query:
Lucene + Joins == RDBMS?
Now that Lucene supports joins (at indexing time and at querying time) can one use Lucene as a databse (a NoSQL one, with Eventual Consistency)?
Lucene + Joins == RDBMS?
Now that Lucene supports joins (at indexing time and at querying time) can one use Lucene as a databse (a NoSQL one, with Eventual Consistency)?
Lucene + Joins == RDBMS?
Now that Lucene supports joins (at indexing time and at querying time) can one use Lucene as a databse (a NoSQL one, with Eventual Consistency)?
How is machine learning incorporated into search engine design?
I am currently building a small in-house search engine based on Apache Lucene. Its purpose is simple – based on some keywords, it will suggest some articles written internally within our company. I am using a fairly standard TF-IDF scoring as a base metric and built my own scoring mechanism on top it. All of these seem to be working excellent except for some corner cases where the ranking seems messed up.
How is machine learning incorporated into search engine design?
I am currently building a small in-house search engine based on Apache Lucene. Its purpose is simple – based on some keywords, it will suggest some articles written internally within our company. I am using a fairly standard TF-IDF scoring as a base metric and built my own scoring mechanism on top it. All of these seem to be working excellent except for some corner cases where the ranking seems messed up.
Is lucene.net/solrnet a good solution for searching a list of names with fuzzy matching? [closed]
Closed 10 years ago.