Why are the arguments for substring functions mismatched?
In many languages, the substring function works like this:
What direction should I consider the offset when searching strings in reverse?
I’m making an indexOfReverse utility function in my C++ program, and I have caught a bit of a snag. Implementing the offset and maxOffset in my indexOf was very intuitive to me.. These start from the beginning of the string (+offset), and go forth to the end of the string (or maxOffset if given).
When should I use StringBuilder or StringBuffer?
In a production web application, my fellow programmers used StringBuffer everywhere. Now I am taking care of application development and corrections. After reading StringBuilder and StringBuffer I have decided to replace all the StringBuffer code with StringBuilder because we don’t need thread safety in our data beans.
Shortest Common Superstring: find shortest string that contains all given string fragments
Given some string fragments, I would like to find the shortest possible single string (“output string”) that contains all the fragments. Fragments can overlap each other in the output string.
Complex string matching with fuzzywuzzy
I’m attempting to write a process that matches obscure strings to a single ‘master string’ for further processing. I have a lot of data that looks something like this:
How to implement string matching based on a pattern
I was asked to build a tool that can identify if a string match a pattern.
Why does Java use UTF-16 for internal string representation?
I would imagine the reason was fast, array like access to the character at index, but some characters won’t fit into 16 bits, so it wouldn’t work…
Looking for a non-cryptographic hash function that returns a single character
Suppose I have a dictionary of ASCII words stored in uppercase. I also want to save those words into separate files so that the total word count of each file is approximately the same. By simply looking at the word I need to know which file it should be in (if it’s there at all). Duplicate words should go into the same file and overwrite the last one.
Finding occurrences of a useful words and phrases in strings
I am building an app that analyzes posts by people by pulling their Tweets and Facebook posts. I need to process all the posts and find useful phrases. What I mean by useful is that, any word or phrase that is a noun/adjective/verb that would represent a discrete object or an idea, or in other words, I am looking for keywords.
Which string search algorithm is actually the fastest?
I have been stuck for some time on which is the fastest string search algorithm, heard many opinions, but in the end I’m not sure.