Which algorithms are supported?

Levenshtein distance (character edits), Jaccard similarity (set overlap), cosine similarity (word vector), and n-gram similarity. Each serves different use cases.

Levenshtein for typo detection, Jaccard for plagiarism/overlap, cosine for semantic-ish comparison of shorter texts. N-gram for phrase matching.

Is this semantic comparison?

No — these are syntactic algorithms. For semantic similarity (meaning, not spelling), use embeddings (OpenAI, Cohere) — a different tool entirely.

Does it work for code?

Yes — Levenshtein is fine for small code snippets. For real diff-style comparison of code, use text-diff or git diff.

Text Similarity Scorer | DevTools Surf

DevTools Surf

About Text Similarity Scorer

Text Similarity Scorer preview - Fun / Niche tool

Calculate similarity between two text strings (Levenshtein, Jaccard). Part of the DevTools Surf developer suite. Browse more tools in the Fun / Niche collection.

Use Cases

Tune fuzzy search thresholds for product catalogs
Evaluate duplicate detection accuracy in data pipelines
Compare OCR output against ground truth text
Test string matching algorithms for deduplication systems

Tips

Compare Levenshtein and Jaccard scores for different perspectives
Test fuzzy matching thresholds for search implementations
Paste two strings to quantify how different they are

Fun Facts

Vladimir Levenshtein published his edit distance algorithm in 1965, originally for correcting deletion and insertion errors in binary codes.
Paul Jaccard introduced his similarity coefficient in 1901 to compare the flora of different Alpine regions in Switzerland.
Spell checkers typically suggest corrections for words within a Levenshtein distance of 2, which covers 95% of common typos.

FAQ

Which algorithms are supported?: Levenshtein distance (character edits), Jaccard similarity (set overlap), cosine similarity (word vector), and n-gram similarity. Each serves different use cases.
Which should I use?: Levenshtein for typo detection, Jaccard for plagiarism/overlap, cosine for semantic-ish comparison of shorter texts. N-gram for phrase matching.
Is this semantic comparison?: No — these are syntactic algorithms. For semantic similarity (meaning, not spelling), use embeddings (OpenAI, Cohere) — a different tool entirely.
Does it work for code?: Yes — Levenshtein is fine for small code snippets. For real diff-style comparison of code, use text-diff or git diff.

Related Fun / Niche Tools

Emoji / Unicode Lookup NATO Phonetic Alphabet Morse Code ↔ Text Roman Numeral Converter Number → Words Leet Speak ↔ Plain Slug Generator Zero-Width Character Detector