
Ha, good one. I implemented Levenshtein distance in Apps Script the other day to help some poor colleagues who were doing manual data cleaning tasks.
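
For anyone curious what that looks like, here is a minimal sketch of the standard dynamic-programming version. It is written in Python rather than Apps Script, and the function name `levenshtein` is just an illustrative choice, not the code I actually shipped.

```python
def levenshtein(a: str, b: str) -> int:
    """Return the minimum number of single-character insertions,
    deletions, and substitutions needed to turn a into b."""
    # Keep the shorter string in the inner loop so the rows stay small.
    if len(a) < len(b):
        a, b = b, a

    # previous[j] = distance between the empty prefix of a and b[:j]
    previous = list(range(len(b) + 1))

    for i, char_a in enumerate(a, start=1):
        # Distance between a[:i] and the empty prefix of b is i deletions.
        current = [i]
        for j, char_b in enumerate(b, start=1):
            substitution_cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,                       # delete char_a
                current[j - 1] + 1,                    # insert char_b
                previous[j - 1] + substitution_cost,   # substitute or match
            ))
        previous = current

    return previous[-1]
```

Only two rows of the table are kept at a time, so memory grows with the shorter string rather than with the product of both lengths.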


Amazing. I spared my colleagues some hell by implementing a simple counter in VBA. My most prized coding contribution: 8 lines that turned hell (filtering 1000 rows in Excel to find duplicates) into fun.
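
Roughly the idea, as a hedged sketch in Python rather than the original VBA: count how often each value occurs and keep the ones seen more than once. The name `find_duplicates` and the trim-and-lowercase normalisation are assumptions for illustration, not what the original macro did.

```python
from collections import Counter


def find_duplicates(values):
    """Return a dict mapping each normalised value that occurs more
    than once to the number of times it occurs."""
    # Assumption: ignore surrounding whitespace and letter case.
    counts = Counter(value.strip().lower() for value in values)
    return {value: count for value, count in counts.items() if count > 1}
```

For example, `find_duplicates(["Acme", "acme ", "Globex"])` reports `acme` as appearing twice.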


This is how I started as well. But we had too many typos so I had to find something a little bit 'smarter'.

Next thing on my list is to implement tf–idf to help with free text categorisation.
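
What I have in mind is something like the sketch below, in Python for brevity. It uses one common tf-idf variant (term frequency as a share of the document's tokens, idf as the log of total documents over documents containing the term); whitespace tokenisation and the function name `tfidf` are assumptions, and real free text would need better tokenising.

```python
import math
from collections import Counter


def tfidf(documents):
    """Return one dict per document mapping each term to its tf-idf score."""
    # Assumption: lowercase and split on whitespace as a crude tokeniser.
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)

    # Document frequency: in how many documents does each term appear?
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))

    scores = []
    for tokens in tokenized:
        term_counts = Counter(tokens)
        total = len(tokens)
        scores.append({
            term: (count / total) * math.log(n_docs / doc_freq[term])
            for term, count in term_counts.items()
        })
    return scores
```

Terms that show up in every document score zero, so the distinctive words in each entry are the ones that stand out for categorisation.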


Sounds great! Feel free to adapt this to your use case, if useful: https://github.com/yesbabyyes/tfidf


I implemented Levenshtein a few times at that job. Indeed, anything with operator input will have errors.



