Objectives
- To perform fuzzy matching between two data sets easily. The shiny app: https://mhdzahier.shinyapps.io/fuzzy_matching/
- It is using fuzzyjoin package in R. For this shiny app, we are currently using Jaro-Winkler distance method. It will return a number between 0 (exact match) & 1 (completely different)
Applications?
- To match stores between two data sets (Based on the address list)
- To match similarities between strings
Steps
- To upload the first file
- To upload the second file
- Choose column to match respectively
- Click submit button to perform fuzzy matching
- Download matched file
Users may use example files to test the shiny app here github
Examples
Try it here
Future
- To add a feature where the users can specify the max distance
Contact
Please email me at zahiernasrudin@gmail.com for any inquiries.