Shiny App: Fuzzy Matching

A Shiny app for fuzzy matching between two data sets

shiny
Author

Zahier Nasrudin

Published

November 30, 2022

Objectives

  1. To make fuzzy matching between two data sets easy. The Shiny app is available at https://mhdzahier.shinyapps.io/fuzzy_matching/
  2. The app uses the fuzzyjoin package in R and currently applies the Jaro-Winkler distance method, which returns a number between 0 (exact match) and 1 (completely different).
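To give a feel for the 0 to 1 scale, here is a minimal sketch that uses the stringdist package, which fuzzyjoin relies on for its string distance joins. The helper name `jw` and the prefix weight `p = 0.1` are assumptions made for illustration; the app's actual settings are not stated here.

```r
library(stringdist)

# Jaro-Winkler distance: 0 means identical, 1 means nothing in common.
# p is the Winkler prefix weight; with p = 0 this reduces to plain Jaro distance.
# (p = 0.1 is an illustrative choice, not necessarily what the app uses.)
jw <- function(a, b) stringdist(a, b, method = "jw", p = 0.1)

jw("MARTHA", "MARTHA")  # identical strings
jw("MARTHA", "MARHTA")  # two letters swapped, so the distance is small
jw("MARTHA", "XYZ")     # no characters in common
```

The first call returns 0, the last returns 1, and the swapped-letter case lands close to 0.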

Applications

  1. Matching stores between two data sets (based on their address lists)
  2. Finding similar strings across two lists

Steps

  1. Upload the first file
  2. Upload the second file
  3. Choose the column to match from each file
  4. Click the submit button to perform fuzzy matching
  5. Download the matched file

Example files for testing the Shiny app are available here: github

Examples

Upload the first file

Upload the second file

Select the columns and click submit to perform fuzzy matching. The output contains a “TO MATCH” column (the selected column from the first file, renamed), a “MATCHED” column (the selected column from the second file, renamed) and the distance between each pair.
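For readers who want to reproduce this kind of output in plain R, the following is a rough sketch with fuzzyjoin, not the app's actual code. The sample addresses, the column names `store` and `address`, and the `max_dist = 0.3` cutoff are all made up for illustration.

```r
library(fuzzyjoin)
library(dplyr)

# Made-up sample data standing in for the two uploaded files
first  <- tibble(store = c("12 Jalan Ampang", "7 Lorong Maarof"))
second <- tibble(address = c("12 Jln Ampang", "7 Lrg Maarof", "99 Jalan Tun Razak"))

matched <- stringdist_left_join(
  first, second,
  by = c("store" = "address"),   # column chosen from each file
  method = "jw",                 # Jaro-Winkler family in stringdist
  max_dist = 0.3,                # illustrative cutoff (assumption)
  distance_col = "distance"      # keep the computed distance in the output
) %>%
  rename(`TO MATCH` = store, MATCHED = address)

matched
```

Each row pairs a value from the first file with a candidate from the second file whose distance falls within the cutoff; rows from the first file without any candidate keep `NA` in the “MATCHED” and distance columns.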

Download file

Try it here

Future

  1. Add a feature that lets users specify the maximum distance
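One possible shape for such a feature is a numeric input in the UI. This is only a sketch: the input id `max_dist`, the label and the default value are hypothetical and not part of the current app.

```r
library(shiny)

# Hypothetical input control; the id "max_dist" and the default of 0.2 are illustrative.
max_dist_input <- numericInput(
  inputId = "max_dist",
  label   = "Maximum distance (0 = exact match, 1 = completely different)",
  value   = 0.2,
  min     = 0,
  max     = 1,
  step    = 0.05
)

# In the server function, the chosen value could then be handed to the join,
# for example as the max_dist argument of a fuzzyjoin string distance join:
#   max_dist = input$max_dist
```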

Contact

Please email me at zahiernasrudin@gmail.com for any inquiries.