Title: | Restaurant Data for Entity Resolution |
Version: | 0.1.0 |
Description: | Duplicated restaurant data (pre-processed and formatted) for entity resolution. This package contains formatted data from a data set that contains information about different restaurants, with the Zagats portion containing 331 records and the Fodors portion containing 533 records. The following variables are included in the data set: id, name, address, city, phone, type. The data set has a respective gold data set that provides information on which records match based on id. |
URL: | https://github.com/resteorts/restaurant |
BugReports: | https://github.com/resteorts/restaurant/issues |
Depends: | R (≥ 3.4.0) |
License: | CC0 |
Encoding: | UTF-8 |
LazyData: | true |
RoxygenNote: | 7.1.1.9000 |
NeedsCompilation: | no |
Packaged: | 2020-10-13 13:56:58 UTC; rebeccasteorts |
Author: | Rebecca Steorts [aut, cre], Andee Kaplan [aut], Srini Sunil [aut] |
Maintainer: | Rebecca Steorts <beka@stat.duke.edu> |
Repository: | CRAN |
Date/Publication: | 2020-10-22 08:50:02 UTC |
Restaurants data set
Description
This provides a record linkage data set with information about different restaurants.
Usage
restaurant
Format
A data frame with 16 variables: id
, name
, address
, city
, phone
, type
, class
This data set includes 533 restaurants from the Fodors database and 331 records from the Zagat database. It is appropriate for performing various types of record linkage and can be assessed by standard record linkage methods.
Examples
head(restaurant)
dim(restaurant)
Restaurants Gold
Description
This data set includes the matched record pairs based on ID.
Usage
restaurant_gold
Format
A data frame with 3 variables: class
, id1
, id2
This data set includes the matched record pairs based on ID from the Zagats and Fodors data sets. This data set can be used to evaluate the performance of record linkage methods performed on the Zagats and Fodors data sets.
Examples
head(restaurant_gold)
dim(restaurant_gold)