Baseball, P. (2000). Within the P. Baseball, H. F. Spirer, & L. Spirer (Eds.), Making the Circumstances: Investigating Large-scale People Legal rights Violations Having fun with Recommendations Expertise and you will Studies Study. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A method having calibrating false-fits pricing into the number linkage. Record of the American Analytical Organization, 90(430), 694–707.
Bilenko, Yards., & Mooney, Roentgen. J. (2003). Adaptive Content Identification Using Learnable String Similarity Measures. In the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated List Linkage Using Seeded Nearby Neighbour and you can Help Vector Server Classification. During the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey off indexing tricks for scalable listing linkage and you may deduplication. IEEE Deals with the Knowledge and you can Investigation Technology, 24(9), 1537–1555.
Cohen, W., Raviku). An evaluation from string metrics having matching brands and ideas. For the KDD working area with the study cleaning and you can target consolidation (Vol. step 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). Checklist linkage: Mathematical habits to own matching pc information. Journal of one’s Regal Statistical People, Series Good, 153(3), 287–320.
Dai, An effective. M., & Storkey, An effective. J. (2011). The fresh labeled creator-point design to own unsupervised entity solution. Within the Artificial sensory systems and server reading–icann 2011 (pp. 241–249). Springer.
Fortini, M., Liseo, B., Nuccitelli, An excellent., & Scanu, Yards. (2001). With the Bayesian Checklist Linkage. Browse within the Specialized Analytics, 4(1), 185–198.
Gutman, Roentgen., Afendulis, C., & Zaslavsky, An effective. (2013). Good bayesian procedure of file hooking up to research prevent- of-lifestyle medical will set you back. Log of Western Analytical Relationship, 108(501), 34–47.
Hsu, W., Lee, Meters. L., Liu, B., & Ling, T. W. (2000). Mining Mining during the Diabetics Databases: Results and you may Findings. In KDD ’00 (pp. 430–436). ACM.
A split-mix Markov strings Monte Carlo process of brand new Dirichlet process mixture model
Jewell, N. P., Spagat, Meters., & Jewell, B. L. (2013). MSE and Casualty Matters: Assumptions, Interpretation, and you may Challenges. For the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Counting Civilian Casualties: An overview of Recording and Quoting Nonmilitary Fatalities in conflict. Oxford, UK: Oxford School Push.
Larsen, M. D. (2002)ments for the Hierarchical Bayesian Record Linkage. Inside Process of the combined mathematical conferences, part toward survey look strategies (pp. 1995–2000). New Western Analytical Relationship.
Steorts, R
Larsen, Meters. D. (2005). Enhances when you look at the Listing Linkage Concept: Hierarchical Bayesian Checklist Linkage Concept. From inside the Process of combined mathematical conferences, section for the survey browse procedures (pp. 3277–3284). New Western Statistical Organization.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automatic number linkage using blend habits. Log of your American Mathematical Relationship, 96(453), 32–41.
Lum, K., Speed, Yards. Age., & Banking companies, D. (2013). Software of Several Systems Quote most sexy Lang son girl from inside the People Liberties Search. The brand new American Statistician, 67(4), 191–200.
Marchant, N. Grams., C., Kaplan, A good., Rubinstein, B. We. P., & Elazar, D. Letter. (2019). D-blink: Marketed avoid-to-end bayesian organization quality.
McCallum, A., & Wellner, B. (2004). Conditional Varieties of Identity Suspicion having Application to Noun Coreference. Into the Enhances during the neural advice processing expertise (nips ’04) (pp. 905–912). MIT Push.
Miller, P. L., Frawley, S. J., & Sayward, F. Grams. (2000). IMM/Scrub: A site-Particular Tool to your Deduplication away from Inoculation History Info in Youth Immunization Registriesputers and you will Biomedical Lookup, 33(2), 126–143.
Murphy, J., Brackbill, Roentgen. M., Thalji, L., Dolan, Meters., Pulliam, P., & Walker, D. J. (2007). Calculating and you may Maximizing Visibility around the globe Trading Cardiovascular system Health Registry. Analytics inside Medication, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic number linkage and you will deduplication after indexing, clogging, and filtering. Record from Privacy and you can Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. M., Axford, S. J., & James, A. P. (1959). Automated linkage off public record information computers can be used to extract” follow-up” statistics regarding family away from files away from regimen records. Science, 130(3381), 954–959.
Sadinle, Yards. (2014). Finding Duplicates inside the a murder Registry Playing with a good Bayesian Partitioning Means. Annals from Used Analytics, 8(4), 2404–2434.
Sariyar, Meters., Borg, A., & Pommerening, K. (2012). Productive Understanding Suggestions for the latest Deduplication out of Digital Patient Investigation Having fun with Category Trees. Diary away from Biomedical Informatics, 45(5), 893–900.
C., Hallway, Roentgen., & Fienberg, S. Age. (2016). A beneficial Bayesian Way of Visual Listing Linkage and you may Deduplication. Log of the American Mathematical Connection, 111(516), 1660–1672.
Tancredi, An excellent., & Liseo, B. (2011). A beneficial hierarchical Bayesian method to list linkage and people dimensions difficulties. Annals from Used Statistics, 5(2B), 1553–1585.