Basketball, P. (2000). During the P. Ball, H. F. Spirer, & L. Spirer (Eds.), Putting some Instance: Investigating Major Peoples Rights Violations Playing with Recommendations Possibilities and you will Studies Research. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A strategy to have calibrating not the case-meets rates inside listing linkage. Log of the Western Statistical Association, 90(430), 694–707.
Bilenko, Yards., & Mooney, Roentgen. J. (2003). Transformative Duplicate Identification Playing with Learnable Sequence Similarity Strategies. In the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated Record Linkage Playing with Seeded Nearest Neighbor and you may Service Vector Host Category. For the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A study of indexing suggestions for scalable record linkage and deduplication. IEEE Purchases on the Education and Data Systems, 24(9), 1537–1555.
Cohen, W., Raviku). A comparison from sequence metrics for complimentary names and you can details. In the KDD workshop with the analysis cleanup and you will object combination (Vol. step three, pp. 73–78).
Copas, J., & Hilton, F. (1990). Number linkage: Analytical activities to have complimentary computers ideas. Diary of your Regal Mathematical Neighborhood, Series Good, 153(3), 287–320.
Dai, An effective. Yards., & Storkey, A good. J. (2011). New grouped blogger-procedure model to possess unsupervised organization quality. Within the Phony neural networking sites and machine understanding–icann 2011 (pp. 241–249). Springer.
Fortini, M., Liseo, B., Nuccitelli, An effective., & Scanu, Yards. (2001). For the Bayesian List Linkage. Browse inside the Formal Analytics, 4(1), 185–198.
Gutman, Roentgen., Afendulis, C., & Zaslavsky, An effective. (2013). An excellent bayesian means of document connecting to research prevent- of-lives scientific costs. Record of the American Statistical Organization, 108(501), 34–47.
Hsu, W., Lee, M. L., Liu, B., & Ling, T. W. (2000). Mining Exploration within the Diabetic patients Database: Findings and you can Findings. Within the KDD ’00 (pp. 430–436). ACM.
A torn-mix Markov chain Monte Carlo means of the newest Dirichlet techniques mixture model
Jewell, N. P., Spagat, Meters., & Jewell, B. L. (2013). MSE and you can Casualty Matters: Assumptions, Translation, and you can Pressures. Within the T. B. Seybolt, J Zaporizhzhya women for marriage. D. Aronson, & B. Fischhoff (Eds.), Depending Civilian Casualties: An introduction to Tape and you can Quoting Nonmilitary Fatalities incompatible. Oxford, UK: Oxford University Press.
Larsen, Meters. D. (2002)ments towards Hierarchical Bayesian Number Linkage. For the Proceedings of your own shared analytical conferences, section on the survey browse tips (pp. 1995–2000). The newest Western Statistical Organization.
Larsen, Yards. D. (2005). Enhances when you look at the Listing Linkage Concept: Hierarchical Bayesian Checklist Linkage Idea. Inside Proceedings of your combined analytical group meetings, area to the survey research strategies (pp. 3277–3284). This new American Statistical Relationship.
Larsen, M. D., & Rubin, D. B. (2001). Iterative automated list linkage using combination designs. Journal of your own American Statistical Organization, 96(453), 32–41.
Lum, K., Rates, Meters. Age., & Banking institutions, D. (2013). Apps out of Numerous Solutions Estimate inside the People Liberties Lookup. The fresh Western Statistician, 67(4), 191–2 hundred.
Marchant, Letter. Grams., C., Kaplan, An effective., Rubinstein, B. I. P., & Elazar, D. N. (2019). D-blink: Distributed prevent-to-end bayesian entity resolution.
McCallum, A., & Wellner, B. (2004). Conditional Type Identity Suspicion which have Application to Noun Coreference. During the Enhances inside neural information control possibilities (nips ’04) (pp. 905–912). MIT Drive.
Miller, P. L., Frawley, S. J., & Sayward, F. Grams. (2000). IMM/Scrub: A domain-Certain Unit towards the Deduplication regarding Inoculation History Details during the Youthfulness Immunization Registriesputers and Biomedical Browse, 33(2), 126–143.
Murphy, J., Brackbill, R. Meters., Thalji, L., Dolan, Meters., Pulliam, P., & Walker, D. J. (2007). Calculating and you can Maximizing Coverage around the globe Change Heart Fitness Registry. Analytics for the Treatments, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic record linkage and you can deduplication immediately following indexing, clogging, and filtering. Log off Privacy and you can Confidentiality, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. Meters., Axford, S. J., & James, A. P. (1959). Automated linkage regarding public record information computers are often used to extract » follow-up » analytics away from family members out of documents away from program records. Research, 130(3381), 954–959.
Sadinle, M. (2014). Finding Duplicates during the a homicide Registry Using good Bayesian Partitioning Approach. Annals of Applied Analytics, 8(4), 2404–2434.
Sariyar, M., Borg, An excellent., & Pommerening, K. (2012). Energetic Learning Strategies for the new Deduplication out-of Digital Diligent Investigation Using Class Woods. Log from Biomedical Informatics, 45(5), 893–900.
C., Hallway, R., & Fienberg, S. E. (2016). An excellent Bayesian Method to Graphical Number Linkage and Deduplication. Diary of your Western Statistical Relationship, 111(516), 1660–1672.
Tancredi, A., & Liseo, B. (2011). A great hierarchical Bayesian way of list linkage and you will society size dilemmas. Annals out of Applied Statistics, 5(2B), 1553–1585.