Golf ball, P. (2000). Into the P. Ball, H. F. Spirer, & L. Spirer (Eds.), Putting some Case: Exploring Large-scale Person Legal rights Violations Having fun with Information Possibilities and you will Studies Research. AAAS.
Belin, T. Roentgen., & Rubin, D. B. (1995). A strategy for calibrating incorrect-matches pricing when you look at the number linkage. Diary of one’s American Mathematical Connection, 90(430), 694–707.
Bilenko, M., & Mooney, Roentgen. J. (2003). Adaptive Duplicate Recognition Using Learnable String Similarity Steps. Into the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automated Record Linkage Using Seeded Nearest Neighbor and you may Help Vector Host Class. Into the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A study out of indexing strategies for scalable list linkage and you will deduplication. IEEE Deals towards Studies and you may Analysis Systems, 24(9), 1537–1555.
Cohen, W., Raviku). A comparison out of sequence metrics to have complimentary labels and you will information. In KDD workshop for the study cleanup and you will target combination (Vol. step three, pp. 73–78).
Copas, J., & Hilton, F. (1990). List linkage: Analytical activities having coordinating desktop records. Diary of the Royal Analytical Neighborhood, Series A, 153(3), 287–320.
Dai, An excellent. M., & Storkey, An effective. J. (2011). The latest grouped blogger-issue design to own unsupervised entity solution. In Phony sensory communities and machine reading–icann 2011 (pp. 241–249). Springer.
Fortini, Meters., Liseo, B., Nuccitelli, A good., & Scanu, Yards. (2001). Toward Bayesian Number Linkage. Lookup from inside the Authoritative Statistics, 4(1), 185–198.
Gutman, Roentgen., Afendulis, C., & Zaslavsky kissbrides.com essential hyperlink, An excellent. (2013). Good bayesian procedure for file hooking up to analyze avoid- of-lives medical will set you back. Record of one’s Western Statistical Association, 108(501), 34–47.
Hsu, W., Lee, Yards. L., Liu, B., & Ling, T. W. (2000). Mining Exploration for the Diabetics Databases: Conclusions and you may Conclusions. For the KDD ’00 (pp. 430–436). ACM.
A torn-mix Markov chain Monte Carlo procedure of this new Dirichlet techniques mix model
Jewell, Letter. P., Spagat, M., & Jewell, B. L. (2013). MSE and you can Casualty Matters: Presumptions, Translation, and you may Challenges. In T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Relying Civil Casualties: An overview of Recording and Estimating Nonmilitary Fatalities incompatible. Oxford, UK: Oxford University Push.
Larsen, Yards. D. (2002)ments toward Hierarchical Bayesian Record Linkage. In Process of the mutual statistical meetings, section towards the survey search procedures (pp. 1995–2000). The American Mathematical Association.
Larsen, Meters. D. (2005). Enhances in the List Linkage Theory: Hierarchical Bayesian Number Linkage Idea. From inside the Process of one’s joint analytical meetings, point with the survey research strategies (pp. 3277–3284). The Western Statistical Association.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automatic checklist linkage having fun with mix habits. Log of Western Analytical Organization, 96(453), 32–41.
Lum, K., Price, M. Elizabeth., & Financial institutions, D. (2013). Programs regarding Multiple Solutions Estimation when you look at the People Liberties Search. New Western Statistician, 67(4), 191–200.
Marchant, Letter. G., C., Kaplan, A beneficial., Rubinstein, B. We. P., & Elazar, D. Letter. (2019). D-blink: Distributed stop-to-end bayesian organization solution.
McCallum, A., & Wellner, B. (2004). Conditional Type Name Suspicion that have App to Noun Coreference. For the Advances within the neural advice processing expertise (nips ’04) (pp. 905–912). MIT Press.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A site-Specific Equipment with the Deduplication of Inoculation Background Information when you look at the Young people Immunization Registriesputers and you will Biomedical Lookup, 33(2), 126–143.
Murphy, J., Brackbill, R. M., Thalji, L., Dolan, M., Pulliam, P., & Walker, D. J. (2007). Measuring and you can Boosting Publicity international Trading Center Fitness Registry. Statistics in Drug, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic list linkage and deduplication just after indexing, blocking, and filtering. Journal out-of Confidentiality and Privacy, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. Meters., Axford, S. J., & James, A beneficial. P. (1959). Automated linkage off vital records machines are often used to extract» follow-up» analytics out of household regarding records out-of regimen facts. Technology, 130(3381), 954–959.
Sadinle, Meters. (2014). Detecting Copies in a murder Registry Using a beneficial Bayesian Partitioning Method. Annals off Applied Statistics, 8(4), 2404–2434.
Sariyar, Meters., Borg, A great., & Pommerening, K. (2012). Productive Reading Tricks for brand new Deduplication out of Digital Diligent Analysis Using Category Woods. Journal off Biomedical Informatics, 45(5), 893–900.
C., Hall, Roentgen., & Fienberg, S. E. (2016). A Bayesian Way of Visual Checklist Linkage and you may Deduplication. Record of the American Mathematical Association, 111(516), 1660–1672.
Tancredi, A good., & Liseo, B. (2011). Good hierarchical Bayesian method of number linkage and inhabitants size dilemmas. Annals of Applied Analytics, 5(2B), 1553–1585.