+ All Categories
Home > Documents > THE DATA FUSION BIBLIOGRAPHY

THE DATA FUSION BIBLIOGRAPHY

Date post: 02-Jan-2017
Category:
Upload: doandiep
View: 222 times
Download: 0 times
Share this document with a friend
102
THE DATA FUSION BIBLIOGRAPHY (compiled by Roland Soong, updated as of 6/10/2006) Abello, R. and Phillips, B. (2004) Statistical matching of the HES and NHS: an exploration of issues in the use of unconstrained and constrained approaches in creating a basefile for a microsimulation model of the pharmaceutical benefits segment. Technical report, Australian Bureau of Statistics. Methodology Advisory Committee Paper, June. Achen, C.H. and Shively, W.P. (1995) Cross-Level Inference. University of Chicago Press: Chicago, IL. Adamek, J. (1994) Fusion: combining data from separate sources. Marketing Research: A Magazine of Management and Applications, 6, 48-50. Aerts, M., Claeskens, G., Hens, N. and Molenberghs (2002) Local multiple imputation. Biometrika, 89, 375-388. Aglietta, J. (2003) La France Plurimedia. Lessons drawn from a fusion between surveys and a clustering. ARF/ESOMAR Week of Audience Measurement, Los Angeles, California, USA, June 18, 2003. Ahmed, S. and Lachenbruch, P.A. (1983) Discriminant analysis when scale contamination is present in the initial sample. In Classification and Clustering. (J. van Ryzin (ed.)). New York: Academic Press. Ahuja, R.K., Magnanti, T.L. and Orlin, J.B. (1993) Network Flows: Theory, Algorithms and Applications. New York: Prentice Hall. Aitchison, J. and Aitkin, C.G.G. (1976) Multivariate binary discrimination by the kernel method. Biometrika, 63, 413-420. Aitchison, J., Habbema, J.D.F. and Kay, J.W. (1977) A critical comparison of two methods of statistical discrimination. Applied Statistics, 26, 15-25. Aitchison, J. and Silvey, S.D. (1958) Maximum likelihood estimation of parameters subject to restraints. Annals of Mathematical Statistics, 29, 813-828. Akaike, H. (1974) A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716-723. Alegre, J., Arcarons, J., Calonge, S. and Manresa, A. (2000) Statistical matching between datasets: An application to the Spanish 1
Transcript
Page 1: THE DATA FUSION BIBLIOGRAPHY

THE DATA FUSION BIBLIOGRAPHY(compiled by Roland Soong, updated as of 6/10/2006)

Abello, R. and Phillips, B. (2004) Statistical matching of the HES and NHS: an exploration of issues in the use of unconstrained and constrained approaches in creating a basefile for a microsimulation model of the pharmaceutical benefits segment. Technical report, Australian Bureau of Statistics. Methodology Advisory Committee Paper, June.

Achen, C.H. and Shively, W.P. (1995) Cross-Level Inference. University of Chicago Press: Chicago, IL.

Adamek, J. (1994) Fusion: combining data from separate sources. Marketing Research: A Magazine of Management and Applications, 6, 48-50.

Aerts, M., Claeskens, G., Hens, N. and Molenberghs (2002) Local multiple imputation. Biometrika, 89, 375-388.

Aglietta, J. (2003) La France Plurimedia. Lessons drawn from a fusion between surveys and a clustering. ARF/ESOMAR Week of Audience Measurement, Los Angeles, California, USA, June 18, 2003.

Ahmed, S. and Lachenbruch, P.A. (1983) Discriminant analysis when scale contamination is present in the initial sample. In Classification and Clustering. (J. van Ryzin (ed.)). New York: Academic Press.

Ahuja, R.K., Magnanti, T.L. and Orlin, J.B. (1993) Network Flows: Theory, Algorithms and Applications. New York: Prentice Hall.

Aitchison, J. and Aitkin, C.G.G. (1976) Multivariate binary discrimination by the kernel method. Biometrika, 63, 413-420.

Aitchison, J., Habbema, J.D.F. and Kay, J.W. (1977) A critical comparison of two methods of statistical discrimination. Applied Statistics, 26, 15-25.

Aitchison, J. and Silvey, S.D. (1958) Maximum likelihood estimation of parameters subject to restraints. Annals of Mathematical Statistics, 29, 813-828.

Akaike, H. (1974) A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716-723.

Alegre, J., Arcarons, J., Calonge, S. and Manresa, A. (2000) Statistical matching between datasets: An application to the Spanish Household Survey (EPF90) and the Income Tax File (IRPF90). In The Workshop on Fighting Poverty and Inequality Through Tax Benefit Reform: Empirical Approaches. Barcelona.

Allison, P. (2000) Multiple imputation for missing data: A cautionary tale. Sociological Methods and Research, 28 (3), 301-309.

Allison, P.D. (2001) Missing Data. Sage: Thousand Oaks, CA.

Alós, J.S. and Nafria, E. (1997) AUDE integral information analysis derived from consumer panels and audience panels. Marketing and Research Today, 25(2), 106-114.

1

Page 2: THE DATA FUSION BIBLIOGRAPHY

Alter, H. E. (1974) Creation of a synthetic data set by linking records of the Canadian Survey of Consumer Finances with the Family Expenditure Survey 1970. Annals of Economic and Social Measurement, 2, 373-394.

Althauser, R. and Rubin, D.B. (1971) The computerized construction of a matched sample. American Journal of Sociology, 76, 325-346.

Aluja, T., Nonell, R., Rius, R., and Martínez, M. (1995) File grafting. In F. Mola and A. Morineau (eds) Actes du IIIème Congrès International d'Analyses Multidimensionnelles des Données - NGUS'95, Centre Int. de Statistique et d'Informatique Appliquées, CISIA-CERESTA, 23-32.

Aluja-Banet, T. and Thio, S. (2001) Survey data fusion. Bulletin of Sociological Methodology, 72, 20-36.

Alvey, W. and Jamerson, B. (eds.) (1997) Record Linkage Techniques – 1997. (Proceedings of an International Record Linkage Workshop and Exposition on March 20-21, 1997 in Arlington, Virginia) Washington, DC: Federal Committee on Statistical Methodology.

Alvord, C.H. III (1983) The pros and cons of goal programming: a reply. Computers and Operations Research, 10, 61-62.

Amemiya, T. (1984) Tobit models: a survey. Journal of Econometrics, 24, 3-61.

Anagnoson, J.T. (2000) Microsimulation of public policy. In Handbook of Public Information Systems (G.D. Garson (ed.)). New York: Marcel Dekker.

Anderson, J.A. (1969) Discrimination between k populations with constraints on the probabilities of misclassification. Journal of the Royal Statistical Society, B31, 123-139.

Anderson, J. (1972) Separate sample logistic discrimination. Biometrika, 59, 19-36.

Anderson, T.W. (1951) Classification by multivariate analysis. Psychometrika, 16, 31-50.

Anderston, T.W. (1973) Asymptotic evaluation of the probabilities of misclassification. In Discriminant Analysis and Applications (T. Cacoullos (ed)), 17-36. New York: Academic Press.

Anderson, T.W. and Bahadur, R.R. (1962) Classification into two multivariate normal distributions with different covariance matrices. Annals of Mathematical Statistics, 33, 420-431.

Antoine, J. (1985) A case study illustrating the objectives and perspectives of fusion techniques. Proceedings of the Salzburg Readership Symposium.

Antoine, J. and G. Santini (1987) Fusion techniques: alternative to single-source methods. European Research, 15 (August), 178-187.

Armstrong, J. (1989) An evaluation of statistical matching methods. Working Paper no. BSMD 90-003E, Methodology Branch, Statistics Canada, Ottawa.

Arnold, B.C., Castillo, E. and Sarabia, J.M. (1999) Conditional Specification of Statistical Models. New York: Spinger-Verlag New York.

Arnold, B.C., Castillo, E. and Sarabia, J.M. (2001) Conditionally specified distributions: An Introduction. Statistical Science, 16, 249-274.

2

Page 3: THE DATA FUSION BIBLIOGRAPHY

Arnold, B.C. and Gokhale, D.V. (1994) On uniform marginal representation of contingency tables. Statistics and Probability Letters, 21, 311-316.

Arsham H (1989) A simplex type algorithm for general transportation problem: An alternative to stepping stone. Journal of the Operational Research Society, 40, 581-590.

Arsham H. (1992) Postoptimality analyses of the transportation problem. Journal of the Operational Research Society, 43, 121-139.

Arthanari, T.S. and Dodge, Y. (1981) Mathematical Programming in Statistics. New York: John Wiley & Sons.

Arthur, J.L. and Ravindran, A. (1978) An efficient goal programming algorithm using constraint partitioning and variable elimination. Management Science, 24, 867-868.

Assael, H. and Cannon, H. (1979) Do demographics help in media selection? Journal of Advertising Research, 19(6), 7-11.

Assael, H. and Poltrack, D.F. (1991) Using single-source data to select TV programs based on purchasing behavior. Journal of Advertising Research, 31(4), 9-17.

Assael, H. and Poltrack, D.F. (1993) Using single-source data to select TV programs based on purchasing behavior. Part II. Journal of Advertising Research, 33(1), 48-56.

Assael, H. and Poltrack, D.F. (1994) Can demographic profiles of heavy users serve as a surrogate for purchase behavior in selecting TV programs? Journal of Advertising Research, 34(1), 11-17.

Assael, H. and Poltrack, D.F. (1996) Single vs. double source data for TV program selection. Journal of Advertising Research, 36(6), 73-81.

Assael, H. and Poltrack, D.F. (1999) Relating products to TV program clusters. Journal of Advertising Research, 39(2), 41-52.

Assael, H. and Poltrack, D.F. (2002) Consumer surveys vs. electronic measures for single-sources data. Journal of Advertising Research, 42(5), 19-25.

Atrostic, B. K. (1994) A multiple imputation approach to microsimulation. Proceedings of the Survey Research Methods Section, American Statistical Association, 529-534. Washington, DC: American Statistical Association.

Bäck, T., Fogel, D.B. and Michalewicz, Z. (eds.) (1997) Handbook of Evolutionary Computation. New York: Oxford University Press.

Bäck, T. and Schwefel, H.P. (1993) An overview of evolutionary algorithms for parameters optimization. Evolutionary Computation, 1(1), 1-24.

Bajgier, S.M. and Hill, A.V. (1982) An experimental comparison of statistical and linear programming approaches to the discriminant problems. Decision Sciences, 13, 604-618.

Bailar, B. A. and Bailar, J. C. (1978) Comparison of two procedures for imputing missing survey values. Proceedings of the Section on Survey Research Methods, American Statistical Association, 462-467. Washington, DC: American Statistical Association.

Bailey, J. and Boller, S. (2001) Data fusion from the media sellers’ perspective. 10th Worldwide Readership Research Symposium, Venice, 119-127.

3

Page 4: THE DATA FUSION BIBLIOGRAPHY

Baim, J. and Frankel, M.R. (1997) Enhanced ascription. Proceedings of the Magazine Audience Measurement Research. New York: Advertising Research Foundation.

Baker, K. (1990) The BARB/TGI Fusion. Technical Report on the Fusion Conducted in February/March 1990. Ickenham, UK: Ken Baker Associates.

Baker, K. (1996) What do we know about … data fusion? Admap, July.

Baker, K., Harris, P. and O’Brien, J. (1989) Data fusion: an appraisal and experimental evaluation. Journal of the Market Research Society, 31(2), 153-212.

Baker, K. Harris, P. and O’Brien, J. (1990) Data fusion: can perfection be reached? Journal of the Market Research Society, 32(3), 473-474.

Bakker, B.F.M. and Winkels, J.W. (1998) Why integration of household surveys? – Why POLS? Netherlands Official Statistics, 13, 5.7.

Balakrishnan, N. (1990) Modified Vogel's approximation method for the unbalanced transportation problem. Applied Mathematics Letters, 3(2), 9-11.

Balas, E. (1964) Extension de l’algorithme additif à la programmation en nombres entiers et à programmation nonlinéare. Comptes Rendues de l’Academie des Sciences, 258, 5136-5139.

Balinski, M.L. (1986) A competitive simplex method for the assignment method. Mathematical Programming, 34, 125-141.

Balinski, M.L. and Gomory, R.E. (1964) A primal method for the assignment and transportation problems. Management Science, 10, 578-593.

Ballas, D., Clarke, G. and Turton, I. (1999) Exploring microsimulation methodologies for the estimation of household attributes. Paper presented at the Fourth International Conference on Geo-Computation, Mary Washington College, Virginia, USA, July 25-28.

Bals, W. (2002) Controlled split survey in media practice. IMPUTE: Symposium on Nonresponse, Questionnaire Split and Multiple Imputation, Nuremberg, Germany, September 25.

Bandyopadhyay, S., Murphy, C.A. and Pal, S.K. (1995) Pattern classification using genetic algorithms. Pattern Recognition Letters, 16, 801-808.

Baniel, L. and Monistere, D. (2001) Single source data – qualitative and ratings data combined. Paper presented at ARF Week of Workshops, Chicago, IL.

Bar-Hen, A. and Daudin, J.-J. (1995) Generalization of the Mahalanobis distance in the mixed case. Journal of the Multivariate Analysis, 53, 332-342.

Barnard, J. and Meng, X.L. (1999) Application of multiple imputation in medical studies: from AIDS to NHANES. Statistical Methods in Medical Research, 8, 17-36.

Barnard, J. and Rubin, D.B. (1999) Small sample degrees of freedom with multiple imputation. Biometrika, 86, 948-955.

Baron, R. (2001) A new practical approach to data fusion. Paper presented at ARF Week of Workshops, Chicago, IL.

4

Page 5: THE DATA FUSION BIBLIOGRAPHY

Barr, R.S., Hickman, B.L. and Turner, J.S. (1994) Optimal microdatabase merging: a new method and applications. The Impact of Emerging Technology On Computer Science and Operations Research: ORSA CSTS Conference, Williamsburg, Virginia, January 5-7, 1994.

Barr, R., Hickman, B. and Turner, J.S. (2000) Optimal microdata file merging: a new model and network optimization algorithm. Presentation at the 2000 INFORMS National Conference, San Antonio, TX.

Barr, R.S., Stewart, W.H. and Turner, J.S. (1982) An empirical evaluation of statistical matching methodologies. Mimeograph, Edwin L. Cox School of Business, Southern Methodist University, Dallas, Texas.

Barr, R. S. and Turner, J. S. (1978) A new, linear programming approach to microdata file merging. In 1978 Compendium of Tax Research. Washington, DC: Office of Tax Analysis, U.S. Department of the Treasury, 131-149.

Barr, R.S. and Turner, J.S. (1981) Microdata file merging through large-scale network technology. Mathematical Programming Study, 15, 1-22

Barr, R.S. and Turner, J.S. (1990) Quality issues and evidence in statistical file merging. In Data Quality Control: Theory and Pragmatics (G.E. Liepins and V.R.R. Uppuluri (eds)), 245-313. New York: Marcel Dekker.

Barry, J.T. (1988) An investigation of statistical matching. Journal of Applied Statistics, 15, 275-283.

Bartlett, M.S. (1939) The standard errors of discriminant function coefficients. Journal of the Royal Statistical Society, Supplement, 6, 169-173.

Bartlett, M.S. (1951) The goodness of fit of a single hypothetical discriminant function in the case of several groups. Annals of Eugenics, 16, 199-214.

Bartlett, M.S. and Please, N.W. (1983) Discrimination in the case of zero mean differences. Biometrika, 50, 17-21.

Barlett, S., Krewski, D., Wang, Y. and Zielinksi, J.M. (1993) Evaluation of error rates in large scale computerized record linkage studies. Survey Methodology, 19(1), 3-12.

Bass, F.M. and Lonsdale, R.T. (1966). An exploration of linear programming in media selection. Journal of Marketing Research, 3, 179-188.

Bass, F.M., Tigert, D.J. and Lonsdale, R.T. (1968) Market segmentation: group versus individual behavior. Journal of Marketing Research, 5(3), 264-270.

Bauer, E. and Kohavi, R. (1999) An empirical comparison of voting classification algorithms: bagging, boosting, and variants. Machine Learning, 36, 105-139.

Baynton, P. (2003) Multi-media analysis. What is needed? ARF 49th Annual Convention and Research Infoplex, New York City, USA, April 2003.

Baynton, P. (2003) Data integration or fusion. Which is best for mixed media schedule analysis? ARF/ESOMAR Week of Audience Measurement, Los Angeles, California, USA, June 18, 2003.

Beal, G. (1945) Approximate methods in calculating discriminant functions. Psychometrika, 10, 205-218.

5

Page 6: THE DATA FUSION BIBLIOGRAPHY

Beale, E.M.L., Hughes, P.A.B. and Broadbent, S.R.M. (1966). A computer assessment of media schedules. Operational Research Quarterly, 17, 381-411.

Beale, M.E. (1955) Cycling in the dual simplex algorithm. Naval Research Logistics Quarterly, 2(4), 269-275.

Beasley, J.E. (1997) A genetic algorithm for the generalised assignment problem. Computers and Operations Research, 23, 17-23.

Bedrick, E.J., Lapidus, J. and Powell, J.F. (2000) Estimating the Mahalanobis distance from mixed continuous and discrete data. Biometrics, 56, 394-401.

Bedwell, R. (1991) Fusion – Britain’s latest experience. In Fifth Worldwide Readership Symposium, Hong Kong.

Belin, T.R. (1993) Evaluation of sources of variation in record linkage through a factorial experiment. Survey Methodology, 19, 13-29.

Belin, T.R. and Rubin, D.B. (1990) Calibration of errors in computer matching for Census undercount (with discussion) Proceedings of the Government Statistics Section of the American Statistical Association, 124-131. . Washington, DC: American Statistical Association.

Belin, T.R. and Rubin, D.B. (1995) A method for calibrating false-match rates in record linkage. Journal of the American Statistical Association, 90, 694-707.

Bello, A. (1993). Choosing among imputation techniques for incomplete multivariate data: a simulation study. Communications in Statistics - Theory and Methods, 22, 853-877.

Bello, A.L. (1995) Imputation techniques in regression analysis: Looking closely at their implementation. Computational Statistics and Data Analysis, 20, 45-57.

Bennett, K.P. (1999) Combining support vector and mathematical programming methods for classification. In Schölkopf, B., Burges, C. and Smola, A. (eds) Advances in Kernel Methods – Support Vector Machines, p. 307-326. MIT Press: Cambridge, MA.

Bennett, K.P. and Mangasarian, O.L. (1992) Robust linear programming discrimination of two linearly inseparable sets. Optimization Methods and Software, 1, 23-34.

Bennike, S (1985) Fusion --- an overview by an outside observer. Proceedings of the Salzburg Readership Symposium.

Bergg. D., Kite, J. and Silman, R. (1992) TV-AM Target Group Ratings: How TGR allowed TV-AM to make their clients an offer they couldn’t refuse. In Research Works – Papers from the AMSO Reseach Effectiveness Awards, 1991 (D. Martin and J. Goodyear (eds)), 118-137. NTC Publications Ltd.

Bergsma, W.P. and Rudas, T. (2002) Marginal models for categorical data. Annals of Statistics, 30, 140-159.

Bergstrahl, E.J., Kosanke, J.L. and Jacobsen, S.L. (1996) Software for optimal matching in observational studies. Epidemiology, 7, 331-332.

Bergstralh, E.J., Kosanke, J.L. and Jacobsen, S.L. (1996) Software for optimal matching in observational studies. Epidemiology, 7, 331-332. (http://www.mayo.edu/hsr/sasmac.html)

6

Page 7: THE DATA FUSION BIBLIOGRAPHY

Bermingham, J. (1981) Simulating readership data surveys in Great Britain --- a review of current practices. Proceedings of the New Orleans Readership Symposium.

Bersekas, D.P. (1991) Linear Network Optimization: Algorithms and Codes. Cambridge, MA: MIT Press.

Bertazzi, L., Speranza, M.G. and Ukovich, W. (1996) An algorithm for the transportation problem with given frequencies. In System Modelling and Optimization, (J. Dolezal and J. Fidler, eds.), 535-542. New York: Chapman & Hall.

Berthold, M. and Hand, D.J. (1999) Intelligent Data Analysis. Heidelberg, Germany: Springer Verlag.

Bhattacharyya, A. (1943) On a measure of divergence between two statistical populations defined by their probability distributions. Bulletin of the Calcutta Mathematical Society, 35, 99-109.

Biemer, P.P., Groves, R.M., Lyberg, L.E., Mathiowetz, N.A. and Sudman, S. (eds). (1991) Measurement Errors in Surveys. New York: John Wiley Inc.

Binder, D.A. (1996) On variance estimation with imputed survey data: Comment. Journal of the American Statistical Assocation, 91, 510-512.

Binder, D.A. and Sun, W. (1996) Frequency valid multiple imputation for surveys with a complex design. Proceedings of the Survey Research Methods Section of the American Statistical Association, 281-286. Washington, DC: American Statistical Association.

Bishop, C.M. (1995) Neural Networks for Pattern Recognition. Oxford, UK: Oxford University Press.

Bishop, G.T. (1957) On a problem of production scheduling. Operations Research, 5(1), 97-103.

Bland, R.G. (1976) New finite pivoting rules. Discussion Paper 7612, Center for Operations Research and Econometrics (CORE), Université Catholique de Louvain, Heverlee, Belgium.

Bland, R.G. (1977) New finite pivoting rules for the simplex methods. Mathematics of Operations Research, 2, 103-107.

Bland, R.G., Goldfarb, D. and Todd, M.J. (1981) The ellipsoid method: a survey. Operations Research, 29, 1039-1091.

Bock, T. and Jephcott, J. (1998) The application and validation of data fusion. International Journal of Market Research, 40(3).

Bogwardt, K.-H. (1982) Some distribution-independent results about the asymptotic order of the average number of pivot steps of the simplex methods. Mathematics of Operations Research, 7, 441-462.

Bogwardt, K.-H. (1982) The average number of pivot stpes required by the simplex method is polynomial. Zeitschrift für Operations Research, 26, 157-177.

Boison, G. (1986) Les fusion d’enquêtes – le point de vue d’un utilisateur (The fusion of files – a user’s point of view) . Television Research – International Symposium. Tarrytown, New York, October 5-8, 1986.

7

Page 8: THE DATA FUSION BIBLIOGRAPHY

Bonomi, E. and Lutton, J.L. (1984) The n-city traveling salesman problem: statistical mechanics and the Metropolis algorithm. SIAM Review, 26(4), 551-569.

Bordt, M., Cameron, G.J., Gibble, S.F., Murphy, B.B., Rowe, G.T. and Wolfson, M.C. (1990) The social policy simulation database and model: an integrated tool for tax/transfer policy analysis. Canadian Tax Journal, 48-65.

Bottou, L. and Vapnik, V. (1992) Local learning algorithms. Neural Computation, 4(6), 888-900.

Bowman, E.H. (1956) Production scheduling by the transportation method of linear programming. Operations Research, 4(1).

Breiman, L. (1993) Hinging hyperplanes for regression, classification and function approximation. IEEE Transactions on Information Theory, IT-3, 999-1013.

Brackstone, G. (1999) Managing data quality in a statistical agency. Survey Methodology, 25(2), 139-149.

Bradley, P.S., Fayyad, U.M. and Mangasarian, O.L. (1999) Mathematical programming for data mining: formulation and challenges. INFORMS Journal of Computing, 11, 217-238.

Brameier, M. and Banzhaf, W. (2001) A comparison of linear genetic programming and neural networks in medical data mining. IEEE Transactions on Evolutionary Computation, 5, 17-26.

Brancato, G., Pezzotti, P., Rapiti E., Perucci, C.A., Abeni, D., Babbalaccio, A., Rezza, G. and The Multicenter Prospective HIV Study (1997) Multiple imputation method for estimating incidence of HIV infection. International Journal of Epidemiology, 26, 1107-1114.

Brandt, J.D. and Chalk, M. (1985) The use of automatic editing in the 1981 Census. Journal of the Royal Statistical Society, Series A148, 2.

Breiman, L. (1993) Hinging hyperplanes for regression, classification and function approximation. IEEE Transactions on Information Theory, 39, 999-1013.

Breiman, L. (1996) Stacked regressions. Machine Learning, 24, 49-64.

Breiman, L. (1996) Bagging predictors. Machine Learning, 24, 123-140.

Breiman, L. (1996) Heuristics of instability and stabilization in model selection. Annals of Statistics, 24, 2350-2383.

Breiman, L. (1998) Arcing classifiers (with discussion). Annals of Statistics, 26, 801-849.

Breiman, L. (1999) Prediction games and arcing algorithms. Neural Computing, 11, 1493-1517.

Breiman, L., Friedman, J., Olshen, R. and C. Stone (1984) Classification and Regression Trees. Belmont, California: Wadsworth.

Breiman, L., Meisel, W. and Purcell, E. (1977) Variable kernel estimates of multivariate densities. Technometrics, 19, 135-144.

Brennan, D.N. and Wyndham, J. (1992) Researching audiences to target audiences – a different type of single source. In ESOMAR/ARF Worldwide Broadcast Symposium, Toronto (Canada), 711-727.

8

Page 9: THE DATA FUSION BIBLIOGRAPHY

Broder, A. (1990) Strategies for efficient incremental nearest neighbor search. Pattern Recognition, 23, 171-178.

Brodley, C.E. and Utgoff, P.E. (1995) Multivariate decision trees. Machine Learning, 19, 45-77.

Brown, D.B. and Warshaw, M. (1965). Media selection by linear programming. Journal of Marketing Research, 2, 83-88.

Brown, G.W. (1947) Discriminant functions. Annals of Mathematical Statistics, 18, 514.

Brown, J.G. (2002) Using a multiple imputation technique to merge data sets. Applied Economic Letters, 9, 311-314.

Brown, M. (1986) Data fusion: an introduction. Television Research – International Symposium. Tarrytown, New York, October 5-8, 1986.

Brown, M. (1991) Enhancing media survey value through data fusion. In Proceedings of the ESOMAR Congress, Luxembourg, 573-592.

Brown, M. (2000) Delivering and measuring data fusion quality. Will Fusion Come to America Workshop, Advertising Research Foundation, December 2000, New York City.

Brown, M. and Noordhoff, J.D. (1983) The AG.MA Media Research Model: Philosophy and Consequences. Frankfurt am Main: MMC.

Brown, T.A. and Koplowitz, J. (1979) The weighted nearest neighbor rule for class dependent sample sizes. IEEE Transactions on Information Theory, IT-25, 617-619.

Brownlee, J. (2003) Defining the pragmatic limits of single source media research. 6th Annual One Day Media Research Seminar, London, June 11th, 2003.

Bryan, J.G. (1951) The generalized discriminant function: mathematical foundation and computational routine. Harvard Educational Review, 21, 90-95.

Buck, S. (1960) A method for estimation of missing values in multivariate data suitable for use with an electronic computer. Journal of the Royal Statistical Society, B22, 302-306.

Buck, S. (1989) Single source data --- theory and practice. Journal of the Market Research Society, 31(4), 489-500.

Budd, E. C. (1971) The creation of a microdata file for estimating the size distribution of income. Review of Income and Wealth, 17 (December), 317-333.

Buehlman, P. and Bin, Y. (2002) Analyzing bagging. Annals of Statistics, 30, 927-961.

Buntine, W. (1992) Learning classification trees. Statistics and Computing, 2(2), 63-73.

Burges, C.J.C. (1998) A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2, 121-167.

Burkard, R.E. (1978) A general Hungarian method for the algebraic transportation problem Discrete Mathematics, 22, 219-232.

Burkard, R.E. and Derigs, U. (1980) Assignment and matching problems: solution methods with FORTRAN-Programs. Lectures Notes in Economics and Mathematical Systems, Vol. 184, Berlin-New York: Springer Verlag.

9

Page 10: THE DATA FUSION BIBLIOGRAPHY

Burns, E.M. (1989) Multiple imputation in a complex sample survey. Proceedings of the Survey Research Methods Section of the American Statistical Association, 233-238.

Butler, D. and Pendley, M. (1989) A visualization model based on the mathematics of fiber bundles. Computers in Physics, 3(5).

Butler, D. and Bryson, S. (1992) Vector-bundle classes form powerful tool for scientific visualization. Computers in Physics, 6(6), 576-584.

Buuren, S.V. and Van Rijckevorsel, L.A. (1992) Imputation of missing categorical data by maximizing internal consistency. Psychometrika, 57(4), 567-580.

Byfield, S. and Dodson, R. () To fuse or not to fuse? In S. Byfield (ed.) Better Research for Better Media Decisions. ESOMAR Monograph Volume 7.

Cacoullos, T. (ed.) (1973) Discriminant analysis and applications. New York: Academic Press.

Cannon, H.M. (1984) The naïve approach to media selection. Journal of Advertising Research, 24(3), 21-25.

Cannon, H.M. (1985) Evaluating the ‘profile-distance’ approach to media planning. Journal of Adveritsing, 14(1), 4-9.

Cannon, H.M. (1986) A method for estimating target market ratings in television media selection. Journal of Advertising, 15(2), 21-26.

Cannon, H.M. (1986) Media-market matching versus a ‘random walk’ in television media selection. Journal of Advertising Research, 26(2), 37-41.

Cannon, H.M. (1987) Can national product media indices be used to improve media selection efficiency in local market areas? Journal of Adveristing, 16(4), 34-42.

Cannon, H.M. (1988) Evaluating the ‘simulation’ approach to media selection. Journal of Advertising Research, 28(1), 57-63.

Cannon, H.M. and Linda, G. (1982) Beyond media imperatives: geo-demographic media selection. Journal of Advertising Research, 22, 31-36.

Cannon, H.M. and Rashid, A. (1990/1) When do demographics help in media planning? Journal of Advertising Research, 30(6), 20-26.

Cannon, H.M. and Seamons, B.L. (1995) Simulating single source data: how it fails us just when we need it most. Journal of Advertising Research, 35(6), 53-62.

Cannon, H.M. and Yoon, S-J. (2001) A regression approach for synthesizing single-source data. Wayne State University Department of Marketing Working Paper 2001-005.

Carey, V., Zeger, S.L. and Diggle, P. (1993) Modeling multivariate binary data with alternating logistic regression. Biometrika, 80, 517-526.

Carpaneto, G., Martello, S. and Toth, P. (1988) Algorithms and codes for the assignment problem. Annals of Operations Research, 13, 193-223.

Carpenter, R.G. (1977) Matching when covariates are normally distributed. Biometrika, 64, 299-307.

10

Page 11: THE DATA FUSION BIBLIOGRAPHY

Carpenter, R. and Wilcox, S. (1995) Data fusion in the British National Readership Survey – an experiment. Worldwide Readership Research Symposium VII, Berlin.

Cassel, C.M. (1983) Statistical matching – statistical prediction. What is the difference? An evaluation of statistical matching and a special type of prediction using data from a survey on living conditions. Statistisk Tidskrift, 5, 55-63.

Causey, B., Cox, L. H., and Ernst, L. R. (1985) Application of transportation theory to statistical problems. Journal of the American Statistical Association, 80, 903-909.

Cavalier, T.M., Ignizio, J.P. and Soyster, A.L. (1989) Discriminant analysis via mathematical programming: certain problems and their causes. Computer and Operations Research, 16, 363-362.

Cavanaugh, J.E. and Oleson, J.J. (2001) A diagnostic for assessing the influence of cases on the prediction of missing data. Journal of the Royal Statistical Society, D, 50.

Cessie, S.L. and Houwelingen, J.C.V. (1994) Logistic regression for correlated binary data. Applied Statistics, 43, 95-108.

Chaddha, R.O. and Marcus, L.F. (1968) An empirical comparison of distance statistics for populations with unequal covariance matrices. Biometrics, 24, 683-694.

Chang, C. (1974) Finding prototypes for nearest neighbor classifiers. IEEE Transactions on Computers, 26, 1179-1184.

Chang, P.C. and Afifi, A. A. (1972) Classification based on dichotomous and continuous variables. Journal of the American Statistical Association, 67, 336-339.

Chao, M.T. (1994) A short review of recent survey methods for nonresponse. Journal of the Chinese Statistical Association, 32, 2, 169-177.

Chapman, C.W. (1976) A survey of nonresponse imputation procedures. Proceedings of the Social Statistics Section, American Statistical Association, 431-436.

Chapman, C.W. (1983) The impact of substitution on survey estimates. In Incompelte Data in Sample Surveys, Volume 2: Theory and Bibliographies (W.G. Madow and I. Olkin (eds.)). New York: Academic Press.

Chapman, P., Clinton, J., Kerber, R., Khabaza, T., Reinartz, T., Shearer, C. and Wirth, R. (2000) CRISP-DM 1.0. Cross-Industry Standard Process for Data Mining (http:www.crisp-dm.org).

Charnes, A. (1952) Optimality and degeneracy in linear programming. Econometrica, 20, 160-170.

Charnes, A. and Cooper, W.W. (1954) The stepping stone method of explaining linear programming calculations in transportation problems. Management Science, 1, 49-69.

Charnes, A., Cooper, W.W., Learner, D. and Snow, E.F. (1968) Note on an application of goal programming model for media planning. Management Science, 14(8), B431-436.

Charnes, A., Copper, W.W., DeVoe, J.K., Learner, D.B. and Reinecke, W. (1968) A goal programming model for media planning. Management Science, 15, B423-430.

11

Page 12: THE DATA FUSION BIBLIOGRAPHY

Charnes, A., Cooper, W.W. and Ferguson, R. (1955) Optimal estimation of executive compensation by linear programming. Management Science, 1, 138-151.

Charnes, A., Cooper, W.W., Learner, D.B. and Snow, E.F. (1968) Note on an application of goal programming model for media planning. Management Science, 14, B431-436.

Chatfield, C. (1995) Model uncertainty, data mining and statistical inference (with discussion). Journal of the Royal Statistical Society, Series A, 158, 419-466.

Chen, J. and Shao, J. (2000) Nearest neighbor imputation for survey data. Journal of Official Statistics, 16(2), 113-131.

Chen, J. and Shao, J. (2001) Jackknife variance estimation for nearest-neighbor imputation. Journal of the American Statistical Association, 96, 260-269.

Chen, R. and Liu, J.S. (1996) Predictive updating methods with application to Bayesian classification. Journal of the Royal Statistical Society B, 58, 2.

Chen, J., Rao, J.N.K. and Sitter, R.R. (2000) Efficient random imputation for missing data in complex surveys. Statistica Sinica, 10, 1153-1169.

Chernoff, H. (1973) The use of faces to represent points in k-dimensional space graphically. Journal of the American Statistical Association, 68, 361-368.

Cherkaoui, O. and Cleroux, R. (1991) Comparative study of six discriminant analysis procedures for mixtures of variables. In: Proceedings of Interface Conference 1991. San Francisco, California: Morgan Kaufmann Publishers Inc.

Cherkassky, V. and Mulier, F. (1998) Learning From Data. John Wiley & Sons: New York.

Cheung, S. and Seko, C. (1986) A study of the effects of imputation groups in the nearest neighborhood imputation method for the National Farm Survey. Survey Methodology, 12(1), 99-106.

Chiananda Gowda K. and Krishna, G. (1979) The condensed nearest neighbor rule using the concept of mutual nearest neighborhood. IEEE Transactions on Information Theory, 25, 488-490.

Chipman, H.A., George, E.I. and McCulloch, R.E. (1998) Bayesian CART model search. Journal of the American Statistical Association, 93, 935-947.

Chou, P.A. (1991) Optimal partitioning for classification and regression trees. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-13, 340-354.

Chow, W.S. and Chen, Y.C. (1992) A new fast algorithm for effective training of neural classifiers. Pattern Recognition, 25, 423-429.

Citoni, G., Di Nicola, F., Lugaresi, S. and Proto, G. (1991) Statistical matching for tax-benefit microsimulation modelling: a project for Italy. Workshop on Statistical Matching, ISPE, 26 September 1991, Rome, Italy.

Citro, C.F. and E. A. Hanushek (1991) Improving Information for Social Policy Decisions: The

Uses of Microsimulation Modeling, Volume II. Washington, DC: National Academy Press.

Clarke, W.R., Lachenbruch, P.A. and Broffitt, B. (1979) How nonnormality affects the quadratic discrimination function. Communications in Statistics – Theory & Methodology, 16, 41-46.

12

Page 13: THE DATA FUSION BIBLIOGRAPHY

Clayton, E.R. and Moore, L.J. (1972) An internactive model advertising media selection. Southern Journal of Business, 7, 37-45.

Clayton, E.R. and Moore, L.J. (1972) Journal of Systems Management, 23(11), 26-31.

Clemens, J. (1989) The marriage of BARB and G-TRACK: a data fusion case study. The case for divorce. Admap, December 1989.

Clogg, C.C., Rubin, D.B., Schenker, N., Schultz, B. and Weidman, L. (1991) Multiple imputation of industry and occupation codes in census public-use samples using Bayesian logistic regression. Journal of the American Statistical Association, 86, 68-78.

Clunies-Ross, C.W. and Riffenburgh, R.H. (1960) Geometry and linear discrimination. Biometrika, 47, 185-189.

Cochran, W.G. (1964) On the performance of the linear discriminant function. Technometrics, 6, 179-190.

Cochran, W.G. (1965) The planning of observational studies of human populations (with discussion). Journal of the Royal Statistical Society, Series A, 128, 134-155.

Cochran, W.G. (1968) The effectiveness of adjustment by subclassification in removing bias in observational studies. Biometrics, 24, 295-313.

Cochran, W.G. (1968) Commentary on estimation of error rates in discriminant analysis. Technometrics, 10, 204.

Cochran, W.G. and Bliss, C. I. (1948) Discriminant functions with covariance. Annals of Mathematical Statistics, 19, 151.

Cochran, W.G. and Hopkins, C.E. (1961) Some classification problems with multivariate qualitative data. Biometrics, 17, 10-32.

Cochran, W.G. and Rubin, D.B. (1973) Controlling bias in observational studies: a review. Sankhya, A35, 417-446.

Cohen, M.L. (1991) Statistical matching and microsimulation models. In Improving Information for Social Policy Decisions: The Uses of Microsimulation Modelling, Volume II, Technical Papers (C.F. Citro and E.A. Hanushek (eds.)). Washington, DC: National Academy Press.

Cohen M L (1991) Variance estimation of microsimulation models through sample reuse. In Improving Information for Social Policy Decisions: The Uses of Microsimulation Modelling, Volume II, Technical Papers (C.F. Citro and E.A. Hanushek (eds.)). Washington, DC: National Academy Press.

Cohen, M.P. (1996) A new approach to imputation. Proceedings of the Section on Survey Research Methods, American Statistical Association, 293-298. Washington, DC: American Statistical Association.

Coli, A. and Tartamella, F. (2000) The link between national accounts and households micro data. Paper presented at the Meeting of the Siena Group on Social Statistics, Maastricht (Netherlands), 22-25, May 2000.

13

Page 14: THE DATA FUSION BIBLIOGRAPHY

Coli, A. and Tartamella, F. (2000) A pilot social accounting matrix for Italy with a focus on households. Paper presented to the 26th General Conference of the International Association for Research in Income and Wealth, Cracow (Poland), 27 August – 2 September, 2000).

Colledge, M.J., Johnson, J.H., Pare, R. and Sande, I.G. (1978) Large scale imputation of survey data. Proceedings of the Section on Survey Research Methods, American Statistical Association, 431-436. Washington, DC: American Statistical Association. -254

Colledge, M.J., Johnson, J.H., Paré, R.M. and Sange, I.G. (1978) Large scale imputation of survey data. Survey Methodology, 4, 203-224.

Collins, J. H. and Bhatia, M. (2001) Integrating internet site audience measurement into media planning and buying – a unified method for reach/frequency. ESOMAR Worldwide Radio Conference, Athens, Greece.

Collins, J.H. and Mallett, D.T. (2002) Internet inclusive multi-media reach/frequency and optimization. ESOMAR/ARF Week of Audience Measurement. June 12, Cannes, France.

Collins, J.H., Mallett, D.T. and Mulligan Traug, J. (2002) Multi-media reach/frequency and optimization: Questions, answers and consequences for print. ESOMAR/ARF Week of Audience Measurement, Cannes, France.

Cooper, P.W. (1963) Statistical classification with quadratic forms. Biometrika, 50, 439-448.

Copas, J. R., and F. J. Hilton (1990), Record linkage: statistical models for matching computer records. Journal of the Royal Statistical Society, A, 153, 287-320.

Cornfield, J. (1967) Discriminant functions. Review of the International Statistical Institute, 35, 142-153.

Cornish, P. (1995) Confusion by fusion. Admap, February 1995 issue.

Cortes, C. and Vapnik, B. (1995) Support vector networks. Machine Learning, 20, 273-297.

Cover, T.M. (1965) Geometrical and statistical properties of systems of linear inequalities with applications to pattern recognition. IEEE Transactions on Electronic Computers, 14, 326-334.

Cover, T.M. (1968) Estimation by the nearest neighbor rule. IEEE Transactions on Information Theory, 14(1), 50-55.

Cover, T.M. (1968) Rates of convergence for nearest neighbor procedures. Proceedings of the Hawaii International Conference on System Sciences, 413-415. Honolulu: Western Periodicals.

Cover, T.M. and Hart, P.E. (1967) Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13, 21-27.

Cowling, T. (1999) New data links for better targeting and planning. Admap, 392, 28-31.

Cox, B.G. (1980) The weighted sequential hot deck imputation procedure. Proceedings of the Section on Survey Research Methods, American Statistical Association, 721-725. Washington, DC: American Statistical Association.

14

Page 15: THE DATA FUSION BIBLIOGRAPHY

Cox, B.G. (1981) An evaluation of weighted hot deck imputation for unreported health care visits. Proceedings of the Section on Survey Research Methods, American Statistical Association, 412-417. Washington, DC: American Statistical Association.

Cox, B.G. and Folson, R.E. Jr. (1978) An empirical investigation of alternative non-response adjustments. Proceedings of the Section on Survey Research Methods, American Statistical Association. Washington, DC: American Statistical Association.

Cox, D.R. (1957) Note on grouping. Jounral of the American Statistical Association, 52, 543-547.

Craig, C.S. and McCann, J.M. (1978) Item non-response in mail surveys: extent and correlates. Journal of Marketing Research, 15, 285-289.

Crask, M.R., and W.D. Perreault, Jr. (1977), Validation of discriminant analysis in marketing research. Journal of Market Research, 14 (February), 60-68.

Crellin, M. and Dodd, M. (2002) Mixed-media planning for the 21st Century. Admap, 427 (April 2002), p.37-39.

Crellin, M., Rudman, L. and Heath, M. (2002) Media planning enters the 21st Century. ESOMAR/ARF Week of Audience Measurement, Cannes, France.

Cristianini, N. and Shawe-Taylor, J. (2000) An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods. New York: Cambridge University Press.

Crivellaro, A. and Ferreira, A. (2002) Building a radio panel using fusion and recall surveys. ESOMAR/ARF Week of Audience Measurement, Cannes, France.

Cuadras, C.M. and Rodríguez-Lallena, J.A. (2002) Proceedings of the Conference on Distributions with Given Marginals and Statistical Modelling. Kluwer Academic Publishers: Dordrecht.

Cucker, F. and Smale, S. (2001) On the mathematical foundations of learning. Bulletin of the American Mathematical Society, 39, 1-49.

Cunningham, W. and Klincewicz, J. (1983) On cycling in the network simplex method. Mathematical Programming, 26, 182-189.

Currim, I.S. and Shoemaker, R.W. (1990) Is television advertising being placed to reach product users? Marketing Letters, 1(3), 209-220.

Czaia, U. (1993) Interactive fusion: step two. In Sixth Worldwide Readership Research Symposium, San Francisco, 489-493.

Czaia, U. and Speetzen, R. (1991) To fuse or not to fuse, or why make it simple when it can be complicated. In Fifth Worldwide Readership Symposium, Hong Kong.

D’Agostino, R.B. Jr. (1998) Propensity score methods for bias reduction in the comparison of a treatment group to a non-randomized control group. Statistics in Medicine, 17, 2265-2281.

D’Agostino, R.B. J. and Rubin, D.B. (2000) Estimating and using propensity scores with partially missing data. Journal of the American Statistical Association, 95, 749-759.

Dall’Aglio, G., Kotz, S. and Salinetti, G. (1991) Advances in Probability Distributions with Given Marginals (Beyond the Copulas). Dordrecht: Kluwer Academic Publishers.

15

Page 16: THE DATA FUSION BIBLIOGRAPHY

Danaher, P.J. and Rust, R.T. (1992) Linking segmentation studies. Journal of Advertising Research, 32(3), 18-23.

Dantzig, G.B. (1951) Application of the simplex method to a transportation problem. In Activity Analysis of Production and Allocation (T.C. Koopmans (ed.)). New York: John Wiley and Sons.

Dantzig, G.B. (1963) Linear Programming and Extensions. Princeton, NJ: Princeton University Press.

Dantzig, G.B. and Fulkerson, D.R. (1954) Minimizing the number of tankers to meet a fixed schedule. Naval Research Logistics Quarterly, 1, 217-222.

Dantzig, G.G., Orden, A. and Wolfe, P. (1955) The generalized simplex method for minimizing a linear form under linear inequality restraints. Pacific Journal of Mathematics, 5, 183-195.

Darkow, M. (1996) Compatible or not? Results of a single source field experiment within a TV audience research panel. Marketing and Research Today, August 1996, 150-161.

Das Gupta, S. (1964) Nonparametric classification rules. Sanhkya, 26, 25-30.

Das Gupta, S. (1964) Optimum classification rules for classification into two multivariate normal populations. Annals of Mathematical Statistics, 36, 1174-1184.

Das Gupta, S. (1968) Some aspects of discriminant function coefficients. Sankhya, A30, 387-406.

Dasarathy, B. V. (ed.) (1991) Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques. Los Alamitos, California: IEEE Computer Society.

Dasarthy, B.V. (ed.) (1994) Decision Fusion. Washington, DC: IEEE Computer Society.

David, M., Little, R.J.A., Samuhel, M.E. and Triest, R.K. (1986) Alternative methods for CPS income imputation. Journal of the American Statistical Association, 81, 29-41.

Davies, E. R. (1988) Training sets and a priori probabilities with the nearest neighbor method of pattern recognition. Pattern Recognition Letters, 8, 11-13.

Davies, M.G. (1970) The performance of the linear discriminant function in two variables. British Journal of Statistical Psychology, 23, 165-176.

Davis, R.H., Edelman, D.B. and Gammerman, A.J. (1992) Machine-learning algorithms for credit-card applications. IMA Journal of Mathematical Applications in Business and Industry, 4, 43-52.

Dawid, A.P. (1976) Properties of diagnostic data distributions. Biometrics, 32, 647-658.

Dawid, A.P. (1979) Conditional independence in statistical theory (with discussion). Journal of the Royal Statistical Society, B41,1-31.

Dawid, A.P. (1980) Conditional independence for statistical operations Annals of Statistics, 8, 598-617.

Day, N.E. and Kerridge, D.F. (1967) A general maximum likelihood discriminant. Biometrics, 23, 313-324.

16

Page 17: THE DATA FUSION BIBLIOGRAPHY

Day, R.L. (1962). Linear programming in media selection. Journal of Advertising Research, 2, 40-44.

DeGroot, M. H. (1980) Optimal matching: a survey. Symposia Mathematica, XXV. Instituto Nazionale de Alta Matematica, Francesco Saveri, Roma. Academic Press: New York.

DeGroot, M.H., Feder, P.I. and Goel, P.K. (1971) Matchmaking. Annals of Mathematical Statistics, 42, 578-593.

DeGroot, M.H. and Goel, P. (1976) The matching problem for multivariate normal data. Sankyā, 38 (Series B, Part 1), 14-28.

Dehejia, R.H. and Wahba, S. (1998) Propensity score matching methods for non-experimental causal studies. Working Paper 6829, National Bureau of Economic Research (http://www.nber.og)

Dehejia, R.H. and Wahba, S. (1999) Causal effects in nonexperimental studies: reevaluating the evaluation training programs. Journal of the American Statistical Association, 94, 1053-1062.

De Kluyer, C.A. (1978) Hard and soft constraints in media scheduling. Journal of Advertising Research, 18, 273-278.

De Kluyer, C.A. (1979) An exploration of various goal programming formulations – with applications to advertising media selection. Journal of the Operational Research Society, 30, 167-171.

Dell’Amico, M., Maffioli, F. and Martello, S. (eds) (1997) Annotated Biolographies in Combinatorial Optimization. Chichester: Wiley.

DeLong, E.R., DeLong, D. and Clarke-Pearson, D.L. (1988) Comparing the area under two or more correlated receiver operating characteristic curves: a nonparamteric approach. Biometrics, 44, 837-845.

Deming, W. E., and G. J. Gleser (1959) On the problem of matching lists by samples. Journal of the American Statistical Association, 54, 403-415.

de Montigny, M. and Lima, A.L. (2001) Brazil fusion and multi-media duplication. 10th Worldwide Readership Research Symposium, Venice, 555-558.

Dempster, A.P. (1964) Tests for the equality of two covariance matrices in relation to a best linear discrminator analysis. Annals of Mathematical Statistics, 35, 190-199.

Dempster, A.P., Laird, N.M. and Rubin, D.B. (1977) Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of Royal Statistical Society, B39, 1-38.

Denison, D.G.T., Mallick, B.K. and Smith, A.F.M. (1988) A Bayesian CART algorithm. Biometrika, 85, 363-377.

Denk, M. and Hackl, P. (2003) Data integration and record matching: an Austrian contribution to research in official statistics. Austrian Journal of Statistics, 32, 305-321.

Derigs, U. (1988) Solving non-bipartite matching problems via short path techniques. Annals of Operations Research, 13, 225-261.

Devijver, P. (1978) A note on ties in voting with the k-NN rule. Pattern Recognition, 10, 297-298.

17

Page 18: THE DATA FUSION BIBLIOGRAPHY

Devroye, L. (1981) On the inequality of Cover and Hart in nearest neighbor discrimination. IEEE Transactions on Pattern Analysis and Machine Learning, PAMI-3, 75-58.

Devroye, L. (1987) Automatic selection of a discriminant rule based upon minimization of the empirical risk. In Pattern Recognition Theory and Application (edited by P.A. Devijver and J. Kittler). Berlin: Springer-Verlag.

Devroye, L., Györfi, L. and Lugosi, G. (1996) A Probabilistic Theory of Pattern Recognition. New York: Springer-Verlag New York.

Devroye, L.P. and Wagner, T.J. (1982) Nearest neighbor methods in discrimination. In Krishnaiah, P.R. and Kanal, L.N. (eds) Handbook of Statistics Volume 2, 193-197. Amsterdam: North-Holland.

Dietterich, T.G. (1995) Overfitting and undercomputing in machine learning. Computing Surveys, 27(3), 326-327.

Dietterich, T.G. (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Computation, 10(7), 1895-1924.

Dietterich, T.G. (2000) An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Machine Learning, 40(2):139-158.

Dillon, W.R. (1979) The performance of the linear discriminant function in non-optimal situations and the estimation of classification error rates: A review of recent findings. Journal of Marketing Research, 16, 370-391.

Dillon, W.R., and M. Goldstein (1978) On the performance of some multinomial classification rules. Journal of the American Statistical Association, 73, 305-313.

Dillon, W.R., Goldstein, M. and Schiffman, L. (1978) On the appropriateness of linear discriminant analysis and multinomial classification analysis in marketing research. Journal of Marketing Research, 15, 103-112.

Dillon, W.R., and L. Schiffman (1978) Appropriateness of linear discriminant and multinomial classification analysis in marketing research. Journal of Marketing Research, 15, 103-112.

Dodd, L.E. and Sullivan Pepe, M. (2003) Semiparametric regression for the area under the receiver operating characteristic curve. Journal of the American Statistical Association, 98, 409-417 .

Dodson, D. (2003) Fusing data effectively. 6th Annual One Day Media Research Seminar, London, June 11th, 2003.

Doe, P. (2000) Data fusion use in Europe and the UK. Will Fusion Come to America Workshop, Advertising Research Foundation, December 2000, New York City.

Domeniconi, P. and Gunopulos, D. (2002) Efficient local flexible nearest neighborhood discrimination. Proceedings of the Second International Conference on Data Mining. SIAM: Philadelphia, PA.

Donato, P. (2001) Towards data fusion. Speech at Maximizing Media, Toronto, Canada, September 21, 2001.

18

Page 19: THE DATA FUSION BIBLIOGRAPHY

Donato, P.D. (2001) Fusion and other futures. Canadian Advertising Research Foundation Newsletter, 250, September 2001.

D’Orazio, M., Di Zio, M. and Scanu, M. (2001) Statistical matching: a tool for integrating data in National Statistical Institutes. Second International Seminar of Exchange of Technology and Know-How/Fourth New Techniques and Technologies for Statistics Seminar. Crete.

D’Orazio, M., Di Zio, M. And Scanu, M. (2002) Statistical matching and official statistics. Rivista di Statistica Ufficiale, 2002/1, 5-24.

D’Orazio, M., Di Zio, M. and Scanu, M. (2006) Statistical Matching: Theory and Practice. John Wiley & Songs: New York, NY.

Dorinski, S.M. and Griffin, R. (1997) Accounting for variance due to imputation in the integrated coverage measurement survey. Proceedings of the Survey Research Methods Section, American Statistical Association, 748-753. Washington, DC: American Statistical Association.

Dow, H.F. (1989) Data fusion – the Canadian experiment. In International Symposium on Television Research, Tarrytown, New York.

Dow, H.F. (1989) Data fusion – the Canadian experiment. Canadian Journal of Marketing Research, 9.

Doyle, P. (1973) The use of automatic interaction detector and similar search procedures. Operational Research Quarterly, 24, 465-467.

Drucker, H. and Cortes, C. (1996) Boosting decision trees. Advances in Neural Information Processing Systems 8, 479-485.

Drucker, H., Schapire, R. and Simard, P. (1993) Boosting performance in neural networks. International Journal in Pattern Recognition and Artificial Intelligence, 7(4), 705-719.

Duan, N. and Li, K.C. (1991) Slicing regression: a link-free regression method. Annals of Statistics, 19, 505-530.

DuBois, N.S., D’Andre. (1969) A solution to the problem of linking multivariate documents. Journal of the American Statistical Association, 69, 163-174.

Duda, R.O. and Fossen, H. (1966) Pattern classification by iteratively determined linear and piecewise linear discriminant functions. IEEE Transactions on Electronic Computers, EC-15, 220-223.

Duda, R.O. and Hart, P.E. (1973) Pattern Classification and Scene Analysis. New York: Wiley.

Duda, R.O., Hart, P.E. and Stork, D.G. (2001) Pattern Classification. New York: John Wiley & Sons.

Dudani, S.A. (1976) The distance-weighted k-nearest-neighbor rule. IEEE Transactions on Systems, Man, and Cybernetics, SMC-6(4), 325-327.

Duin, R.P.W. (1994) Superlearning and neural network magic. Pattern Recognition Letters, 15, 215-217.

Duin, R.P.W. (1996) A note on comparing classifiers. Pattern Recognition Letters, 17(5), 529-536.

19

Page 20: THE DATA FUSION BIBLIOGRAPHY

Dunn, O.J. (1971) Some expected values for probabilities of correct classification in discriminant analysis. Technometrics, 13, 345.

Dunn, O.J. and Varady, P.V. (1966) Probabilities of correct classification in discriminant analysis. Biometrics, 22, 908-924.

Dunsmore, I.R. (1968) A Bayesian approach to classification. Journal of the Royal Statistical Society, B30, 396-405.

Dybowski, R. and Weller, P.R. (2001) Prediction regions for the visualization of incomplete datasets. Computational Statistics, 16, 25-41.

Efron, B. (1975) The efficiency of logistic regression compared to normal discriminant analysis. Journal of the American Statistical Association, 70, 892-898.

Efron, B. (1982) The Jackknife, the Bootstrap and other Resampling Plans. Philadelphia, Pennsylvania: SIAM.

Efron, B. (1983) Estimating the error rate of a prediction rule: improvements on cross-validation. Journal of the American Statistical Association, 78, 316-330.

Efron, B. (1986) How biased is the apparent error rate of a prediction rule? Journal of the American Statistical Association, 81, 461-470.

Efron, B. (1994) Missing data, imputation, and the bootstrap. Journal of the American Statistical Association, 89, 463-479.

Efron, B. and Gong, G. (1983) A leisurely look at the bootstrap, the jackknife, and cross-validation. American Statistician, 37, 46-48.

Eisemann, K. (1964) Generalized stepping stone method for the machine loading problem. Management Science, 11.

Ejigou, A. (1996) Power and sample size for matched case-control studies. Biometrics, 52, 925-933.

Elashoff, J.D., Elashoff, R.M. and Goldman, G.E. (1967) On the choice of variables in classification problems with dichotomous variables. Biometrika, 54, 668.

Ellis, D.M. (1965). Building up a sequence of optimum media schedules. Operational Research Quarterly, 16, 413-24.

Ellison, B.E. (1965) Multivariate normal classification with covariance known. Annals of Mathematical Statistics, 36, 1787-1793.

Elms, S. (1997) Linking sales tracking data to television viewing. Admap, April issue.

Elmaghraby, S. (1957) A note on production scheduling by the use of the transportation method. Operations Research, 5, 565-566.

Elton, R.A. and Duffy, S.W. 1983) Correcting for the effect of misclassification bias in a case-control study using data from two different questionnaires. Biometrics, 39, 659-665.

Engel, J.F. and Warshaw, M. (1964) Allocating advertising dollars by linear programming. Journal of Advertizing Research, 5, 42-48.

20

Page 21: THE DATA FUSION BIBLIOGRAPHY

Epanechnikov, V.A. (1969) Non-parametric estimation of a multivariate probability density. Theory of Probability and Its Applications, 14, 153-158.

Ephron, E. (2002) The promise of fusion. Admap, 434 (December 2002), 42-44.

Ephron, E. and Peacock, J. (1999) MRI and media-mix. ARF Television Research Workshop, October 28, 1999. New York: Advertising Research Foundation.

Ernst, L.F. (1980) Variance of the estimated mean for several imputation procedures. Proceedings of the Section on Survey Research Methods, American Statistical Association, 716-720. Washington, DC: American Statistical Association.

Ernst, L.R. and Ikeda, M.M. (1995) A reduced-size transportation algorithm for maximizing the overlap between surveys. Survey Methodology, 21, 147-157.

Ettlinger, M.P. and O’Hare, J.F. (1996) Revenue and incidence analysis of state and local tax systems: tools, techniques and tradition. 89th Annual Conference on Taxation. National Tax Association.

Evans, M. (1998) From 1086 and 1984: direct marketing into the millennium. Marketing Intelligence and Planning, 16(1), 56-67.

Ezzati-Rice, T.M., Johnson, W., Khare, M., Little, R.J.A., Rubin, D.B. and Schafer, J.L. (1995) A simulation study to evaluate the performance of model-based multiple imputations in NCHS health examination surveys. In Proceedings of the Annual Research Conference, p. 257-266. Bureau of the Census: Washington, D.C.

Fairley, W.B., Izenman, A. and Crunk, S.M. (2001) Combining incomplete information from independent assessment surveys for estimating masonry deterioration. Journal of the American Statistical Association, 96, 488-499.

Falkenauer, E. (1998) Genetic Algorithms and Grouping Problems. New York: John Wiley and Sons Inc.

Farina, M.S. (1991) New experiences on merging in Italy. In Fifth Worldwide Readership Research Symposium, Hong Kong.

Farina, M.S. (2003) Back to the future: Disproportional sample ascription. Worldwide Readership Research Symposium, Cambridge, MA, USA.

Faris, P.D., Ghali, W.A., Brant, R., Norris, C.M., Galbraith, P.D. and Knudtson, M.L. (2002) Multiple imputation versus data enhancement for dealing with missing data in observational health care outcome analyses. Journal of Clinical Epidemiology, 55, 184-191.

Fay, R.E., (1992) When are inferences from multiple imputation valid? Proceedings of the Section on Survey Research Methods, American Statistical Association, 227-232. Washington, DC: American Statistical Association.

Fay, F.E. (1993) Valid inferences from imputed survey data. Proceedings of the Survey Research Methods, American Statistical Association, 41-48. Washington, DC: American Statistical Association.

Fay, R. (1996) Alternative paradigms for the analysis of imputed survey data. Journal of the American Statistical Association, 91, 490-498.

21

Page 22: THE DATA FUSION BIBLIOGRAPHY

Fay, R. (1999) Theory and application of nearest neighbor imputation in Census 2000. Proceedings of the Section on Survey Research Methods, American Statistical Association. Washington, DC: American Statistical Association.

Fellegi, I. P., and A. B. Sunter (1969) A theory for record linkage. Journal of the American Statistical Association, 64, 1183-1210.

Fellegi, I.P. and Holt, D. (1976) A systematic approach to automatic edit and imputation. Journal of the American Statistical Association, 71, 17-35.

Fend, C. and Michie, D. (1994) Machine learning of rules and trees. In Machine Learning, Neural and Statistical Classification (edited by D. Michie, D.J. Spiegelhalter, and C. C. Taylor). Chichester, UK: Ellis Horwood.

Ferrari, F. and Crivellaro, A. (2003) Building a panel from a recent reading databse through fusion. Worldwide Readership Research Symposium, Cambridge, MA, USA.

Fiering, M.B. (1962) On the use of correlation to augment data. Journal of the American Statistical Association, 57, 20-32.

Fisher, L., and J.W. Van Ness (1973) Admissible discriminant analysis. Journal of the American Statistical Association, 68, 603-607.

Fisher, N., Derquenne, C. and Saporta, S. (2001) A new method to match data set applied to the electric market. Fourth New Techniques and Technologies for Statistics Seminar- Fourth Annual Exchange of Technology and Knowhow Seminar. June 18-21, Crete.

Fisher, R.A. (1936) The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7, 179-188.

Fisher, R.A. (1940) The precision of discriminant functions. Annals of Eugenics, 10, 422-429.

Fitzmaurice, G.M., Laird, N.M. and Zahner, G.E.P. (1996) Multivariate logistic models for incomplete binary responses. Journal of the American Statistical Association, 91, 99-108.

Fitzsimmons, M., Khabaza, T. and Shearer, C. (1993) The application of rule induction and neural networks for television audience prediction. Proceedings of the ESOMAR/EMAC/AFM Symposium on Information-based Decision Making in Marketing, 69-82.

Fix, E. and Hodges, J.L. (1951) Discriminatory analysis: nonparametric discrimination: consistency properties. USAF School of Aviation Medicine, Project 21-49-004, Report Number 4, pages 261-279.

Fix, E. and Hodges, J.L. (1952) Discriminatory analysis: nonparametric discrimination: small sample performance. USAF School of Aviation Medicine, Project 21-49-004, Report Number 11, pages 280-322.

Fogarty, T.C. and Ireson, N.S. (1994) Evolving Bayesian classifiers for credit control – a comparison with other machine learning methods. IMA Journal of Mathematical Applications in Business and Industry, 5, 63-75.

Ford, B.L. (1983) An overview of hot-deck procedures. In Incomplete Data in Sample Surveys, Volume 2, Theory and Bibliographies. W.G. Madow, I. Olkin and D.B. Rubin (eds.). New York: Academic Press.

22

Page 23: THE DATA FUSION BIBLIOGRAPHY

Ford, L.R. and Fulkerson, D.R. (1956) Solving the transportation problem. Management Science, 3, 24.

Ford, L.R. Jr. and Fulkerson, D.R. (1957) A simple algorithm for finding maximal network flows and an application to the Hitchcock problem. Canadian Journal of Mathematics, 9, 210-218.

Fraker, T. and Maynard, R. (1987) The adequacy of comparison group designs for evaluations of employment-related programs. The Journal of Human Resources, 22, 194–297.

Frank, I.E. and Friedman, J.H. (1989) Classification: oldtimers and newcomers. Journal of Chemometrics, 3, 463-475.

Frank, R.E., W.F. Brassy, and D.G. Morrison (1965) Bias in multiple discriminant analysis. Journal of Marketing Research, 2 (August), 250-258.

Frankel, M. R. (1981) Ascription in magazine audience research. Readership Research: Theory and Practice. Proceedings of the First International Symposium. H. Henry (ed.). New Orleans.

Frankel, M.R., Baim, J., Agresti, J. and Galin, M. (2001) Toward a mathematical theory of cross-survey inference. 10th Worldwide Readership Research Symposium, Venice, 69-86.

Frankel, M.R. and Baxter, P. (1988) Fusion, integration, ascription and imputation. Readership Research: Theory and Practice. Proceedings of the Fourth International Symposium. H. Henry (ed.). Barcelona.

Franz, G. (2000) The future of multimedia research. International Journal of Market Research, 42, 459-472.

Freed, N. and Glover, F. (1981) A linear programming approach to the discriminant problem. Decision Sciences, 12, 68-74.

Freed, N. and Glover, F. (1986) Evaluating alternative linear programming models to solve the two-group discriminant problem. Decision Sciences, 17, 151-162.

Freed, N. and Glover, F. (1986) Resolving certain difficulties and improving the classification power of LP discriminant analysis formulations. Decision Sciences, 17, 589-595.

Freed, N. and Glover, F. (1971) Simple but powerful goal programming models for discriminant problems. European Journal of Operational Research, 7, 44-60.

Freedman, V.A. and Wolf, D.A. (1995) A case study on the use of multiple imputation. Demography, 32, 459-70.

Frees, E.W. and Valdez, E.A. (1998) Understandnig relationships using copulas. North American Actuarial Journal, 2, 1-25.

Freund, Y. (1995) Boosting a weak learning algorithm by majority. Information and Control, 121, 256-285.

Freund, Y. and Schapire, R.E. (1995) A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), 119-139.

Freund, Y. and Schapire, R.E. (1996) Experiments with a new boosting algorithm. In Machine Learning: Proceedings of the Thirteenth International Conference, (Siatta, L. (ed.)),145-156. San Francisco, California: Morgan Kaufmann Publishers Inc.

23

Page 24: THE DATA FUSION BIBLIOGRAPHY

Friedlander, D. and Robins, R. (1995) Evaluating program evaluations: new evidence on commonly used nonexperimental methods. American Economic Review, 85, 923-937.

Friedman, J.H. (1977) A recursive partitioning decision rule for nonparametric classification. IEEE Transactions on Computers, 26, 404-408.

Friedman, J.H. (1989) Regularized discriminant analysis. Journal of the American Statistical Association, 84, 165-175.

Friedman, J. H. (1994) Flexible metric nearest neighbor classification. Technical Report 113, Stanford University Statistics Department. Palo Alto: Stanford University.

Friedman, J.H. (1997) On bias, variance, 0/1-loss and the curse of dimensionality. Data Mining and Knowledge Discovery, 1(1), 55-77.

Friedman, J.H., Baskett, F. and Shustek, L.J. (1975) An algorithm for finding nearest neighbors. IEEE Transactions on Computers, 24, 1000-1006.

Friedman, J.H., Bentley, J.L. and Finkel, R.A. (1977) An algorithm for finding the best matches in logarithmic expect time. ACM Transactions on Mathematical Software, 3, 209-226.

Friedman, J., Hastie, T. and Tibshirani, R. (2000) Additive logistic regression: a statistical view of boosting (with discussion). Annals of Statistics, 28(2), 374-407.

Fukunaga, K. (1990) Introduction to Statistical Pattern Recognition. Boston: Academic Press.

Fukunaga, K. and Flick, T.E. (1984) An optimal global nearest neighbor metric. IEEE Transactions on Pattern Recognition and Machine Intelligence, 6, 314-318.

Fukunaga, K. and Hayes, R.R. (1989) Effects of sample size in classifier design. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11, 873-885.

Fukunaga, K. and Hummels, D.M. (1987) Bias of nearest neighbor error estimates. IEEE Transactions on Pattern Analysis and Machine Intelligence, 9, 103-112.

Fukunaga, K. and Kessell, D. (1981) Estimation of classification error. IEEE Transactions on Computers, C-20, 1521-1527.

Fukunaga, K. and Narendra, P.M. (1975) A branch and bound algorithm for computing k-nearest neighbor. IEEE Transactions on Computers, 24, 750-753.

Gäbler, S. (1997) Datenfusion. ZUMA-Nachrichten, 40, 81-92.

Ganesalingam, S. and MacLachlan, G. J. (1980) Error rate estimation on the basis of posterior probabilities. Pattern Recognition, 12, 405-413.

Garfinkel, N. (1963) A marketing approach to media selection. Journal of Advertising Research, 3(3), 7-15.

Garfinkel, R. S., Kunnathur, A. S. and Liepins, G. E., (1986) Optimal imputation of erroneous data: categorical data, general edits. Operations Research, 34, 744-751.

Garland, I. (2002) Measuring everything — first results from a convergence panel. Staying Tuned 2002. February 11, Toronto, Canada.

24

Page 25: THE DATA FUSION BIBLIOGRAPHY

Garland, I. (2002) Convergence: Understanding cross-media usage patterns. ESOMAR/ARF Week of Audience Measurement, Cannes, France.

Garnett, J.M. and Yau, S.S. (1977) Nonparametric estimation of the Bayes error of feature extractors using ordered nearest neighbor sets. IEEE Transactions on Computers, 26, 46-54.

Garvish B., Schweitzer P., and Shlifer, E. (1977) The zero pivot phenomenon in transportation and assignment problems and its computational implications. Mathematical Programming, 13, 226-240

Gass, S.I. (1986) A process for determining priorities and weights for large-scale goal programmes. Journal of the Operational Research Society, 37, 779-785.

Gass, S.I. (1987) The setting of weights in linear goal programming problems. Computers and Operations Research, 14, 227-230.

Gass, S. (1990) On solving the transportation problem. Journal of the Operational Research Society, 41, 291-297.

Gassner, B.J. (1964) Cycling in the transportation problem. Naval Research Logistics Quarterly, 11, 43--58.

Gastwirth, J.L., Krieger, A.M. and Rosenbaum, P.R. (1998) Dual and simultaneous sensitivity analysis for matched pairs. Biometrika, 85, 907-920.

Gates, G.W. (1972) The reduced nearest neighbor rule. IEEE Transactions on Information Theory, IT-18, 431.

Gavin, N.I. (1984) An application of statstical matching with the Survey of Income and Education and the 1976 Health Interview Survey. Health Sciences Research, 20(2), 183-198.

Gelman, A. and Speed, T.P. (1993) Characterizing a joint probability distribution by conditions. Journal of the Royal Statistical Society, Series B, 55, 185-188.

Gessaman, M.P. and Gessaman, P.H. (1972) A comparison of some multivariate discrimination procedures. Journal of the American Statistical Association, 67, 468-472.

Geisser, S. (1966) Predictive discrimination. In Multivariate Analysis (P.R. Krishnaiah (ed.)), 149-163. New York: Academic Press.

Geisser, S. (1967) Estimation associated with linear discriminants. Annals of Mathematical Statistics, 38, 807-817.

Geisser, S. (1975) The predictive sample reuse method for application. Journal of the American Statistical Association, 70, 320-328.

Gelman, A., King, G., and Liu, C. (1998) Not asked and not answered: multiple imputation for multiple surveys (with discussion and rejoinder). Journal of the American Statistical Association, 93, 846-874.

Gelman, A. and Rubin, D.B. (1992) Inference from iterative simulation using multiple sequences (with discussion). Statistical Science, 7, 457-472.

Geman, S., Bienenstock, E. and Doursat, R. (1992) Neural networks and the bias/variance dilemma. Neural Networks, 4, 1-58.

25

Page 26: THE DATA FUSION BIBLIOGRAPHY

Genscher, D. H. (1969) A computer simulation model for selecting advertising schedules. Journal of Marketing Research, 7(2), 203-214.

Gerzon, K. (1972) A modified stepping-stone algorithm for the transportation problem. Mathematische Operationsforschung und Statistik, 3, 327-331.

Gerzon, K. (1978) A szállítási problémára alkalmazott szimplex módszer ciklizálásának a lehetõségérõl (On the possibility of cycling in the simplex algorithm applied to the transportation problem). Alkalmazott Matematikai Lapok, 4, 193-196.

Gesssaman, M. and Gessaman, P. (1970) A comparison of some multivariate discrimination procedures. Journal of the American Statistical Association, 67, 468-472.

Gessaman, M.P. and Gessaman, P.H. (1972) A comparison of some multivariate discrimination procedures. Journal of the American Statistical Association, 67, 468-472.

Geva, S. and Sitte, J. (1991) Adaptive nearest neighbor pattern classification. IEEE Transasctions on Neural Networks, 2, 318-322.

Gibbs, T.E. (1973) Goal programming. Journal of Systems Management, 24(5), 38-41.

Gilbert, E.S. (1968) On discrimination using qualitative variables. Journal of the American Statistical Association, 63, 1399.

Gilbert, E.S. (1969) The effect of unequal variance-covariance matrices on Fisher’s linear discriminant function. Biometrics, 25, 505-516.

Gill, P., Murray, W., Saunders, M. and Wright, M. (1989) A practical anti-cycling procedure for linear constrained optimization. Mathematical Programming, 45, 437-474.

Glick, N. (1972) Sample-based classification procedures derived from density estimators. Journal of the American Statistical Association, 67,116-122.

Glick, N. (1972) Sample-based multinomial classification. Biometrics, 29, 241-256.

Glick, N. (1976) Sample-based classification procedures related to empiric distributions. IEEE Transactions on Information Theory, IT-22, 454-461.

Glick. N. (1978) Additive estimators for probabilities of correct classification. Pattern Recognition, 10, 211-222.

Glonek, G.F.V. and McCullagh, P. (1995) Multivariate logistic models. Journal of the Royal Statistical Society, B57, 533-546.

Glover, F. (1986) Future paths for integer programming and links to artificial intelligence. Computers and Operations Research, 13, 533-549.

Glymour, C., Madigan, D., Pregibon, D. and Singh, P. (1997) Statistical themes and lessons for data mining. Data Mining and Knowledge Discovery, 1(1), 11-28.

Goel, P.K. (1975) On re-pairing observations in a broken random sample Annals of Statistics, 3, 1364-1369.

Goel, P.K. and Ramalingam, T. (1987) Some properties of the maximum likelihood strategy for re-pairing a broken random sample. Journal of Statistical Planning and Inference, 16, 237-248.

26

Page 27: THE DATA FUSION BIBLIOGRAPHY

Goel, P.K. and Ramalingam, T. (1989) The matching methodology: some statistical properties. Lecture Notes in Statistics, Volume 52. Springer-Verlag New York.

Goldberg, D. E. (1989) Genetic Algorithms in Search, Optimization and Machine Learning. Reading, Massachusetts: Addison-Wesley Publishing Company.

Goldberg, D.E. and Smith, R.E. (1986, October) AI meets OR: Blind inferential search with genetic algorithms. Paper presented at the ORSA/TIMS Joint National Meeting, Miami, FL.

Goldfarb, D. (1985) Efficient simplex algorithms for the assignment problem. Mathematical Programming, 33, 187-203.

Goldstein, J.R. and Watchter, K.W. (1999) A statistical matching method for constructing kinship censuses with an application to U.S. kinship forecasts. In Population Association of America Annual Meeting, New York, NY.

Goldstein, M. (1972) k-nearest neighbor classification. IEEE Transactions on Information Theory, IT-18, 627-630.

Goldstein, M. (1975) Comparison of some density estimate classification procedures. Journal of the American Statistical Association, 70, 666-669.

Goldstein, M. (1976) An approximate test for comparative discriminatory power. Multivariate Behavioral Research, 11, 157-163.

Goldstein, M. (1977) A two-group classification procedure for multivariate dichotomous responses. Multivariate Behavioral Research, 12, 335-346.

Goldstein, M. and Dillon, W.R. (1977) A stepwise discrete variable selection procedure. Communications in Statistics: Theory and Methods, 6, 1423-1436.

Goldstein, M. and Dillon, W.R. (1978) Discrete Discriminant Analysis. New York: Wiley Interscience.

Goldstein, M. and Rabinowitz, T. (1975) Selection of variates for the two group classification problem. Journal of the American Statistical Association, 70, 776-781.

Goldstein, M. and E. Wolf (1977) On the problem of bias in multinomial classification. Biometrics, 33, 325-331.

Good, I.J. (1965) The Estimation of Probabilities. Cambridge, MA: MIT Press.

Good, P. (2000) Permutation Tests. New York: Springer Verlag New York.

Goodman, L. (1953) Ecological regression and the behavior of individuals. American Sociological Review, 18, 663-664.

Goodman, L. (1959) Some alternatives to ecological correlation. American Sociological Review, 64, 610-625.

Goodman, M.M. (1972) Distance analysis in biology. Systematic Zoology, 21(2), 174-186.

Gottlieb, J. and Paulmann, L. (1998) Genetic Algorithms for the Fixed Charge Transportation Problem. Proceedings of the 5th IEEE International Conference on Evolutionary Computation, Anchorage, Alaska, 330 – 335. Washington, DC: IEEE.

27

Page 28: THE DATA FUSION BIBLIOGRAPHY

Gowda, K. and Krishna, G. (1979) The condensed nearest neighbor rule using the concept of mutual nearest neighborhood. IEEE Transactions on Information Theory, 25, 488-490.

Gower, J.C. (1974) Maximal predictive classification. Biometrics, 30, 643-654.

Graham, J.W. and Schafer, J.L. (1999) On the performance of multiple imputation for multivariate data with small sample size. In Hoyle, R. (ed.), Statistical Strategies for Small Sample Research. Sage: Thousand Oaks, CA.

Green, A. (1986) Media research: what the agencies want. Admap, 253, 316-319.

Green, A. (2001) Multi-media optimisers: the new couture? Admap, March 2001.

Green, A. (2001) Multi-media optimizers: the new couture? 10th Worldwide Readership Research Symposium, Venice, 583-587.

Green, A. (2002) Multimedia optimizers: another fad or a new way of working? Staying Tuned 2002. February 11, Toronto, Canada.

Greenlees, J.S., Reece, W.S. and Zieschang, K.D. (1982) Imputation of missing values when the probability of response depends on the variable being imputed. Journal of the American Statistical Association, 77, 251-261.

Grefensette, J.J., Gopal, R., Rosmaita, B.J. and Van Gucht, D. (1985) Genetic algorithms for the traveling salesman problem. Proceedings of an International Conference on Genetic Algorithms and Their Applications, 160-168.

Griner, P.F., Mayewski, R.J., Mushlin, A.I. and Greenland, P. (1981) Selection and interpretation of diagnostic tests and procedures. Annals of Internal Medicine, 94, 555-600.

Grinold, R.C. (1969) Comment on ‘Pattern classification design by linear programming.’ IEEE Transactions on Computers, 18, 378-379.

Groves, R.M., Dillman, D.A., Eltinge, J.L. and Little, R.J.A. (2001) Survey nonresponse. New York: John Wiley & Sons Inc.

Gu, X.S. and Rosenbaum, P.R. (1993) Comparison of multivariate matching methods: structures, distances and algorithms. Journal of Computational and Grahpical Statistics, 2, 405-420.

Gugel, C. and Idel, C. (2002) The ROI ratio: assessing individual media impact on cross media advertising campaigns. ESOMAR/ARF Week of Audience Measurement, Cannes, France.

Gullen, P. and Johnson, H. (1986) Product purchasing and TV viewing: measuring and relating the two. In New Developments in Media Research, ESOMAR, Amsterdam, 345-363.

Guyon, I., Makhoul, J., Schwartz, R. and Vapnik, V. (1998) What size test set gives good error rate estimates? IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-20, 52-64.

Habbema, J.D.F., Hermans, J. and van der Burgt, A.T. (1974) Cases of doubt in allocation problems. Biometrika, 61, 313-324.

Hahn, J. (1998) On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica, 66, 315-331.

28

Page 29: THE DATA FUSION BIBLIOGRAPHY

Hall, P. (1981) On nonparametric multivariate binary discrimintation. Biometrika, 68, 287-294.

Hall, P. and Wand, M.P. (1988) On nonparametric discrimination using density differences. Biometrika, 75, 541-547.

Han, J. and Kamber, M. (2001) Data Mining: Concepts and Techniques. San Francisco, CA: Morgan Kaufmann Publishers.

Hand, D.J. (1981) Discrimination and Classification. Chicester: John Wiley & Sons.

Hand, D.J. (1981) Branch and bound in statistical data analysis. Statistician, 30, 1-13.

Hand, D.J. (1982) Kernel discriminant analysis. Chichester: Research Studies Press.

Hand, D.J. (1983) A comparison of two methods of discriminant analysis applied to binary data. Biometrics, 39, 683-694.

Hand, D.J. (1986) Recent advances in error rate estimation. Pattern Recognition Letters, 4, 335-346.

Hand, D.J. (1987) A shrunken leaving-one-out estimator of error rate. Computers and Mathematics with Applications, 14, 161-167.

Hand, D.J. (1994) Assessing classification rules. Journal of Applied Statistics, 21, 3-16.

Hand, D.J. (1995) Comparing allocation rules. Statistics in Transition, 2, 137-150.

Hand, D.J. (1997) Construction and Assessment of Classification Rules. New York: John Wiley & Sons.

Hand, D.J. (1998) Data mining: statistics and more? American Statistician, 52,112-118.

Hand, D.J. and Batchelor, B.G. (1978) An edited condensed nearest neighbor rule. Information Sciences, 14, 171-180.

Hand, D.J., Blunt, G., Kelly, M.G. and Adams, N.M. (2000) Data mining for fun and profit. Statistical Science, 15, 111-131.

Hand, D.J. and Henley, W. E. (1994) Inference about rejected cases in discriminant analysis. In New Approaches in Classification and Data Analysis (edited by E. Diday et al.). Berlin: Springer-Verlag.

Hand, D.J. and Henley, W.E. (1996) Statistical classification methods in consumer credit scoring: a review. Journal of the Royal Statistical Society, A160, 522-41. 

Hand, D.J., Mannila, H. and Smyth, P. (1999) Principles of Data Mining. Cambridge, MA: MIT Press.

Hanley, J.A. and McNeil, B.J. (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143, 29-36.

Hanley, J.A. and McNeil, B.J. (1983) A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology, 148, 839-843.

29

Page 30: THE DATA FUSION BIBLIOGRAPHY

Hannan, E.L. (1985) An assessment of some of the criticisms of goal programming. Computers and Operations Research, 12, 525-541.

Hansen, L. and Andersen, H.T. (1992) The Danish Multi-Media Index – a solution for multi-media planning. In ESOMAR Seminar on Media Research Meets the Future: How New Challenges Promote New Media Uses and Media Research, Lisbon, Portugal.

Hansen, M.H. and Hurwitz, W.N. (1946) The problem of nonresponse in surveys. Journal of the American Statistical Association, 41, 517-529.

Hansen, M.H., Hurwitz, W.N. and Madow, W.G. (1953) Sample Survey Methods and Theory, Volumes 1 and 2. New York: Wiley.

Harris, P. and Baker, K. (1998) Data fusion. Admap, June 1998.

Hart, P.E. (1968) The condensed nearest neighbor rule. IEEE Transactions on Information Theory, IT-14, 515-516.

Hartigan, J. (1975) Clustering Algorithms. New York: Wiley.

Harville, D. and Moore, R. (1999) Determining record linkage parameters using an iterative logistic regression approach. Proceedings of the Section on Survey Research Methods, American Statistical Association. Washington, DC: American Statistical Association.

Hasselbald, V. and Hedges, L.V. (1995) Meta-analysis of screening and diagnostic tests. Psychological Bulletin, 117, 167-178.

Hastie, T. (2001) The Elements of Statistical Learning: Data Mining, Inference and Prediction. New York: Springer-Verlag New York.

Hastie, T., Buja, A. & Tibshirani, R. (1995) Penalized discriminant analysis. Annals of Statistics, 23, 73-102.

Hastie, T. and Tibshirani, R. (1996) Discriminant analysis by Gaussian mixtures. Journal of the Royal Statistical Society, B58, 155-176.

Hastie, T. and Tibshirani, R. (1998) Classification by pairwise coupling. Annals of Statistics, 26, 451-471.

Hastie, T., Tibshirani, R. and Buja, A. (1994) Flexible discriminant analysis by optimal scoring. Journal of the American Statistical Association, 89, 1255-1270.

Heckman, J., Ichimura, H. and Todd, P. (1997) Matching as an econometric evaluation estimaor: evidence from evaluating a job training programme. Review of Economic Studies, 64, 605-654.

Heckman, J., Ichimura, H. and Todd, P. (1998) Matching as an econometric evaluation estimator. Review of Economic Studies, 65, 261-294.

Heitjan, D.F. (1997) Annotation: what can be done about missing data? Approaches to imputation. American Journal of Public Health, 87, 548-550.

Heitjan, D.F. and Basu, S. (1996) Distinguishing ‘missing at random’ and ‘missing completely at random.’ The American Statistician, 50, 207-213.

30

Page 31: THE DATA FUSION BIBLIOGRAPHY

Heitjan, D.F. and Little, R.J.A. (1991) Multiple imputation for the Fatal Accident Reporting system. Applied Statistics, 40, 13-29.

Heitjan, D.F. and Rubin, D.B. (1986) Inference from coarse data using multiple imputation. Computer Science and Statistics, Proceedings of the 18th Symposium on the Interface, 138-143.

Heijtjan, D.F. and Rubin, D.B. (1990) Inference from coarse data via multiple imputation with application to age heaping. Journal of the American Statistics Association, 85, 304-14.

Heitjan, D.F. and Rubin, D.B. (1991) Ignorability and coarse data. The Annals of Statistics, 19, 2244-2253.

Heller, I. (1964) On linear programs equivalent to the transportation problem. SIAM Review, 12, 31-42.

Hellman, M.E. (1970) The nearest neighbor classification rule with a reject option. IEEE Transactions on Systems Science and Cybernetics, SSC-6, 179-185.

Hendrickson, K. and Acott, R. (1994) Beyond fusion – using neural networks to improve media targeting. In From Door-to-Door to Satellite: Media Research for More Effective Planning, Athens, Greece.

Henery, R.J. (1997) Combining classification procedures. In Machine Learning and Statistics: The Interface (ed. G. Nakhaeizadeh and C.C. Taylor). New York: Wiley Interscience.

Henley, W.E. and Hand, D.J. (1996) A k-nearest neighbor classifier for assessing consumer credit risk. Statistician, 44, 77-95.

Hepenstall, N., Smith, G., Morgan, G., Levine, M., Morgan, G. and North, N. (2001) Single source – understanding the consumer in a multimedia world. The ARF Week of Workshops, November 2, 2001, Chicago, USA.

Henrichon, E. and Fu, K. (1969) A nonparametric partitioning procedure for pattern classification. IEEE Transactions on Computers, 18, 614-624.

Herzog, T.N. (1980) Multiple imputation of individual Social Security amounts. Part II. Proceedings of the Section on Survey Research Methods, American Statistical Association, 404-407. Washington, DC: American Statistical Association.

Herzog, T.N. and Lancaster, C. (1980) Multiple imputation of individual Social Security amounts. Part I. Proceedings of the Section on Survey Research Methods, American Statistical Association, 398-403. Washington, DC: American Statistical Association.

Herzog, T.N. and Rubin, D.B. (1983) Using multiple imputation to handle nonresponse in sample surveys. In Incomplete Data in Sample Surveys, Volume 2, Theory and Bibliographies (W.G. Madow, I. Olkin, and D.B. Rubin (eds.)). New York: Academic Press.

Hickman, B.L., Barr, R.S. and Turner, J.S. (1995) Statistical file merging: a new constrained network model and parallel solution approach. INFORMS Sessions, New Orleans, LA.

Hiden, J., Habbema, J.D.F. and Bjerregaard, B. (1978) The measurement of performance in probabilistic diagnosis II. Trustworthiness of the exact values of the diagnostic probabilities. Methods of Information in Medicine, 17, 227-237.

31

Page 32: THE DATA FUSION BIBLIOGRAPHY

Hiden, J., Habbema, J.D.F. and Bjerregaard, B. (178) The measurement of performance in probabilistic diagnosis III. Methods based on continuous functions of the diagnostic probabilities. Methods of Information in Medicine, 17, 238-246.

Hilgers, R.A. (1991) Distribution-free confidence bounds for ROC curves. Methods of Information in Medicine, 30, 96-101.

Hill, C.J. (1978) The application of a systematic method of automatic edit and imputation to the 1976 Canadian Census of Population and Housing. Survey Methodology, 4, 178-202.

Hillier, F.S. and Lieberman, G.J. (2000) Introduction to Operations Research. New York: McGraw Hill.

Hills, M. (1966) Allocation rules and their error rates. Journal of the Royal Statistical Society, B28, 1.

Hills, M. (1967) Discrimination and allocation with discrete data. Journal of the Royal Statistical Society, C16, 237-250.

Hills, R. (1961) An electrical analogue for solving transportation problem. International Journal of Production Research, 1(1), 56-62.

Hinkins, S., Oh, H.L. and Scheuren, F. (1997) Inverse sampling design algorithms. Survey Methodology, 23, 11-21.

Hirano, K., Imbens, G.W. and Ridder, G. (2000) Efficient estimation of average treatment effects using the estimated propensity score. Working Paper T0251, National Bureau of Economic Research (http://www.nber.org/).

Hitchcock, F.L. (1941) The distribution of a product from several sources to numerous localities. Journal of Mathematical Physics, 20, 224-230.

Hochbaum, D.S. (1995) Approximation Algorithms for NP-hard Problems. Boston: PWS Publishing.

Hoel, P.G. and Peterson, R.P. (1949) A solution to the problem of optimum classification. Annals of Mathematical Statistics, 20, 433-438.

Hoeting, J.A., Madigan, D., Raftery, A. and Volinsky, C.T. (1999) Bayesian model averaging: a tutorial. Statistical Science, 14, 382-417.

Hoffman, A.N. (1953) Cycling in the simplex algorithm. Report No. 2974, National Bureau of Standards: Gaithersburg, M.D.

Hoffman, A.J. and Markowitz, H.M. (1954) A note on the shortest path, assignment and transportation problems. Naval Research Logistics Quarterly, 1.

Holland, J. H. (1992) Adaptation in Natural and Artificial Systems. Cambridge, Massachusetts: MIT Press.

Hollenbeck, K. and Doyle, P. (1979) Distributional characteristics of a merged microdata file. Proceedings of the Survey Research Section, 206-208. Washington, DC: American Statistical Association.

Holte, R. (1993) Very simple classification rules perform well on most commonly used datasets. Machine Learning, 11, 63-90.

32

Page 33: THE DATA FUSION BIBLIOGRAPHY

Hood, D. (2002) Meet Mr. Fusion. Strategy Magazine, October 21, 2002.

Hopcroft, J.E. and Karp, R.M. (1973) A n5/2 algorithm for maximum matching in bipartite graphs. SIAM Journal of Computing, 2, 225-231.

Hora, S. and Wilcox, J. (1982) Estimation of error rates in several-population discriminant analysis. Journal of Marketing Research, 19, 57-61.

Horton, I.F., Russell, J.S. and Moore, A.W. (1968) Multivariate covariance and canonical analysis: a method for selecting the most effective discriminators in a multivariate situation. Biometrics, 24, 845-858.

Horton, N.J. and Lipsitz, S.R. (2001) Multiple imputation in practice: comparison of software packages for regression models with missing variables. The American Statistician, 55, 244-254.

Houle, C. Berthelot, J.-M., David, P., Wolfson, M.C., Mustard, C. and Roos, L. (1997) Matching census databases and Manitoba healt care files. In Record Linkage Techniques (W. Alvey and Jamerson, B. (eds)), Arlington, VA.

Howe, G.R. and Lindsay, J. (1981) A generalized iterative record linkage computer system for use in medical follow-up studies. Computers and Biomedical Research, 14, 327-340.

Hughes, G.F. (1968) On the mean accuracy of statistical pattern recognizers. IEEE Transactions on Information Theory, IT-14, 55-63.

Ignizio, N.P. (1976) Goal Programming and Extensions. Lexington, MA: Heath (Lexington Books).

Ignizio, J.P. (1978) A review of goal programming: a tool for multi-objective analysis. Journal of the Operational Research Society, 29, 1109-1119.

Ignizio, J.P. (1983) A note on computational methods in lexicographic linear goal programming. Journal of the Operational Research Society, 34, 539-542.

Ignizio, J.P. (1989) On the merits and demerits of intergaer goal programming. Journal of the Operational Research Society, 22, 338-346.

Imbens, G. W. (2000) The role of the propensity score in estimating dose-response functions. Biometrika, 87, 706-710.

Ingram, D., O’Hare, J., Scheuren, F. and Turek, J. (2000) Statistical matching: a new validation case study. Proceedings of the Survey Research Methods Section, American Statistical Association.

Ingram, D., Turek, J., Scheuren, F. and O’Hare, J. (2001) Statistical matching: continued validation. Proceedings of the Social Sciences Section, American Statistical Association.

Iri, M. (1960) A new method of solving transportation network problems. Journal of Operations Research Society of Japan, 3, 27-87.

Iserman, H. (1982) Linear lexicographic optimization. OR Spectrum, 3, 223-228.

33

Page 34: THE DATA FUSION BIBLIOGRAPHY

Ishibuchi, H., Nozaki, K., Yamamoto, N. and Tanaka, H. (1995) Selecting fuzzy if-then rules in classification problems using genetic algorithms. IEEE Transactions on Fuzzy Systems, 3(3), 260-270.

Ivie, G. and Peacock, J. (2001) Fusion: Standards, guidelines, and disclosure. ARF Week of Workshops, October 29-November 1, 2001, Chicago, USA.

Jabine, T.B. and Scheuren, F.J. (1986) Record linkages for statistical purposes: methodological issues. Journal of Official Statistics, 2, 255-277.

Jacobs, R., Jordan, M., Nowlan, S. and Hinton, G. (1991) Adaptive mixtures of local experts Neural Computation, 3, 79-87.

Jacobson, M. and Maiville, M. S. (2003) Target-specific data intergration for multi-media estimation: the pros and cons of an alternative to general methods. ARF Week of Workshops, September 24, New York City, USA.

Jain, A.K., Dubes, R.C. and Chen, C.-C. (1987) Bootstrap techniques for error estimates. IEEE Transactions on Pattern Analysis and Machine Intelligence, 9, 628-633.

Jain, S., Osherson, D., Royer, J.S. and Sharma, A. (1999) Systems That Learn: An Introduction to Learning Theory. Cambridge, MA: MIT Press.

James, I.R. (1975) Multivariate distributions which have beta conditional distributions. Journal of the American Statistical Association, 70, 681-684.

James, I.R. (1995) A note on the analysis of censored regression data by multiple imputation. Biometrics, 51, 358-362.

Jaro, M.A. (1989) Advances in record-linkage methodology as applied to matching the 1985 Census of Tampa, Florida. Journal of the American Statistical Association, 84, 414-420.

Jaro, M.A. (1995) Probabilistic linkage of large public health data files. Statistics in Medicine, 14, 491-498.

Jarvis, T. (1993) A vital and economic solution in a changing North American market. Proceedings of the San Francisco Readership Symposium, 465-479.

Jarvis, T. and Wood, L. (2001) Fusion --- can magazines be equal players in a fused database? 2001 Worldwide Readership Research Symposium, 123-127.

Jarvis, T. and Wood, L. (2001) Fusion - have your cake and eat it too! Use the full print and full TV respondents. ARF Week of Workshops, October 29-November 1, Chicago, USA.

Jarvis, T. and Wood, L. (2002) Fusion – Have your cake and eat it too!  Use the full print and full TV respondents. Staying Tuned 2002. February 11, Toronto, Canada. See also Canadian Advertising Research Foundation Newsletter #254, March 2002.

Jephcott, J. and Bock, T. (1998) The application and validation of data fusion. Journal of the Market Research Society, 40(3), 185-205.

Ji, C. and Ma, S. (1997) Combinations of weak classifiers. IEEE Transactions on Neural Networks, 8, 32-42.

Jiang, Q. and Zhang, W. (1993) An improved method for finding nearest neighbors. Pattern Recognition Letters, 14, 531-535.

34

Page 35: THE DATA FUSION BIBLIOGRAPHY

Joe, H. (1997) Multivariate Models and Dependence Concepts. London: Chapman and Hall.

Joe, H. and Liu, Y. (1996) A model for a multivariate binary response with covariates based on compatible conditionally specified logistic regressions. Statistics and Probability Letters, 31, 113-120.

Joffe, M.M. and Rosenbaum, P.R. (1999) Propensity scores. American Journal of Epidemiology, 150, 327-333.

John, S. (1960) On some classification problems: I. Sankhya, A22, 301-308.

John, S. (1960) On some classification problems: II. Sankhya, A22, 309-316. Amended Sankyha, A23, 308.

Johnson, B.W., Woodburn, R.L. and Cutwright, V.A. (1993) Using multiple imputation to estimate the effect of data entry errors. Proceedings of the Quality and Productivity Section, American Statistical Association, 65-70. Washington, DC: American Statistical Association.

Jones, J.P. (1995) Single-source research begins to fulfill its promise. Journal of Advertising Research, 35(3), 9-16.

Kadane, J.B. (1978) Some statistical problems in merging data files. In 1978 Compendium of Tax Research, Office of Tax Analysis, Department of the Treasury, 159-171. Washington, DC: U.S. Government Printing Office. Reprinted in Journal of Official Statistics (2001), 17, 423-433.

Kadane, J.B. (2001) Some statistical problems in merging data files. Journal of Official Statistics, 17, 423-433.

Kallio M, and Salo, S. (2000) An interior point method for solving systems of linear equations and inequalities. Operations Research Letters, 27, 101-107.

Kalton, G. (1983) Compensating For Missing Survey Responses. Ann Arbor, Michigan: Institute for Social Research.

Kalton, G. and Kasprzyk, D. (1982) Imputing for missing survey response. Proceedings of the Section on Survey Research Methods, American Statistical Association, 22-31. Washington, DC: American Statistical Association.

Kalton, G. and Kasprzyk, D. (1986) The treatment of missing survey data. Survey Methodology, 12, 1-16.

Kalton, G. and Kish, L. (1981) Two efficient random imputation procedures. Proceedings of the Survey Research Methods Section, American Statistical Association, 146-151. Washington, DC: American Statistical Association.

Kalton, G. and Kish, L. (1984) Some efficient random imputation methods. Communications in Statistics – Theory and Methods, 13(16), 1919-1939.

Kamakura, W.A. and Wedel, M. (1997) Statistical data fusion for cross-tabulation. Journal of Marketing Research, 34, 485-498.

Kamakura, W.A. and Wedel, M. (2000) Factor analysis and missing data. Journal of Marketing Research, 37, 490-498.

35

Page 36: THE DATA FUSION BIBLIOGRAPHY

Karmarkar, N. (1990) An interior-point approach for NP-complete problems. Contemporary Mathematics, 114, 297-308.

Karmarkar, N.K. (1984) A new polynomial time algorithm for linear programming. Combinatorica, 4, 373--395.

Kangar-Parsi, B. and Kanal, L.N. (1985) An improved branch and bound algorithm for computing k-nearest neighbours. Pattern Recognition Letters, 3, 7-12.

Kass, G.V. (1980) An exploratory technique for investigating large quantities of categorical data. Applied Statistics, 29, 119-127.

Kaufman, C.J. (1988) The application of logical imputation to household measurement. Journal of the Market Research Society, 30, 453-466.

Kaufman, L. and Rousseuuw, P.R. (1990) Finding Grups in Data: An Introduction to Cluster Analysis. John Wiley & Sons: New York.

Kelley, R.P. (1983) A preliminary study of the error structure of statistical matcing. Proceedings of the Survey Research Section, 206-208. Washington, DC: American Statistical Association.

Kelley, R.P. (1986) Robustness of Census Bureau’s record linkage system. Proceedings of the Section on Survey Research Methods, American Statistical Association, 620, 624. Washington, DC: American Statistical Association.

Kendall, M.G. (1966) Discrimination and classification. In Multivariate Analysis (P.R. Krishnaiah (ed.)), 165-185.. New York: Academic Press.

Kennedy, R.L., Lee, Y., van Roy, B., Reed, C.D. and Lippman, R.P. (1997) Solving Data Mining Problems Through Pattern Recognition. Upper Saddle River, NJ: Prentice-Hall.

Kennickell, A.B. (1991) Imputation of the 1989 Survey of Consumer Finances: Stochastic relaxation and multiple imputation. Proceedings of the Survey Research Methods Section, American Statistical Association, 1-10. Washington, DC: American Statistical Association.

Kennickell, A.B. and McManus, D.A. (1994) Multiple imputation of the 1983 and 1989 waves of the SCF. Proceedings of the Survey Research Methods Section, American Statistical Association, 523-528. Washington, DC: American Statistical Association.

Kennington, J. (1976) The fixed-charge transportation problem: a computational study with a branch-and-bound code. AIIE Transactions, 8(2), 241-247.

Kennington, J. and Unger, V. (1976) A new branch-and-bound algorithm for the fixed charge transportation problem. Management Science, 22(10), 1116-1126.

Kim, B. and Park, S. (1986) A fast k-nearest neighbor finding algorithm based on the ordered partition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 8, 761-766.

Kim, H. and Loh, W.-Y. (2001) Classification trees with unbiased multiway splits. Journal of American Statistical Association, 96, 589-604.

King, G. (1997) A solution to the ecological inference problem. Princeton University Press: Princeton, NJ.

Kish, L. (1965) Survey Sampling. New York: Wiley.

36

Page 37: THE DATA FUSION BIBLIOGRAPHY

Kite, J. (2001) Mixed-media campaigns: can we fix it? Yes we can! Admap, June 2001, 48-50.

Kittler, J. (1998) Combining classifiers: A theoretical framework. Pattern Analysis and Applications, 1, 18-27.

Kittler, J., Hatef, M. Duin, R.W. and Matas, J. (1998) On combining classifiers. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-20, 226-239.

Klein, M. (1967) A primal method for minimal cost flows, with applications to the maximum flow problem: I. Journal of the Association for Computing Machinery, 24, 630-645.

Kleinbaum, D.G., Morgenstern, H., and Kupper, L.L. (1982) Selection bias in epidemiologic studies. American Journal of Epidemiology, 113, 452-463.

Klingman D. and R. Russell (1975) Solving constrained transportation problems. Operations Research, 23, 91-106.

Knoke, J. (1986) The robust estimation of classification error rates. Computers and Mathematics with Applications, 12A, 253-260.

Koltchinskii, V. and Panchenko, D. (2002) Empirical margin distributions and bounding the generalization error of combined classifiers. Annals of Statistics, 30, 1-50.

Kong, A., Liu, J. and Wong, W.H. (1994) Sequential imputation and Bayesian missing data problems. Journal of the American Statistical Association, 89, 278-288.

Kooiman, P. (1997) A statistical data base of the Dutch population: some methodological issues. Proceedings of the Symposium on Statistical Software ’97, Centraal Bureau voor de Statistiek, 93-115.

Kooperberg, C., Bose, S. and Stone, C.J. (1997) Polychotomous regression. Journal of the American Statistical Association, 92, 177-127.

Koopmans, T.C. and Reiter, S. (1961) A model of transportation. In T.C. Koopmans (ed). Activity Analysis of Production and Allocation. New York: John Wiley and Sons.

Kotiah, T.C.T. and Steinberg, D.I. (1978) On the possibility of cycling with the simplex method. OR, 26(2), 374-376.

Kotler, P. (1964) Towards an explicit model for media selection. Journal of Advertising Research, 4(2), 34-41.

Kovacevic, M.S. and Liu, T.P. (1994) Statistical matching of survey data files. Proceedings of the Survey Research Methods Section, American Statistical Association. Washington, DC: American Statistical Association.

Kovar, J.G. and Chen, E.J. (1994) Jackknife variance estimation of imputed survey data. Survey Methodology, 20, 45-52.

Kovar, J.G., Whitridge, P. and MacMillan, J. (1998) Generalized edit and imputation system for economic surveys at Statistics Canada. Proceedings of the Section on Survey Research Methods, American Statistical Association, 690-695. Washington, DC: American Statistical Association.

Koza, J. R. (1992) Genetic Programming: On the Programming of Computers by Means of Natural Selection. Cambridge, Massachusetts: MIT Press.

37

Page 38: THE DATA FUSION BIBLIOGRAPHY

Koza, J. R. (1994) Genetic Programming II: Automatic Discovery of Reusable Programs. Cambridge, Massachusetts: MIT Press.

Koza, J.R., Bennett, F.H., Andre, D. and Keane, M.A. (1999) Genetic Programming III: Darwinian Invention and Problem Solving. San Francisco, California: Morgan Kaufmann Publishers Inc.

Krishnaswami, P. and Nath, P. (1968) Bias in multinomial classification. Journal of the American Statistical Association, 63, 298-303.

Kroese, B., Renssen, R.H. and Trijssenaar, M. (2000) Weighting or imputation: constructing a consistent set of estimates based on data from different sources. Netherlands Official Statistics, 15, 23-31.

Krzanowski, W.J. (1975) Discrimination and classification using both binary and continuous variables. Journal of the American Statistical Association, 70, 782-790.

Krzanowski, W.J. (1976) Discrimination and classification using both binary and continuous variables. Journal of the American Statistical Association, 71, 845-848.

Krzanowski, W.J. (1977) The performance of Fisher’s linear discriminant function under non-optimal conditions. Technometrics, 19, 191-200.

Krzanowski, W.J. (1983) Distance between populations using mixed continuous and categorical variables. Biometrika, 70, 235-243.

Krzanowski, W.J. (1984) On the null distribution of the distance between two groups, using mixed continuous and categorical variables. Journal of Classification, 1, 243-253.

Krzanowski, W.J. (1987) A comparison between two distance-based discriminant principles. Journal of Classification, 4, 73-84.

Kuhn, H.W. (1955) The Hungarian method for the assignment problem. Naval Research Logistics Quarterly, 2, 83-97.

Kulkarni, S.R., Lugosi, G. and Venkatesh, S.S. (1998) Learning pattern classification --- a survey. IEEE Transactions on Information Theory, IT-44(6), 2178-2206.

Kwak, N.K. and Schniederjans, M.J. (1985) An alternative solution method for goal programming problems: the lexicographic goal programming case. Socio-economic Planning Sciences, 19, 101-107.

Laaksonen, S. (1999) How to find the best imputation technique? Tests with three methods. International Conference on Nonresponse, Portland, Oregon.

Lachenbruch, P.A. (1967) An almost unbiased method of obtaining confidence internals for the probability of misclassification in discriminant analysis. Biometrics, 23, 639-645.

Lachenbruch, P.A. (1968) On expected probabilities of misclassification in discriminant analysis, necessary sample size, and a relation with the multiple correlation coefficient. Biometrics, 24, 823-834.

Lachenbruch, P.A. (1975) Discriminant Analysis. New York: Hafner Press.

Lachenbruch, P.A. and Mickey, M.R. (1968) Estimation of error rates in discriminant analysis. Technometrics, 10, 1-11.

38

Page 39: THE DATA FUSION BIBLIOGRAPHY

Lachenbruch, P.A., Sneering, D. and Revo, L.T. (1974) Robustness of the linear and quadratic discriminant function to certain types of non-normality. Communications in Statistics, 1, 39-57.

Ladd, G.W. (1966) Linear probability functions and discriminant functions. Econometrica, 34, 873-885.

Lakshminarayan, K., Harp, S.A., Goldman, R. and Samad, T. (1996) Imputation of missing data using machine learning techniques. Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. Portland, OR, 140-145.

Lam, K.F., Choo, E.U. and Wedley, W.C. (1993) Linear goal programming in estimation of classification probabilities. European Journal of Operational Research, 67, 101-110.

Landerman, L.R., Land, K.C. and Pieper, C.F. (1997) An empirical evaluation of the predictive mean method for imputing missing values. Sociological Methods and Research, 26, 3-33.

Lang, J. and Agresti, A. (1994) Simultaneously modelling joint and marginal distributions of multivariate categorical responses. Journal of American Statistical Association, 89, 625-632.

Langlet, E.R. (1990) Use of cluster analysis for collapsing imputation classes. Survey Methodology, 16, 137-143.

Langley, W., Kennington, J. and Shetty, M. (1974) Efficient computational devices for the capacitated transportation problem. Naval Research Logistics Quarterly, 21(4), 637-647.

Lanke, J. (1983) Hot deck imputation techniques that permit standard methods for assessing precision of estimates. Statistical Review, 21, 105-110.

Larsen, M.D. and Rubin, D.B. (2001) Iterative automated record linkage using mixture models. Journal of the American Statistical Association, 96, 32-41.

Lavori, P.W., Dawson, R. and Shera, D. (1995) A multiple imputation strategy for clinicial trials with truncation of patient data. Statistics in Medicine, 14, 1913-1925.

Lawler, E.L. (1976) Combinatorial Optimization: Networks and Matroids. New York: Holt, Rinehart & Winston.

Lazzeroni, L.C., Schenker, N. and Taylor, J.M.G. (1990) Robustness of multiple-imputation

techniques to model misspecification. Proceedings of the Survey Research Methods Section, American Statistical Association, 260-265. Washington, DC: American Statistical Association.

Lebart, L. and Lejeune, M. (1995) Assessment of Data Fusions and Injections. In Encuentro Internacional AIMC sobre Investigación de Medios, Madrid, 118.

Lee, A.M. and Burkart, A.J. (1960). Some optimization problems in advertising media planning. Operational Research Quarterly, 11, 113-122.

Lee. H., Rancourt, E. and Särnal, C.E. (1994) Experiments with variance estimation from survey data with imputed values. Journal of Official Statistics, 10, 231-243.

Lee. H., Rancourt, E. and Särnal, C.E. (1995) Variance estimation in the presence of imputed data for the generalized estimation system. Proceedings of the Section on Survey Research

39

Page 40: THE DATA FUSION BIBLIOGRAPHY

Methods, American Statistical Association, 384-389. Washington, DC: American Statistical Association.

Lee. H., Rancourt, E. and Särnal, C.E. (2002) Variance estimation from survey data under single imputation. In Survey Nonresponse (eds. Groves, R.M., Dillman, D.A., Eltinge, J.L. and Little, R.J.A.), 315-328. New York: Wiley.

Lejeune M. (1995), De l'usage des fusions de données dans les études de marché. Proceedings 50th Session of the International Statistical Institute, Tome LVI. pp.923-935. China: Beijing.

Lesaffre, E. and Albert, A. (1989) Partial separation in logistic discrimination. Journal of the Royal Statistical Society, B51, 109-116.

Lessler, J.T. and Kalsbeek, W.D. (1992) Nonsampling Errors in Surveys. New York: John Wiley & Sons.

Leventhal, B. (1997) An approach to fusing market research with database marketing. International Journal of Market Research, 39(4).

Li, K.C. (1991) Sliced inverse regression for dimension reduction. Journal of the American Statistical Association, 86, 316-327.

Li, K.H. (1988) Imputation using Markov Chains. Journal of Statistical Computation and Simulation, 30, 57-79.

Li, K.H., Meng, X.L., Raghunathan, T.E. and Rubin, D.B. (1991) Significance levels from repeated p-values with multiply-imputed data. Statistica Sinica, 1, 65-92.

Li, K.H., Raghunathan, T.E. and Rubin, D.B. (1991) Large sample significance levels from multiply-imputed data using moment-based statistics and an F reference distribution. Journal of the American Statistical Association, 86, 1065-1073.

Li, Y.P., Propert, K.J. and Rosenbaum, P.R. (2001) Balanced risk set matching. Journal of the American Statistical Association, 96, 870-882.

Liang, K.-Y., Zeger, S.L. and Qaqish, B. (1992) Multivariate regression analyses for categorical data. Journal of the Royal Statistical Society, B54, 3-24.

Lim, T.-S., Loh, W.-Y. and Shih, Y.-S. (2000) A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms. Machine Learning, 40, 203-228.

Lippmann, R. (1989) Pattern classification using neural networks. IEEE Communications Magazine, 11, 47-64.

Lin, I.-F. and Paik, M.-C. (2001) Matched case-control data analysis with selection bias. Biometrika, 57, 1106-1112.

Lisitz, S.R., Laird, N.M. and Harrington, D.P. (1991) Generalized estimating equations for correlated binary data: using the odds ratio as a measure of association. Biometrika, 78, 153-160.

Little, J.L.C. and Lodish, L.M. (1966). A media selection model and its optimization by dynamic programming. Industrial Management Review, 8, 15-23.

Little, J.L.C. and Lodish, L.M. (1969) A media planning calculus. Operations Research, 17, 1-34.

40

Page 41: THE DATA FUSION BIBLIOGRAPHY

Little, R.J.A. (1979) Maximum likelihood for multiple regression with missing values: a simulation study. Journal of the Royal Statistical Society B, 41, 1, 76-87.

Little, R.J. A. (1986) Survey nonresponse adjustments. International Statistical Review, 54, 139-157.

Little, R.J.A. (1988) Missing data adjustments in large surveys. Journal of Business and Economic Statistics, 6(2), 287-301 (with discussion).

Little, R.J.A. (1992) Regression with missing X’s: a review. Journal of the American Statistical Association, 87, 1227-1237.

Little, R.J.A. and Rubin, D.B. (1987) Statistical Analysis with Missing Data. New York: John Wiley.

Little, R.J.A. and Rubin, D.B. (1989) The analysis of social science data with missing values. Sociological Methods and Research, 18, 292-326.

Little, R.J.A. and Smith, P.J. (1983) Multivariate edit and imputation for economic data. Proceedings of the Section on Survey Research Methods, American Statistical Association. Washington, DC: American Statistical Association.

Liu, T.P. (1998) A categorical constraints guided matching algorithm. Proceedings of the Survey Methods Section, Statistical Society of Canada, 249-256.

Liu, T.P. and Kovacevic, M.S. (1996) Categorically constrained matching. Proceedings of the Survey Methods Section, Statistical Society of Canada, 123-133.

Liu, T.P. and Kovacevic, M.S. (1997) An empirical study on categorically constrained matching. Proceedings of the Survey Methods Section, Statistical Society of Canada, 167-178.

Liu, T.P. and Kovacevic, M.S. (1998) Categorical matching and constrained rematching of survey datafiles. Working Paper HSMD-98-008E, Statistics Canada.

Liu, W.Z. and White, A.P. (1995) A comparison of nearest neighborhood and tree-based methods of non-parametric discriminant analysis. Journal of Statistical Computing and Simulation, 53, 41-50.

Liu, J.S. and Wu, Y.N. (1999) Parameter expansion scheme for data augmentation. Journal of the American Statistical Association, 94, 1264-1274.

Liu, T.P. and Kovacevic, M.S. (1998) Categorical matching and constrained rematching of survey datafiles. Working Paper HSMD-98-008E, Statistics Canada.

Lloyd, C. J. (1998) Using smoothed receiver operating characteristic curves to summarize and compare diagnostic systems. Journal of the American Statistical Association, 93, 1356-1364.

Loh, W.-Y. and Shih, Y.-S. (1997) Split selection methods for classfication trees. Statistica Sinica, 7, 815-840.

Loh, W.-Y. and Vanichesetakul, N. (1988) Tree-structured classification via generalized discriminant analysis. Journal of the American Statistical Association, 83, 715-725.

Loizou, G. and Maybank, S. J. (1987) The nearest neighbor and the Bayes error rate. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-9, 254-262.

41

Page 42: THE DATA FUSION BIBLIOGRAPHY

Longford, N.T., Ely, M. and Wadsworth, M.E.J. (2000) Handling missing data in diaries of alcohol consumption. Journal of the Roay Statistical Society: Series A, 163, 381-402.

Lourie, J. (1964) Topology and computation of generalized transportation problem. Management Science, 11, 177-187.

Lovász, L. and Plummer, M.D. (1986) Matching Theory. Amsterdam: North-Holland.

Lovell, M.C. (1983) Data mining. Review of Economics and Statistics, 65, 1-12.

Lovie, A.D. and Lovie, P. (1986) The flat maximum effect and linear scoring models for prediction. Journal of Forecasting, 5, 159-168.

Lu, B., Zanutto, E., Hornik, R. and Rosenbaum, P.R. (2001) Matching with doses in an observational study of a media campaign against drug abuse. Journal of the American Statistical Association, 96, 1245-1253.

Lubin, A. (1950) Linear and non-linear discriminating functions. British Journal of Psychology, 3, 90-104.

Lubischew, A.A. (1962) On the use of discriminant functions in taxonomy. Biometrics, 18, 455-477.

Luttmann, S.M. (1990) Enriching tax research through database merging. Journal of the American Tax Association, 11(2).

Mack, Y. (1981) Local properties of k-nearest neighbor regression estimates. SIAM Journal on Algebraic and Discrete Methods, 2, 311-323.

MacLeod, J.E.S., Luk, A. and Titterington, D.M. (1987) A re-examination of the distance-weighted k-nearest neighbor classification rule. IEEE Transactions on Systems, Man and Cybernetics, SMC-17, 689-696.

MacNaughton-Smith, P. (1963) The classification of individuals by the possession of attributes associated with a criterion. Biometrics, 17, 364-366.

Madow, W.G. and Olkin, I. (1983) Incomplete Data in Sample Surveys, Volume 1, Report and Case Studies. New York: Academic Press.

Madow, W.G. and Olkin, I. (1983) Incomplete Data in Sample Surveys, Volume 2, Proceedings of the Symposium. New York: Academic Press.

Madow, W.G, Olkin, I. And Rubin, D.B. (1983) Incomplete Data in Sample Surveys, Volume 3, Theory and Bibliographies. New York: Academic Press.

Magnanti, T. and Orlin, J. (1988) Parametric linear programming and anti-cycling pivoting rules. Mathematical Programming, 41, 317-325.

Mahalanobis, P.C. (1936) On the generalized distance in statistics. Proceedings of the National Institute of Science of India, 2(1), 49-55.

Maiville, M. (2002) So much data, so little time:  fast and simple alternatives to fusion for multimedia and cross-platform analysis. Staying Tuned 2002. February 11, Toronto, Canada.

42

Page 43: THE DATA FUSION BIBLIOGRAPHY

Mallett, D. (1997) Ascription: there’s still no such thing as a free lunch. Proceedings of the Magazine Audience Measurement Research. New York: Advertising Research Foundation.

Mandese, J. (1994) Death knell for demos? Buyers set to move on. Advertising Age, July 15.

Mandler, E. and Schurman, J. (1988) Combining the classification results of independent classifiers based on the Dempster-Schafer theory of evidence. In Pattern Recognition and Artificial Intelligence (Gelsema, E. and Kanal, L. (eds)), 381-393. New York: Elsevier.

Mangarasian, O.L., Setiono, R. and Wolberg, W.H. (1990) Pattern-recognition via linear-programming: theory and application to medical diagnosis. In Large-Scale Numerical Optimization 1990 (T.F. Coleman and Y. Li (eds.)), 22-31. Philadelphia: SIAM.

Mani, G. (1991) Lowering variance of decisions by using artificial neural network portfolios. Neural Computation, 3, 484-486.

Mantel, H. Singh, A., Kimack, M. and Rowe, G. (1991) Statistical matching: use of auxiliary information to avoid the conditional independent assumption. Proceedings of the Bureau of the Census Annual Research Conference, 688-711.

Marbett, A., Stone, M. and Washbrook, J. (1980) Cross-validatory selection of binary variables in differential diagnosis. Applied Statistics, 29, 198-204.

Marker, D. A., Judkins, D.R. and Winglee, M. (2001) Large-scale imputation for complex surveys. Groves, R.M., Dillman, D.A., Eltinge, J.L. and Little, R.J.A. (2001) Survey nonresponse. p.329-342. New York: John Wiley & Sons Inc.

Marks, S. and Dunn, O.J. (1974) Discriminant functions when covariance matrices are unequal. Journal of the American Statistical Association, 69, 555-559.

Martini, A. and Trivellato, U. (1997) The role of survey data in microsimulation models for social policy analysis. Labour, 11, 83-112.

Mason, L., Barlett, P. and Baxter, J. (1999) Improved generalization through explicit optimization of margins. Machine Learning, 38, 243-255.

Massey, W.F. (1965) Discriminant analysis of audience characteristics. Journal of Advertising Research, 5, 39-48.

Matushita, K. (1956) Decision rule, based on distance, for the classification problem. Annals of the Institute of Statistical Mathematics, 8, 67-77.

Matushita, K. (1957) Classification based on distance in multivariate Gaussian cases. Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, 1, 299-304. Berkeley, California: University of California Press.

Maville, M.S. (2001) So much data, so little time: fast and simple alternatives to fusion for multimedia and cross-platform analysis. The ARF Week of Workshops, November 1, 2001, Chicago USA.

Maxwell. A.E. (1961) Canonical variate analysis when the variables are dichotomous. Educational and Psychological Measurement, 21, 259-271.

Maghout, K., Comes, G. and Steinberg, N. (1962) Généralisation à n dimensions du problème de transport. Revue Française de Recherche Opérationelle, 6(22), 21-34.

43

Page 44: THE DATA FUSION BIBLIOGRAPHY

Mangasarian, O.L. (1965) Linear and nonlinear separation of patterns by linear programming. Operations Research, 13, 444-452.

McIntire, P. (2001) Marketers get your measure. The Australian, 30 October 2001.

McConochie, R.M. and Uyenco, B. (2001) Cross-media contributions of Arbitron’s portable people meters (PPM). The ARF Week of Workshops, November 1, Chicago, USA.

McConochie, R.M. and Uyenco, B. (2002) Trash and treasures of duplication: Cross media synergy prospects from Arbitron’s portable people meters. ESOMAR/ARF Week of Audience Measurement, Cannes, France.

McConochie, R.M. and Uyenco, B. (2002) Trash and treasures of duplication II: Improving Media plans. Proceedings of the ARF WOW, New York City, USA.

McConochie, R.M. and Uyenco, B. (2003) Real cross media intelligence for real cross media planning. The PPM contribution. ARF/ESOMAR Week of Audience Measurement, Los Angeles, California, USA, June 18, 2003.

McCue, K. F. (2001) The statistical foundations of EI method. American Statistician, 55, 106-110.

McDonald, S. C. (2001) Using the 24-hour day as the common denominator for cross-media comparisons: the case of multi-media mentor. 10th Worldwide Readership Research Symposium, Venice, 519-523.

McKay, R. J. and Campbell, N.A. (1982) Variable selection techniques in discriminant analysis. I: Description. British Journal of Mathematical and Statistical Psychology, 35, 1-29.

McKay, R.J. and Campbell, N.A. (1982) Variable selection techniques in discriminant analysis. II: Allocation. British Journal of Mathematical and Statistical Psychology, 35, 30-41.

McLachlan, G.J. (1974) Estimation of the errors of misclassification on the criterion of asymptotic mean square error. Technometrics, 16, 255-256.

McLachlan, G.J. (1976) The bias of the apparent error rate in discriminant analysis. Biometrika, 63, 239-244.

McLachlan, G.J. (1977) Estimating the linear discriminant function from initial samples containing a small number of unclassified observations. Journal of the American Statistical Association, 72, 403-406.

McLachlan, G.J. (1977) A note on the choice of a weighting function to give an efficient method for estimating the probability of misclassification. Pattern Recognition, 9, 147-149.

McLachlan, G.J. (1976) The bias of the apparent error rate in discriminant analysis. Biometrika, 63, 239-244.

McLachlan, G.J. (1992) Error rate estimation in discriminant analysis: recent advances. In Advances in Multivariate Statistical Analysis (Gupta, A.K. (ed.)), 233-252. Dordrecht: D. Reidel.

McLachlan, G.J. (1992) Discriminant Analysis and Statistical Pattern Recognition. New York: John Wiley.

44

Page 45: THE DATA FUSION BIBLIOGRAPHY

McNeil, B.J. and Adelstein, S.J. (1976) Determining the value of diagnostic and screening tests. Journal of Nuclear Medicine, 17, 439-448.

Mehta, H. and Gonsalves, B. (2001) Pressing the hot button for press. 10th Worldwide Readership Research Symposium, Venice, 543-553,

Meimand, M. (1995) Statistical matching – a first experience. New Zealand Statistics Review, 1995 (March), 9-19.

Melton, E. and Appel, V. (2003) Imputation of missing data: testing a model-based approach. Worldwide Readership Research Symposium, Cambridge, MA, USA.

Melton, R.S. (1963) Some remarks of failure to meet assumptions in discriminant analysis. Psychometrika, 28, 49-53.

Meng, X.-L. (1994) Multiple imputation inferences with uncongenial sources of input (with discussion). Statistical Science, 9, 538-573.

Meng, X.-L. (2000) Missing data: dial M for ??? Journal of the American Statistical Association, 95, 1325-1330.

Meng, X.L. (2001) A congenial overview and investigation of multiple imputation inferences under congeniality. Groves, R.M., Dillman, D.A., Eltinge, J.L. and Little, R.J.A. (2001) Survey nonresponse. p. 343-356. New York: John Wiley & Sons Inc.

Meng, X.L. and Rubin, D.B. (1992) Performing likelihood ratio tests with multiply-imputed data sets. Biometrika, 79, 103-111.

Meng, X., and D. B. Rubin (1993) Maximum likelihood via the ECM algorithm: a general framework. Biometrika, 80, 267-278.

Meng, X.-L. and van Dyk, D.A. (1999) Seeking efficient data augmentation schemes via conditional and marginal augmentation. Biometrika, 86, 301-320.

Merz, J. (1991) Micro-simulation – a survey of principles, developments and applications. International Journal of Forecasting, 7, 77-104.

Messham, N. (1993) Sharper targets: more effective schedules? How TGI’s – coupling TGI to BARB – can sometimes give planners a significant competitive edge. Admap, 332, 26-29.

Metz, C.E. (1986) Methodology in radiologic imaging. Investigative Radiology, 21, 720-733.

Michalewicz, Z. (1992) Genetic Algorithms + Data Structures = Evolution Programs. Berlin, Germany: Springer-Verlag.

Michalewicz, Z. and Fogel, D.B. (1999) How to Solve It: Modern Heuristics. New York: Springer-Verlag New York Inc.

Michalewicz, Z. and Janikow, C.Z. (1991) Handling constraints in genetic algorithms. In R. K. Belew and L. B. Booker (eds.), Proceedings of the 4th International Conference on Genetic Algorithms. San Francisco, California: Morgan Kaufmann., 151-157.

Michalewicz, Z. and Schoenauer, M. (1996) Evolutionary algorithms for constrained parameter optimization problems. Evolutionary Computation, 4(1):1-32.

45

Page 46: THE DATA FUSION BIBLIOGRAPHY

Michalewicz, Z., Vignaux, G.A. and Hobbs, M. (1991) A non-standard genetic algorithm for the nonlinear transportation problem. ORSA Journal on Computing, 3(4), 307-316.

Michie, D., Spiegelhalter, D.J. and Taylor, C.C. (eds.) (1994) Machine Learning, Neural and Statistical Classification. Chichester, UK: Ellis Horwood.

Miller, R.G. (1962) Statistical prediction by discriminant analysis. Meterorological Monographs, 4(25). Boston: American Meteorological Society.

Ming, K. and Rosenbaum, P.R. (2000) Substantial gains in bias reduction from matching with a variable number of controls. Biometrics, 56, 118-124.

Ming, K. and Rosenbaum, P.R. (2001) A note on optimal matching with variable controls using the assignment algorithm. Journal of Computational and Graphical Statistics, 10, 455-463.

Mislevy, R.J. (1993) Should "multiple imputations" be treated as "multiple indicators"? Psychometrika, 58, 79-85.

Mitchell, M. (1996) An Introduction to Genetic Algorithms. Cambridge, Massachusetts: MIT Press.

Mitchell, T.M. (1997) Machine Learning. New York: McGraw Hill.

Molenberghs, G. and Goetghebeur, E. (1997) Simple fitting algorithms for incomplete categorical data. Journal of the Royal Statistical Society, B 59, 401-414.

Molenberghs, G., Goetghebeur, E.J., Lipsitz, S.R. and Kenward, M.G. (1999) Nonrandom missingness in categorical data: strengths and limitations. The American Statistician, 53, 110-118.

Molenberghs, G. and Ritter, L.L. (1996) Methods for analyzing multivariate binary data, with association between outcomes of interest. Biometrics, 52, 1121-1133.

Montaquila, J.M. and Jernigan, R.W. (1997) Variance estimation in the presence of imputed data. Proceedings of the Section on Survey Research Methods, American Statistical Association, 273-277. Washington, DC: American Statistical Association.

Montaquila, J.M. and Ponikowski, C.H. (1993) Comparison of methods for imputing missing responses in establishment surveys. Proceedings of the Section on Survey Research Methods, American Statistical Association, 446-451. Washington, DC: American Statistical Association.

Moore, D.H. (1973) Evaluation of five discrimination procedures for binary variables. Journal of the American Statistical Association, 68, 339-404.

Moran, M.A. and Murphy, B.J. (1979) A closer look at two alternative methods of statistical discrimination. Applied Statistics, 28, 223-232.

Moriarty, C. and Scheuren, F. (2001) Statistical matching: pitfalls of current procedures. Social Sciences Section, Joint Statistical Meetings, Atlanta, GA.

Moriarity, C. and Scheuren, F. (2001) Statistical matching: a paradigm for assessing the uncertainty in the procedure. Journal of Official Statistics, 17, 407-422.

46

Page 47: THE DATA FUSION BIBLIOGRAPHY

Moriarity, C. and Scheuren, F. (2003) A note on Rubin’s statistical matching using file concatenation with adjusted weights and multiple imputations. Journal of Business and Economic Statistics, 21, 65-73.

Morin, R.L. and Raeside, D.E. (1981) A reappraisal of distance-weighted k-nearest neighor classification for pattern recognition with missing data. IEEE Transactions on Systems, Man and Cybernetics, SMC-11, 241-243.

Morrison, D.G. (1969) On the interpretation of discriminant analysis. Journal of Marketing Research, 6, 156-163.

Morrison, D.R. (1969) Matching algorithms. Journal of Combinatorial Theory, 6, 20-32.

Munkres, J. (1957) Algorithms for the assignment and transportation problems. Journal of the Society for Industrial and Applied Mathematics, 5, 32-38.

Murthy, C.A., Bandyopadhyay, S. and Pal, S.K. (1996) Genetic algorithm-based pattern classification: relationship with Bayes classifier. In Genetic Algorithms for Pattern Recognition. (S.K. Pal and P.P. Wang (eds.)). Boca Raton, FL: CRC Press.

Myles, J.P. and Hand, D.J. (1990) The multi-class metric problem in nearest neighbor discrimination rules. Pattern Recognition, 23, 1291-1297.

Nakhaeizadeh, G. and Taylor, C.C. (eds.) (1997) Machine Learning and Statistics: The Interface. New York: Wiley-Interscience.

Napior, D. (1996) Between the covers: statistical fusion of magazine audience and print ad exposure databases. In Readership Measurement At Crossroads: Same Methods, Different Answers: Why? ARF Print Readership Research Workshop, 275-280, New York, NY.

Napior, D. and Ivins, B. (2005) Calibrated fusion evaluation: the MRI-comScore Case. Worldwide Readership Research Symposium, Prague 2005.

Nash, S.G. and Sofer, A. (1996) Linear and Nonlinear Programming. New York: McGraw-Hill.

Nelsen, R.B. (1999) An Introduction to Copulas. New York: Springer-Verlag New York.

Neter, J., Maynes, E.S. and Ramanathan, R. ( 1965) The effect of mismatching on the measurement of response errors. Journal of the American Statistical Association, 60, 1005-1027.

Newcomb, H.B. (1988) Handbook of Record Linkage: methods for health and statistical studies, administration and business. Oxford: Oxford Medical Publications.

Newcomb, H.B., Fair, M.E. and Lalonde, P. (1992) The use of names for linking personal records (with discussion). Journal of the American Statistical Association, 87, 1193-1208.

Newcombe, H. B., and Kennedy, J. M. (I 962) Record linkage, making maximum use of the discriminating power of Identifying information. Communication of the Association for Computing Machinery, 5(11), 563-566.

Newcombe, H. B., Kennedy, J.M., Axford, S.J. and James, A.P. (1959) Automatic linkage of vital records. Science, 130, 954-959.

47

Page 48: THE DATA FUSION BIBLIOGRAPHY

Nicklas, R. (2002) Questionnaire length and respondent burden in media practice. IMPUTE: Symposium on Nonresponse, Questionnaire Split and Multiple Imputation, Nuremberg, Germany, September 25.

Nicklas, R. and Sudholt, T. (1999) Movements in the German TV-advertising market -- development and first results of a new fusion approach: The MOVE-project. Advertising Research Foundation Week of Workshops, New York City, October 7, 1999.

Nielsen, S.F. (2001) Nonparametric conditional mean imputation. Journal of Statistical Planning and Inference, 99, 129-150.

Nordbotten, S. (1995) Editing statistical records by by neural networks. Journal of Official Statistics, 11, 391-411.

Nordbotten, S. (1996) Neural network imputation applied to the Norwegian 1990 Population Census data. Journal of Official Statistics, 12, 385-401.

Nordholt, E.S. (1998) Imputation: Methods, simulation experiments, and practical examples. International Statistical Review, 66 (2), 157-180.

O’Brien, S. (1991) The role of data fusion in actionable media targeting in the 1990’s. Marketing and Research Today, 19 (February), 15-22.

O’Brien, S. and Wilcox, S. (1988) The marriage of BARB and G-Track. Admap, 276, 11-14.

O’Hare, J.F. (1997) Impute or match? Strategies for microsimulation modeling. In International Conference on Combinatorics, Information Theory and Statistics, University of Southern Maine, Portland, ME.

Okauchi, S. (2002) The Japanese fusion experience (validation of data fusion by means of ACR data splitting). ARF Week of Workships. October 10, 2002, New York City.

Okner, B. A. (1972) Constructing a new database from existing microdata sets: The 1966 merge files. Annals of Economic and Social Measurement, 1, 325-342.

Okner, B. (1974) Data matching and merging: an overview. Annals of Economic and Social Measurement, 3, 347-352.

Olkin, I. And Tate, R. F. (1961) Multivariate correlation models with mixed discrete and continuous variables. Annals of Mathematical Statistics, 32, 448-465.

Olson, D.A. (1984) A comparison of four goal programming algorithms. Journal of the Operational Research Society, 33, 247-251.

Opitz, D. and Maclin, R. (1999) Popular ensemble methods: an empirical study. Journal of Artificial Intelligence Research, 11, 169-198.

Orcutt G H (1986) Views on microanalytic simulation modeling. In Microanalytic simulation models to support social and financial models (Orcutt, G.H., Merz, J. and Quinke, H. (eds.)), 9-41. Amsterdam: Elsevier.

Orlin, J.B. (1993) A faster strongly polynomial minimum cost flow algorithm. Operations Research, 41, 338-350.

Orr, G.B. and Müeller, K.-R. (1998) Neural Networks: Tricks of the Trade. Berlin: Springer-Verlag.

48

Page 49: THE DATA FUSION BIBLIOGRAPHY

Overstreet, G.A., Bradley, E.L. Jr. and Kemp, R.S. Jr. (1992) The flat maximum effect and generic linear scoring models: A test. IMA Journal of Mathematical Applications in Business and Industry, 4, 97-109.

Owen, A. (1984) A neighbourhood-based LANDSAT classifier. Canadian Journal of Statistics, 12, 191-200.

Owens, D. (1999) Why independent mixed-media planning? Admap, 34(1), January 1999, 20-21.

Paass, G. (1985) Statistical record linkage methodology: state of the art and future prospects. In The 100th Session of the International Statistical Institute, Amsterdam, International Statistical Institute.

Paass, G. (1986) Statistical match: Evaluation of existing procedures and improvements by using additional information. In Microanalytic Simulation Models to Support Social and Financial Policy (Orcutt, G.H., Merz, J. and Quinke, H. (eds.)). Amsterdam: Elsevier Science.

Paass, G. (1989) Stochastic generation of a synthetic sample from marginal information. Proceedings of the Fifth Annual Research Conference, U.S. Bureau of the Census, 431-445.

Paass, G. and Wauschkuhn, U. (1980) Experimentelle Erprobung und vergleichende Bewertung Statistischer Matchverfahren. Interner Report IPES.80.201, Gesellschaft für Mathematik und Datenverarbeitung, Bonn, Germany.

Paik, M.C. and Sacco, R.L. (2000) Matched case-control data analyses with missing covariates. Applied Statistics, 49, 145-156.

Pal, S.K. and Wang, P.P. (eds.) (1996) Genetic Algorithms for Pattern Recognition. Boca Raton, FL: CRC Press.

Papadimitriou, C.H. and Steiglitz, K. (1998) Combinatorial Optimization: Algorithms and Complexity. Mineola, NY: Dover Publications Inc.

Papazian, E (1980) Using product usage data in media selection. Marketing and Media Decisions, 15(7), 16-20.

Parsons, L.S. (2000) Using SAS software to perform a case-control match on propensity score in an observational study. Proceedings of the Twenth-Fifth Annual SAS Users Group International Conference. Cary, NC: SAS Institute, Inc. 1166-1171.

Parthasarthy, G. and Chatterji, B.N. (1990) A class of new KNN methods for low sample problems. IEEE Transactions on Systems, Man and Cybernetics, SMC-20, 715-718.

Parzen, E. (1962) On estimation of a probability density function and mode. Annals of Mathematical Statistics, 33, 1065-1076.

Patterson, D.W. and Mattson, R.L. (1966) A method of finding linear discriminant functions for a class of performance criteria. IEEE Transactions on Information Theory, IT-12, 380-387.

Patterson, M., O'Malley, L. and Evans, M.J., (1999) Direct marketing: data fusion and data control. Journal of Targeting Measurement and Analysis for Marketing, 7(3), 288-302.

Patrick, E.A. and Fischer, F.P. III (1970) A generalized k-nearest neighbor rule. Information and Control, 16(2), 128-152.

49

Page 50: THE DATA FUSION BIBLIOGRAPHY

Peck, R. and van Ness, J. (1982) The use of shrinkage estimators in linear discriminant analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 4, 530-537.

Peck, R., Jennings, L.W. and Young, D.M. (1988) A comparison of several biased estimators for improving the expected error rate of the sample quadratic discriminant function. Journal of Statistical Computing and Simulation, 29, 143-156.

Pepe, M.S. (1997) A regression modelling framework for receiving operating characteristic curves in medical diagnostic testing. Biometrika, 84, 595-608.

Pendrod, C. S. and Wagner, T.J. (1977) Another look at the edited nearest neighbor rule. IEEE Transactions on Systems, Man and Cybernetics, 7, 92-94.

Penrose, L. (1947) Some notes on discrimination. Annals of Eugenics, 13, 228-237.

Peterson, D.W. (1970) Some convergence properties of a nearest neighbor decision rule. IEEE Transactions on Information Theory, IT-16,26-31.

Pfefferman, D. (1993) The role of sampling weights when modeling survey data. International Statistical Review, 61, 317-357.

Phelps, S.P. (1987) Single source: proceed with caution. Journal of Advertising Research, 27(5), RC8-RC9.

Piper, J. (1992) Variability and bias in experimentally measured classifier error rates. Pattern Recognition Letters, 13, 685-692.

Platek, R. (1980) Causes of incomplete data, adjustments and effects. Survey Methodology, 6, 93-132.

Platek, R. and Gray, G.B. (1978) Nonresponse and imputation. Survey Methodology, 4, 144-177.

Prager, W.W. (1957) A generalization of Hitchcock's transportation problem. Journal of Mathematical Physics, 36, 99-106.

Pregibon, D. (1977) Typical survey data: estimation and estimation. Survey Methodology, 2, 70-102.

Prentice, R.L. and Pike, R. (1979) Logistic disease incidence models and case-control studies. Biometrika, 66, 403-411.

Puliyel, T., Pillai, S. and Sinha, A. (1991) Making datasets compatible for data fusion: results of an Indian experiment. In Fifth Worldwide Readership Research Symposium, Hong Kong.

Purdye, K. and R. Dick (1996) Product/media and the quest for the holy grail. In Worldwide Electronic and Broadcast Audience Research Symposium, San Francisco, ESOMAR/ARF, 405-426.

Pyle, D. (1999) Data Preparation for Data Mining. San Francisco, CA: Morgan Kaufmann Publishers Inc.

Qin, J. and Zhang, B. (2003) Using logistic regression procedures for estimating receiver operating characteristic curves. Biometrika, 90(3), 585-596.

50

Page 51: THE DATA FUSION BIBLIOGRAPHY

Qu, P. and Loh, W.-Y. (1992) Application of Box-Cox transformation to discrimination for the two-class problem. Communications in Statistics (Theory and Methods), 21, 2757-2774.

Quade, D. (1982) Nonparametric analysis of covariance by matching. Biometrics, 38, 587-611.

Quandt, R. E. and Kuhn, H.W. (1964) On upper bounds for the number of iterations in solving linear programs. Operations Research, 12, 161-165.

Quenouille, M.H. (1949) Note on the elimination of insignificant variates in discriminant analysis. Annals of Eugenics, 14, 305-308.

Quinlan, J.R. (1986) Induction of decision trees. Machine Learning, 1, 81-106.

Quinlan, J.R. (1996) Bagging, boosting and C4.5. In Proceedings Thirteenth American Association for Artificial Intelligence National Conference on Artificial Intelligence, 725-730. Menlo Park, CA: AAAI Press.

Radner, D.B. (1978) The development of statistical matchings in economics. Proceedings of the Social Statistics Section, American Statistical Association, 503-508.

Rässler, S. (2001) Alternative Approaches to Statistical Matching with an Application to Media Data (Alternative Verfahren der Datenfusion mit einer Anwendung auf Media-Daten),  Habilitationsschrift, Nürnberg.

Rässler, S. (2002) Split questionnaire survey sampling. IMPUTE: Symposium on Nonresponse, Questionnaire Split and Multiple Imputation, Nuremberg, Germany, September 25.

Rässler, S. (2002) Statistical matching : a frequentist theory, practical applications, and alternative Bayesian approaches. Springer-Verlag New York: New York.

Rässler, S. (2003) A non-iterative Bayesian approach to statistical matching. Staistica Neerlandica, 57(1), 58-74.

Rässler, S. (2004) Data fusion: identification problems, validity, and multiple imputation. Austrian Journal of Statistics, 33(1-2), 153-171.

Rässler, S., Fleischer, K. (1998). Aspects concerning data fusion techniques. ZUMA Nachrichten Spezial 4, Mannheim, 317-333.

Rässler, S. and Fleisher, K. (1999) An evaluation of data fusion techniques. Workshop and Symposium on Combining Data from Difference Sources. May 4-7, 1999, Ottawa, Canada.

Raghavendra B G (1989) An evaluation and optimisation model for media planning.  International Journal on Management and Systems, 5(l), 1-16. 

Raghavendra, B.G. (1993) Advertising media planning and the financial portfolio problem: some comparisons. Journal of Management, 22.

Raghunathan, T.E. and Grizzle, J.E. (1995) A split questionnaire survey design. Journal of the American Statistical Association, 90, 54-63.

Raghunathan, T.E. and Lepkowski, N. (1999) Multiple imputation for a large national survey sample. Proceedings of the Section on Survey Research Methods, American Statistical Association. Washington, DC: American Statistical Association.

51

Page 52: THE DATA FUSION BIBLIOGRAPHY

Raimondi, D. and Santini, G. (1997) Just-in-time data modelling. Proceedings of the Vancouver Worldwide Readership Symposium.

Raimondi, D. and Santini, G. (2001) Fusion quality assurance. 10th Worldwide Readership Research Symposium, Venice, 111-118.

Ramalingam, T. (1985) Statistical properties of the file-merging methodology. Doctor of Philosophy Thesis. Lafayette, Indiana: Purdue University.

Rancourt, E. (1999) Estimation with nearest neighbour imputation at Statistics Canada. Proceedings of the Section on Survey Research Methods, American Statistical Association. Washington, DC: American Statistical Association.

Rancourt, E., Sarndal, C. E. and Lee, H. (1994) Estimation of the variance in the presence of nearest neighbor imputation. Proceedings of the Section on Survey Research Methods, American Statistical Association, 888-893. Washington, DC: American Statistical Association.

Randles, R.H., Brofitt, J.D., Ramberg, J.S. and Hogg, R.V. (1978) Discriminant analysis based on ranks. Journal of the American Statistical Association, 73, 379-384.

Randles, R., Broffitt, J., Ramberg, J. and Hogg, R. (1978) Generalized linear and quadratic discriminant functions using robust estimates. Journal of the American Statistical Association, 73, 564-568.

Raghunathan, T.E. (2002) Controlled split survey in media practice. IMPUTE: Symposium on Nonresponse, Questionnaire Split and Multiple Imputation, Nuremberg, Germany, September 25.

Rao, B.L.S.P. (1983) Nonparametric functional estimation. New York: Academic Press.

Rao, C.R. (1946) Tests with discriminant functions in multivariate analysis. Sankhya, 7, 407-414.

Rao, C.R. (1947) The problem of classification and distance between two populations. Nature, 159, 30.

Rao, C.R. (1947) A statistical criterion to determine to which an individual belongs. Nature, 160, 835-836.

Rao, C.R. (1948) The utilization of multiple measurements in problems of biological classification. Journal of the Royal Statistical Society, B10(2), 159-193.

Rao, C.R. (1949) On the distance between two populations. Sankhya, 9, 246-248.

Rao, C. R. (1950) Statistical inference applied to classificatory problems. Sankhya, 10, 229-256.

Rao, C.R. (1954) On the use and interpretation of distance functions in statistics. Bulletin of the International Institute of Statistics, 34, 90-97.

Rao, C.R. (1962) Use of discriminant and allied functions in multivariate analysis. Sankhya, A22, 317-338.

Rao, J.N.K. (1996) On variance estimation with imputed survey data. Journal of the American Statistical Association, 91, 499-505.

52

Page 53: THE DATA FUSION BIBLIOGRAPHY

Rao, J.N.K. and Shao, J. (1992) Jackknife variance estimation with survey data under hot deck imputation. Biometrika, 79, 811-822.

Rao, M.R. (1971) Cluster analysis and mathematical programming. Journal of the American Statistical Association, 66, 622-626.

Rappaport, A. !975) Research paradigms for studying dynamic decision behavior. In Utility, Probability and Human Decision-Making (Wendt, D. and Vlek, C.A.J. (eds.)). Dordrecht, The Netherlands: Reidel.

Rässler, S. (2002) Statistical Matching: A Frequentist Theory, Practical Applications and Alternative Bayesian Approaches. New York: Springer-Verlag New York Inc.

Rässler, S. and Fleischer, K. (2000). An evaluation of data fusion techniques. In Symposium 99: Combining Data from Different Sources, Statistics Canada, Ottawa, Canada.

Rätsch, G., Onoda, T. and Müller, K.R. (2001) Soft margins for AdaBoost. Machine Learning, 42(3), 287-320.

Raudys, S. (2001) Statistical and Neural Classifiers: An Integrated Approach to Design. London: Springer-Verlag London.

Raynor, W.J. (1983) Caliper pair-matching on a continuous variable in case control studies. Communications in Statistics: Theory and Methods, 12, 1499-1509.

Redway, H. (2002) Creating the base data from which to simulate an application of data fusion by statistical matching. Combining data: using advanced technology to enhance social science resources, University of Manchester, United Kingdom, December 18, 2002.

Reichert, A.K., Cho, C.-C. and Wagner, G.M. (1983) An examination of the conceptual issues involved in developing credit-scoring models. Journal of Business and Economic Statistics, 1, 101-114.

Reilly, J. (1999) Integrated media planning through statistical matching: development and evaluation of the New Zealand Panoroma service. Workshop and Symposium on Combining Data from Difference Sources. May 4-7, 1999, Ottawa, Canada.

Reilly, J. (2000) The Development and Evaluation of Statistical Matching Applications. Masters of Science Thesis, University of Auckland, New Zealand.

Reilly, M. (1993) Data analysis using hot-deck multiple imputation. Statistician, 42, 307-313.

Reilly, M. and Pepe, M. (1997) The relationship between hot-deck multiple imputation and weighted likelihood. Statistics in Medicine, 16, 5-19.

Remme, J., Habbema, J.D.F. and Hermans, J. (1980) A simulative comparison of linear, quadratic and kernel discrimination. Journal of Statistical Computing and Simulation, 11, 87-106.

Renssen, R.H. (1998) Use of statistical matching techniques in calibration estimation. Survey Methodology, 24(2), 171-183.

Richard, M.D. and Lippmann, R.P. (1991) Neural network classifiers estimate Bayesian probabilities. Neural Computing, 3, 461-483.

53

Page 54: THE DATA FUSION BIBLIOGRAPHY

Richardson, C. (1990) Data fusion does not work. Journal of the Market Research Society, 32(3), 472-473.

Ridgeway, G. (1999) The state of boosting. In Computer Science and Statistics: Proceedings of the Thirty-first Symposium on the Interface (K. Berk and M. Pourahmadi (eds.)), 172-181. Fairfax, Virginia: Interface Foundation of North America.

Ridgeway, G. (2000) Looking for lumps: boosting and bagging for density estimation. Proceedings of the International Meeting on Nonlinear Methods and Data Mining, (S. Borra and A. Di Ciaccio, eds.), 11-25.

Riffenburgh, R.H. and Clunies-Ross, C.W. (1960) Linear discriminant analysis. Pacific Science, 14, 251-256.

Ripley, B.D. (1994) Neural networks and related methods for classification (with discussion). Journal of the Royal Statistical Society, B56, 409-456.

Ripley, B.D. (1996) Pattern Recognition and Neural Networks. New York: Cambridge University Press.

Ritter, G.L., Woodruff, H.B., Lowry, S.R. and Isenhour, T.L. (1975) An algorithm for a selective nearest neighbor decision rule. IEEE Transactions on Information Theory, IT-21, 665-669.

Roberts, A. (1994) TV exposure, brand buying and ad effects. Admap, June, 31-57.

Roberts, A., 1994: Media exposure and consumer purchasing: an improved data fusion technique. Paper given at the 178th ESOMAR Seminar, 21st March 1994

Roberts, A. (1994) Media exposure and consumer purchasing: an improved data fusion technique. Marketing and Research Today, 22 (August), 159-172.

Robins, J.M. and Wang, N. (2000) Inference for imputation estimators. Biometrika, 87, 113-124.

Robinson, W. (1950) Ecological correlations and the behavior of individuals. American Sociological Review, 15, 351-357.

Rockwell, R.C. (1975) An investigation of imputation and differential quality of data in the 1970 Census. Journal of the American Statistical Association, 70, 39-42.

Rodgers, W.L. (1982) An evaluation of statistical matching. Proceedings of the Section on Survey Research Methods, American Statistical Association, 128-132. Washington, DC: American Statistical Association.

Rodgers, W.L. (1984) An evaluation of statistical matching. Journal of Business and Economic Statistics, 2, 91-102.

Rodgers, W.L. and DeVol, E. (1982) An evaluation of statistical matching. Proceedings of the Section on Survey Research Methods, American Statistical Association, 128-132. Washington, DC: American Statistical Association.

Rogot, E., Sorlie, P., and Johnson, N. J. (1986) Probabilistic methods in matching Census samples to the National Death Index. Journal of Chronic Disease, 39, 719-734.

Romero, C. (1986) A survey of generalized goal programming (1970-1982) European Journal of Operational Research, 25, 183-191.

54

Page 55: THE DATA FUSION BIBLIOGRAPHY

Rosenbaum, P.R. (1984) Conditional permutation tests and the propensity score in observational studies Journal of the American Statistical Association, 79, 565-574.

Rosenbaum, P.R. (1984) From association to causation in observational studies: the role of tests of strongly ignorable treatment assignment. Journal of the American Statistical Association, 79, 41-48.

Rosenbaum, P.R. (1984) The consequences of adjustment for a concomitant variable that has been affected by the treatment. Journal of the Royal Statistical Society, A147, 656-666.

Rosenbaum, P.R. (1987) Model-based direct adjustment. Journal of the American Statistical Association, 82, 387-394.

Rosenbaum. P.R. (1987) The role of a second control group in observational studies. Statistical Science, 2, 292-316.

Rosenbaum, P.R. (1987) Sensitivity of certain permutation inferences in matched observational studies. Biometrika, 74, 13-26.

Rosenbaum, P.R. (1988) Sensitivity analysis for matching with multiple controls. Biometrika, 75, 577-581.

Rosenbaum, P.R. (1989) Optimal matching in observational studies. Journal of the American Statistical Association, 84, 1024-1032.

Rosenbaum, P.R. (1991) Sensitivity analysis for matched case-control studies. Biometrics, 47, 87-100.

Rosenbaum, P.R. (1991) A characterization of optimal designs for observational studies. Journal of the Royal Statistical Society, Series B, 53, 597-610.

Rosenbaum, P.R. (1991) Discussing Hidden Bias in Observational Studies. Annals of Internal Medicine, 115, 901-905.

Rosenbaum, P.R. (1992) Detecting bias with confidence in observational studies. Biometrika, 79, 367-374.

Rosenbaum, P.R. (1995) Observational Studies. New York: Springer Verlag.

Rosenbaum, P.R. (1999) Choice as an alternative to control in observational studies. Statistical Science, 14, 259-304.

Rosenbaum, P.R. and Rubin, D.B. (1983) The central role of the propensity score in observational studies for causal effects. Biometrika, 70, 41-55.

Rosenbaum, P.R. and Rubin, D.B. (1983) Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. Journal of the Royal Statistical Society, B45, 212-218.

Rosenbaum, P.R. and Rubin, D.B. (1984) Reducing bias in observational studies using subclassification on the propensity score. Journal of the American Statistical Association, 79, 516-524.

Rosenbaum, P.R. and Rubin, D.B. (1985) Constructing a control group using multivariate matching sample incorporating the propensity score. American Statistician, 39, 33-38.

55

Page 56: THE DATA FUSION BIBLIOGRAPHY

Rosenbaum, P.R. and Rubin, D.B. (1985) The bias due to incomplete matching. Biometrics, 41, 103-116.

Rosenbaum, P.R. and Silber, J.H. (2001) Matching and thick description in an observational study of mortality after surgery. Biostatistics, 2, 217-232.

Rosenblatt, M. (1956) Remarks on some non-parametric estimates of a density function. Annals of Mathematical Statistics, 27, 832-837.

Rossiter, J.R. and Danaher, P.J. (1998) Advanced Media Planning. Kluwer Academic Publishers: Boston, MA.

Rothman, J. (1988) Data fusion and single source surveys. Admap, 276, 37-41.

Rothman, J. (1988) Testing data fusion. ESOMAR Seminar on Media and Media Research, Madrid.

Rubin, D.B. (1973) Matching to remove bias in observational studies. Biometrics, 29, 159-183. Printer’s correction note (1974) , 30, 728.

Rubin, D.B. (1974) Characterizing the estimation of parameters in incomplete-data problem. Journal of the American Statistical Association, 69, 467-474.

Rubin, D.B. (1975) The use of matched sampling and regression adjustment to remove bias in observational studies. Biometrics, 29, 185-203.

Rubin, D.B. (1976) Inference and missing data. Biometrika, 63, 581-592.

Rubin, D.B. (1976) Matching methods that are equal percent bias reducing: some examples. Biometrics, 32, 109-120. Printer’s correction 955.

Rubin, D.B. (1976) Multivariate matching methods that are equal percent bias reducing: Maximums on bias reduction for fixed sample sizes. Biometrics, 32, 121-132.

Rubin, D.B. (1977) Formalizing subjective notions about the effect of nonrespondents in sample surveys. Journal of the American Statistical Association, 72, 538-543.

Rubin, D.B. (1977) Assignment to a treatment group on the basis of a covariate. Journal of Educational Statistics, 2, 1-26.

Rubin, D.B. (1978) Multiple imputation in sample surveys --- a phenomenological Bayesian approach to nonresponse. Proceedings of the Survey Research Methodology Section, American Statistical Association, 20-34. Washington DC: American Statistical Association.

Rubin, D.B. (1979) Illustrating the use of multiple imputations to handle nonresponse in sample surveys. Proceedings of the 42nd Session of the International Statistical Institute, Book 2, 517-532.

Rubin, D.B. (1979) Illustrating the use of multiple imputations to handle nonresponse in sample surveys. Bulletin of the International Statistical Institute, 48, 517-532.

Rubin, D.B. (1979) Using multivariate matched sample and regression adjustment to control bias in observational studies. Journal of American Statistical Association, 74, 318-328.

Rubin, D.B. (1980) Bias reduction using Mahalanobis’ metric matching. Biometrics, 36, 295-298.

56

Page 57: THE DATA FUSION BIBLIOGRAPHY

Rubin, D.B. (1981) The Bayesian bootstrap. Annals of Statistics, 9, 130-134.

Rubin, D.B. (1986) Statistical matching using file concatenation with adjusted weights and multiple imputations. Journal of Business and Economic Statistics, 4, 87-94.

Rubin, D.B. (1986) Basic ideas of multiple imputation for nonresponse. Survey Methodology, 12(1), 37-48.

Rubin, D. B. (1987) Multiple Imputation for Nonresponse Surveys. John Wiley & Sons, Inc.

Rubin, D.B. (1988) An overview of multiple imputation. Proceedings of the Survey Research Methods Section of the American Statistical Association, 79-84. Washington, DC: American Statistical Association.

Rubin, D.B. (1991) Practical implications of modes of statistical inference for causal effects and the critical role of the assignment mechanism. Biometrics, 47, 1213-1234.

Rubin, D. B. (1996) Multiple imputation after 18+ years. Journal of the American Statistical Association, 91, 473-489.

Rubin, D.B. (1997) Estimating causal effects from large data sets using propensity scores. Annals of Internal Medicine, 127, 757-763.

Rubin, D.B. and Belin, T. (1991) Recent developments in calibrating error rates for computer matching. Proceedings of the 1991 Annual Research Conference, U.S. Bureau of the Census, 657-668. Washington, DC: U.S. Bureau of the Census.

Rubin, D.B. and Neal, T. (1992) Characterizing the effect of matching using linear propensity score methods with normal distributions. Biometrika, 79, 797-809.

Rubin, D.B. and Neal, T. (1996) Matching using estimated propensity scores: relating theory to practice. Biometrics, 52, 249-264.

Rubin, D.B. and Neal, T. (2000) Combining propensity score matching with additional adjustments for prognostic covariates. Journal of the American Statistical Association, 95, 573-585.

Rubin, D. B. and Schafer, J.L. (1990) Efficiently creating multiple imputations for incomplete multivariate normal data. Proceedings of the Statistical Computing Section, American Statistical Association, 1990, 83-89. Washington, DC: American Statistical Association.

Rubin, D.B., Schafer, J.L. and Schenker, N. (1988) Imputation strategies for missing values in post-enumeration surveys. Survey Methodology, 14, 209-221.

Rubin, D.B. and Schenker, N. (1986) Multiple imputation for interval estimation from simple random samples with ignorable nonresponse. Journal of the American Statistical Association, 81, 366-374.

Rubin, D.B., Stern, H. and Vehovar, V. (1995) Handling 'Don't Know' survey responses: The case of the Slovenian plebiscite. Journal of the American Statistical Association, 90, 822-828.

Rubin, D.B. and Thomas, N. (1992) Affinely invariant matching methods with ellipsoidal distributions. Annals of Statistics, 20, 1079-1093.

57

Page 58: THE DATA FUSION BIBLIOGRAPHY

Rubin, D.B. and Thomas, N. (1992) Characterizing the effect of matching using linear propensity score methods with normal distributions. Biometrika, 79, 797-809.

Rubin, D.B. and Thomas, N. (1996) Matching using estimated propensity scores: relating theory to practice. Biometrics, 52, 249-264.

Rubin, D.B. and Thomas, N. (2000) Combining propensity scores matching with additional adjustment for prognostic covariates. Journal of the American Statistical Association, 90, 822-828.

Rubin, D.B. and Zanutto, E. (2001) Using matched substitutes to adjust for nonignorable nonresponse through multiple imputations. In Groves, R.M., Dillman, D.A., Eltinge, J.L. and Little, R.J.A. (2001) Survey nonresponse, 389-402. New York: John Wiley & Sons Inc.

Rüschendorf, L., Schweizer, B. and Taylor, M.D. (eds) (1996) Distributions with Fixed Marginals and Related Topics. Lecture Notes – Monograph Series Volume 28. Hayward, CA: Institute of Mathematical Statistics.

Ruggles, N. and Ruggles, R. (1974) A strategy for merging and matching microdata sets. Annals of Economic and Social Measurement, 3, 353-371.

Ruggles, N., Ruggles, R. and Wolff, E. (1977) Merging microdata: rationale, practice and testing. Annals of Economic and Social Measurement, 6, 429-44.

Ruggles, N.D. and Ruggles, R. (1999) Macro- and Micro-data Analyses and Their Integration. Northampton, MA: Edward Elgar Publishing.

Ruhomon, B. (1987) Data fusion: its role in media research and media planning. Admap, June.

Rushbrook, T. and Street. P. (2002) A single-source multimedia study – Building from television, a study for media convergence. ESOMAR/ARF Week of Audience Measurement, Cannes, France.

Rushbrook, T. (2002) Canadian Advertising Research Foundation Data Integration Committee Report. Canadian Advertising Research Foundation Newsletter, 255, June 2002.

Russell, E.J. (1969) Extension of Dantzig's algorithm to finding an initial near optimal-basis for the transportation problem. Operations Research, 17, 187-191.

Safavian, S.R. and Landgrebe, D. (1991) A survey of decision tree classifier methodology. IEEE Transactions on Systems, Man and Cybernetics, SMC-21, 660-674.

Salzberg, S. (1997) On comparing classifiers: Pitfalls to avoid and a recommended approach. Data Mining and Knowledge Discovery, 1(3), 317-327.

Sande, I.G. (1979) A personal view of the hot deck imputation procedures. Survey Methodology, 5, 238-258.

Sande, I. G. (1981) Descriptive statistics used in monitoring edit and imputation process. Proceedings of the 13th Symposium on the Interface. Pittsburgh, Pennsylvania.

Sande, I.G. (1981) Imputation in surveys: coping with reality. Survey Methodology, 7, 21-43.

Sande, I.G. (1982) Imputation in surveys: coping with reality. The American Statistician, 36(3), 145-152.

58

Page 59: THE DATA FUSION BIBLIOGRAPHY

Sande, I.G. (1983) Hot-deck imputation procedures. In Incomplete Data in Survey Sample Surveys, Volume 3, Proceedings of the Symposium. W.G. Madow and I. Olkin (eds.). New York: Academic Press, 334-350.

Santini, G. (1983) Fusion d'enquêtes ou rapprochement de données ? FRF : une méthode de fusion sur variables élaborées. In Les médias - Expériences, recherches actuelles, applications , Séminaire de l'I.R.E.P. (décembre 1983), 1985, pp. 77-83.

Santini, G. (1987) Méthodes de fusion : nouvelles réflexions, nouvelles expériences, nouveaux enseignements. In Les médias, expériences et recherches , Séminaire de l'I.R.E.P. (décembre 1986), 1987, pp. 367-386.

Santini, G. (1988) Validation of data fusion techniques: what can statistical theory do for us? Readership Research: Theory and Practice. Proceedings of the Fourth International Symposium. H. Henry (ed.). Barcelona.

Santini, G. (1989) Fusion in perspective. In International Symposium on Television Research, Tarrytown, New York.

Santini, G. (1999) Fusión y benchmarking. Boletín de la Asociación Mexicana de Agencias de Investigración de Mercado y Opinión Pública, Año 6, Numéro 21.

Santini, G. (2003) Revisting media planning models’ assumptions. Worldwide Readership Research Symposium, Cambridge, MA, USA.

Saporta G. and Co V. (1999) Fusion de fichiers: une nouvelle méthode basée sur l'analyse homogène. In G.Brossier and A.M.Dussaix (eds.) Enquêtes et sondages, 81-93. Dunod.

Särndal, C.E. (1992) Methods for estimating the precision of survey estimates when imputation has been used. Survey Methodology, 18, 241-252.

Sarunas, R. and Jain, A.K. (1991) Small sample size effects in statistical pattern recognition: Recommendations for practitioners. IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-13(3), 252-264.

Schafer, J.L. (1991) A comparison of missing data treatments in the Post-Enumeration Program. Journal of Official Statistics, 7, 475-498.

Shafer, J.L. (1995) Model-based imputation of census short-form items. Proceedings of the Annual Research Conference, 267-299. Washington, DC: Bureau of the Census.

Schafer, J.L. (1997) Analysis of Incomplete Multivariate Data. London: Chapman & Hall.

Schafer, J.L. (1999) Multiple imputation: a primer. Statistical Methods in Medical Research, 8, 3-15.

Schafer, J.L. (2002) “Thou shalt not impute only once” IMPUTE: Symposium on Nonresponse, Questionnaire Split and Multiple Imputation, Nuremberg, Germany, September 25.

Schafer, J.L. and Graham, J.W. (2002) Missing data: our view of the state of the art. Psychological Methods, 7(2), 147-177.

Schafer, J.L., Khare, M. and Ezzati-Rice, T.M. (1993) Multiple imputation of missing data in NHANES III. In Proceedings of the Annual Research Conference, p. 459-487. Bureau of the Census, Washington, D.C.

59

Page 60: THE DATA FUSION BIBLIOGRAPHY

Schafer, J.L. and Olsen, M.K. (1998) Multiple imputation for multivariate missing-data problems: a data analyst's perspective. Multivariate Behavioral Research, 33, 545-571.

Schafer, J.L. and Schenker, N. (1991) Variance estimation with imputed means. Proceedings of the Survey Research Methods Section, American Statistical Association, 696-701. Washington, DC: American Statistical Association.

Schafer, J.L. and Schenker, N. (2000) Inference with imputed conditional means. Journal of the American Statistical Association, 95, 144-154.

Schapire, R. E. (1990) The strength of weak learnability. Machine Learning, 5, 197-227.

Schapire, R., Freund, Y., Bartlett, P. and Lee, W. (1998) Boosting the margin: a new explanation for the effectiveness of voting methods. Annals of Statistics, 26, 1651-1686.

Schapire, R.E. and Singer, Y. (1999) Improved boosting algorithms using confidence-rated predictions. Machine Learning, 37(3):297-336.

Scharrenbroch, A. (2002) Single imputation in large-scale media surveys. IMPUTE: Symposium on Nonresponse, Questionnaire Split and Multiple Imputation, Nuremberg, Germany, September 25.

Scheler, H.-E. and Wiegand, J. (1985) A report on experiments in fusion in the official German Media Research (AG:MA). Proceedings of the Salzburg International Readership Symposium.

Scheler, H.-E. and Wiegand, J. (1986) Combining separately derived data sets into an integrated intermedia planning system: The German partnership model and fusion experiments. Television Research – International Symposium. Tarrytown, New York, October 5-8, 1986.

Scheler, H.-E. and Wiegand, J. (1987) A report on experiments in fusion in the “official” German media research (AG.MA). In Henry, H. (ed), Readership Research: Theory and Practice, 352-360, Amsterdam: Elsevier Science.

Schemper, M. and Heinze, G. (1997) Probability imputation revisited for prognostic factor studies. Statistics in Medicine 16, 73-80.

Schenker, N. (1989) The use of imputed probabilities for missing binary data. In Proceedings of the Fifth Annual Research Conference, 133-139. Washington, DC: U.S. Bureau of the Census.

Schenker, N., Treiman, D.J. and Weidman, L. (1988) Evaluation of multiply-imputed public-use tapes. Proceedings of the Survey Research Methods Section of the American Statistical Association, 85-92. Washington, DC: American Statistical Association.

Schenker, N., Treiman, D.J. and Weidman, L. (1993) Analyses of public use decennial census data with multiply-imputed industry and occupation codes. Journal of the Royal Statistical Society Series C, 42, 545-556.

Schenker, N. and Welsh, A.H. (1988) Asymptotic results for multiple imputation. Annals of Statistics, 16, 1550-1566.

Scheuren, F. (1989) A comment on “The social policy simulation database and model: an example of survey and administrative data integration.” Survey of Current Business, 40-41.

60

Page 61: THE DATA FUSION BIBLIOGRAPHY

Scheduren, F. and Alvey, W. (1974) Selected bibliography on the matching of person records from different sources. Proceedings of the Social Statistics Section, American Statistical Association, 151-154. Washington, DC: American Statistical Association.

Scheuren, F. and Oh, H.L. (1975) Fiddling around with nonmatches and mismatches. Proceedings of the Social Statistics Section, American Statistical Association, 627-633. Washington, DC: American Statistical Association.

Scheuren, F. and Oh, H.L. (1975) Fiddling around with nonmatches and mismatches. Proceedings of the Social Statistics Section, American Statistical Association, 627-633. Washington, DC: American Statistical Association.

Scheuren, F. and Winkler, W.E. (1993) Regression analysis of data files that are computer matched – Part I. Survey Methodology, 19, 39-58.

Scheuren, F. and Winkler, W.E. (1996) Recursive merging and analysis of administrative lists and data. Proceedings of the Section of Government Statistics, American Statistical Association, 123-128. Washington, DC: American Statistical Association.

Scheuren, F. and Winkler, W.E. (1997) Regression analysis of data files that are computer matched – Part II. Survey Methodology, 23, 157-165.

Schmid, J., Jr. (1950) A comparison of two procedures for calculating discriminant function coefficients. Psychometrika, 15, 431-434.

Schniederjans, M.J. (1995) Goal Programming: Methodology and Applications. Boston: Kluwer Academic Publishers.

Schniederjans, M.J. and Kwak, N.M. (1982) An alternative solution method for goal programming problems: a tutorial. Journal of the Operational Research Society, 33, 247-251.

Schreiber, R.J. (1968). Probability assignments for the simulation of media reach and frequency. Journal of Advertising Research, 8, 3-8.

Schürmann, J. (1996) Pattern Classification: A Unified View of Statistical and Neural Approaches. New York: Wiley Interscience.

Schwarz, G. (1978) Estimating the dimension of a model. Annals of Statistics, 6, 461-464.

Scott, D.W. (1992) Multivariate Density Estimation: Theory, Practice and Visualization. New York: John Wiley.

Sedransk, J. (1985) The objective and practice of imputation . Proceedings of the First Annual Research Conference, U.S. Bureau of the Census, Washington, DC, 44-452.

Sethi, I. (1981) A fast algorithm for recognizing nearest neighbors. IEEE Transactions on Systems, Man and Cybernetics, 11, 245-248.

Shao, J. and Chen, Y. (1998) Bootstrapping sample quantiles based on complex survey data under hot deck imputation. Statistica Sinica, 8, 1071-1085.

Shao, J. and Steel, P. (1999) Variance estimation for imputed survey data with nonnegligible sampling fractions. Journal of the American Statistical Association, 94, 254-265.

Sharma R. (2000) A new dual based procedure for the transportation problem. European Journal of Operational Research, 122, 611-624.

61

Page 62: THE DATA FUSION BIBLIOGRAPHY

Shetty, C.M. (1959) A solution to the transportation problem with non-linear costs. Operations Research, 7(5), 571-580.

Shively, W.P. (1969) Ecological inference: the use of aggregate data to study individuals. American Political Science Review, 65, 1183-1196.

Short, R.D. and Fukunaga, K. (1981) Optimal distance measure for nearest neighbor classification. IEEE Transactions on Information Theory, IT-27, 622-627.

Silber, J.H., Rosenbaum, P.R. and Ross, R.N. (1995) Comparing the contributions of groups of predictors: which outcomes vary with hospital rather than patient characteristics? Journal of the American Statistical Association, 90, 7-18.

Silman, R. (1992) Fusion and the media/marketing database mix. In ARF/ESOMAR Worldwide Broadcast Audience Research Symposium, Toronto (Canada), 729-740.

Silverman, B.W. (1986) Density Estimation for Statistics and Data Analysis. London: Chapman and Hall.

Silverman, B.W. and Jones, C.M. (1989) E. Fix and J.L. Hodges (1951): An important contribution to nonparametric discriminant analysis and density estimation. International Statistical Review, 57(3), 233-247.

Simonnard, M.A. and Hadley, G.F. (1959) Maximum number of iterations in the transportation problem. Naval Research Logistics Quarterly, 6.

Sims, C.A. (1972) Comments on “Construction a new data base from existing micro-data sets: the 1966 merge file”, by B.A. Okner. Annals of Economic and Social Measurement, 1, 345-355.

Sims, C.A. (1972) Rejoinder on “Construction a new data base from existing micro-data sets: the 1966 merge file”, by B.A. Okner. Annals of Economic and Social Measurement, 1, 355-357.

Sims, C.A. (1978) Comments on “Some Statistical Problems in Merging Data Files,” by J.B. Kadane, 1978 Compendium of Tax Research, U.S. Department of Treasury, 172-177.

Sinharay, S., Stern, H.S. and Russell, D. (2001) The use of multiple imputatino for the analysis of missing data. Psychological Methods, 6(4), 317-329.

Singh, A.C. (1989) Log-linear imputation. Proceedings of the Fifth Annual Research Conference. Washington, DC: U.S. Bureau of the Census.

Singh, A.C., Armstrong, J.B. and Lemaître, G.E. (1988) Statistical matching using loglinear imputation. Proceedings of the Section on Survey Research Methods, American Statistical Association, 672-677. Washington, DC: American Statistical Association.

Singh, A.C., Mantel, H.J., Kinack, M.D. and Rowe, R. (1993) Statistical matching: use of auxiliary information as an alternative to the conditional independence assumption Survey Methodology, 19, 59-79.

Sissors, J.Z. (1971) Matching media with markets. Journal of Advertising Research, 11(5), 39-45.

Sitter, R. R. and Rao, J. N. K. (1997) Imputation for missing values and corresponding variance estimation. Canadian Journal of Statistics, 24, 61-73.

62

Page 63: THE DATA FUSION BIBLIOGRAPHY

Sitter, R.R. and Skinner, C.J. (1994) Multi-way stratification by linear programming. Survey Methodology, 20, 65-73.

Smith, A., Kark, J., Cassel, J. and Spears, G. (1977) Analysis of prospective epidemiologic studies by minimum distance case-control matching. American Journal of Epidemiology, 105, 567-574.

Smith, C. (1947) Some examples of discrimination. Annals of Eugenics (London), 18, 272-283.

Smith, F.W. (1968) Pattern classifier design by linear programming. IEEE Transactions on Computers, C-17, 367-372.

Smith, H. (1997) Matching with multiple controls to estimate treatment effects in observational studies. Sociological Methodology, 27, 325-353.

Smith, K.A., Chuan, S. and van der Putten, P. (2001) Determining the validity of clustering for data fusion. International Workshop on Hybrid Intelligent Systems, 11-12 December 2001, Adelaide, Australia.

Smith, M. E., and Newcombe, H.B. (1975) Methods for computer linkage of hospital admission-separation records into cumulative health histories. Methods of Information in Medicine 14 (July), 118-125.

Smith, M.E. and Silins, J. (1981) Generalized iterative record linkage system. Proceedings of the Social Statistics Section, American Statistical Association, 128-137. Washington, DC: American Statistical Association.

Smyth, P. and Wolpert, D. (1998) Stacked density estimation. In Advances in Neural Information Processing Systems Volume 10 (M.I. Jordan, K.J. Kearns and S.A. Solia (eds.)), 668-674. Cambridge, MA: MIT Press.

Smyth, P. and Wolpert, D. (1999) An evaluation of linearly combining density estimators via stacking. Machine Learning, 36(1/2), 59-83.

Snapinn, S.M. and Knoke, J.D. (1988) Bootstrapped and smoothed classification error rate estimators. Communications in Statistics: Simulation, 17, 1135-1153.

Snapinn, S.M. and Knoke, J.D. (1989) Estimation of error rates in discriminant analysis with selection of variables. Biometrics, 45, 289-299.

Snyder, T.L. and Steele, J.M. (1990) Worst case greedy matching in the unit d-cube. Networks, 20, 779-800.

Sollich, P. and Krogh, A. (1996) Learning with ensembles: How over-fitting can be useful In Advances in Neural Information Processing Systems (Touretsky, D., Mozer, M. and Hasselmo, M. (eds.)), 8, 190-196. Cambridge, MA: MIT Press

Somla, A.J., Bartlett, P.J., Schölkopf, B. and Schuurmans, D. (2000) Advances in Large-Margin Classifiers. Cambridge, MA: MIT Press.

Soong, R. (2001) Data fusion in Latin America. 47th Annual Conference of the Advertising Research Foundation, March 9th, New York City, NY. (http://www.zonalatina.com/Zldata166.htm)

63

Page 64: THE DATA FUSION BIBLIOGRAPHY

Soong, R. (2001) Varieties of data fusion experiences. Zona Latina, May 2001. (http://www.zonalatina.com/Zldata179.htm)

Soong, R. (2001) Data fusion and media planning. Speech to the Market Research Club of Chicago, June 12th, Chicago, IL.

Soong, R. (2001) Data fusion: Intergrating results from multiple surveys. 6th Annual CASRO Technology Conference, June 22nd, New York City, NY. (http://www.zonalatina.com/Zldata183.htm)

Soong, R. (2002) Quick vs. optimal algorithms in data fusion. Zona Latina, January 2002. (http://www.zonalatina.com/Zldata215.htm)

Soong, R. (2002) The pros and cons of fusion. Staying Tuned 2002. February 11, Toronto, Canada. (http://www.zonalatina.com/Zldata220.htm) See also Canadian Advertising Research Foundation Newsletter #254, March 2002.

Soong, R. (2002) Data fusion: an intuitive argument. Zona Latina, March 2002, (http://www.zonalatina.com/Zldata228.htm )

Soong, R. and de Montigny, M. (2001) An anatomy of data fusion. Paper for the Worldwide Readership Research Symposium, Venice (Italy), 87-109. (http://www.zonalatina.com/WRRSfusion.pdf)

Soong, R. and de Montigny, M. (2002) The contribution of magazines in mixed TV-print schedules. ESOMAR/ARF Week of Audience Measurement, Cannes, France. Also reprinted in Excellence 2003 In International Research (2003). ESOMAR: Amsterdam, The Netherlands. (http://www.zonalatina.com/WAMSoongDeMontigny.pdf)

Soong, R. and de Montigny, M. (2003) Does fusion-on-the-fly really fly? ARF/ESOMAR Week of Audience Measurement, Los Angele, USA. (http://www.zonalatina.com/WAM2003.pdf)

Soong, R. and de Montigny, M. (2003) Fusion-on-the-fly for multimedia applications. 11th Worldwide Readership Symposium, Cambridge, MA, USA. (http://www.zonalatina.com/WRRS2003fly.pdf)

Soong, R. and de Montigny, M. (2003) Foundations of split-sample foldover tests. 11th Worldwide Readership Symposium, Cambridge, MA, USA. (http://www.zonalatina.com/WRRS2003split.pdf)

Soong, R. and de Montigny, M. (2004) No Free Lunch in Data Integration. ARF/ESOMAR Week of Audience Measurement, Geneva, Switzerland.(http://www.zonalatina.com/WAM2004.pdf)

64

Page 65: THE DATA FUSION BIBLIOGRAPHY

Sorum, M. (1971) Estimating the conditional probability of misclassification. Technometrics, 13,

333.

Sorum, M. (1972) Three probabilities of misclassification. Technometrics, 14, 309-316.

Sorum, M. (1972) Estimating the expected and the optimal probabilities of misclassification. Technometics, 14, 935-943.

Speetzen, R. (1988) The art of models --- ascription in Germany. In Readership Research: Theory and Practice. Proceedings of the Fourth International Symposium. H. Henry (ed.). Barcelona.

Squires, S. (1995) Data fusion and data linkage: how fusion may now, at last, fulfill its potential. Admap, January 1995.

Srinivisan, V. and Kim, Y.H. (1987) Credit granting: a comparative analysis of classification procedures. Journal of Finance, 52, 665-681.

Srivastava, M.S. (1967) Comparing distances between multivariate populations – the problem of minimum distances. Annals of Mathematical Statistics, 38, 550-557.

Staniswalis, J.G. (1989) The kernel estimate of a regression function in likelihood-based models. Journal of the American Statistical Association. 84, 276-283.

Stasch, S. (1965) Linear programming and space-time considerations in media selection. Journal of Advertising Research, 5, 40-46.

Stasch, S. (1967). Linear programming and media selection: a comment. Journal of Marketing Research, 4, 1205-7.

Steenrod, N. (1950) The Topology of Fibre Bundles. Princeton, NJ: Princeton University Press.

Stoddard, L.R. (1986) Will single source data change media planning? Journal of Advertising Research, 36(2), RC13-15.

Stoller, D.S. (1954) Univariate two-population distribution-free discrimination. Journal of the American Statistical Association, 49, 771-777.

Stone, C.J. (1977) Consistent nonparametric regression. Annals of Statistics, 3(4), 595-645.

Strauss, W.J., Carroll, R.J., Bortnick, S.M., Menkedick, J.R. and Schultz, B.D. (2001) Combining datasets to predict the effects of regulation of environmental lead exposure in housing stock. Biometrics, 57, 203-210.

Sudholt, T. (2000) Fusion in Germany: History and application of fusion techniques from a media point of view. Will Fusion Come to America Workshop, Advertising Research Foundation, December 2000, New York City.

Summerfield, P. (2001) New prez looks to reinformce BBM’s turf. Strategy Magazine, March 26, 2001.

Summerfield, P. (2001) CMDC puts the brakes on research. Strategy Magazine, September 24, 2001.

65

Page 66: THE DATA FUSION BIBLIOGRAPHY

Summerfield, P. (2001) Nielsen Media Research ventures into fusion territory. Strategy Magazine, November 5, 2001.

Sun M. (1998) A tabu search heuristic procedure for solving the transportation problem with exclusionary side constraints. Journal of Heuristics, 3, 305-326.

Sun, M., Aronson, J.E., McKeown, P.G. and Drinka, D. (1998) A tabu search heuristic procedure for the fixed charge transportation problem. European Journal of Operation Research, 106, 441-456.

Sutcliffe, J.P. (1965) A probability model for errors of distribution of classification. I. General considerations. Psychometrika, 30, 73-96.

Sutcliffe, J.P. (1965) A probability model for errors of distribution of classification. II. Particular case. Psychometrika, 30, 129-155.

Sutherland, H., Taylor, R. and Gomulka, J. (2001) Combining household income and expenditure data in policy simulations. Department of Applied Economics Working Paper, Amalgamated Series 0110, University of Cambridge, Cambridge, United Kingdom.

Swets, J.A. and Pickett, R.M. (1982) Evaluation of Diagnostic Systems: Methods from Signal Detection Theory. New York: Academic Press.

Szwarc, W. (1960) The initial solution of the transportation problem. Operations Research, 8, 727-729.

Szwarc, W. (1964) The transportation problem with stochastic demand. Management Science, vol. 11 (1), 33-50.

Tamiz, M. and Jones, D.F. (1996) An overview of current solution methods and modelling practices in goal programming. In M. Tamiz (ed). Multi-objective Programming and Goal Programming: Theories and Applications. Berlin: Springer Verlag.

Tanner, M.A. and Wong, W.H. (1987) The calculation of posterior distributions by data augmentation (with discussion). Journal of the American Statistical Association, 82, 528-550.

Tapia, R.A. and Thompson, J.R. (1978) Nonparametric Probability Density Estimation. Baltimore, Maryland: The Johns Hopkins University Press.

Tardos, E. (1985) A strongly polynomial minimum cost circulation algorithm. Combinatorica, 5, 247-255.

Taylor, C. C. (1997) Distance-based decision trees. In Machine Learning and Statistics: The Interface (edited by G. Nakhaeizadeh and C.C. Taylor). New York: Wiley Interscience.

Taylor, R., Gomulka, J. and Sutherland, H. (2000) Creating order out of chaos? Identifying homogeneous groups of households across multiple data sets. 26th Conference of the International Association for Research in Income and Wealth, Cracow, Poland.

Tchaoussoglou, C. (2000) What’s Happening: The European User’s Experience. Will Fusion Come to America Workshop, Advertising Research Foundation, December 2000, New York City.

Tepping, B.J. (1968) A method for optimum linkage of records. Journal of the American Statistical Association, 63, 1321-1332.

66

Page 67: THE DATA FUSION BIBLIOGRAPHY

Thadani, R. and Sinha, A. (1988) A study on the validity of data fusion: the Indian experience. Readership Research: Theory and Practice. Proceedings of the Fourth International Symposium. H. Henry (ed.). Barcelona, Spain.

Thibaudeau, Y. (1993) The discrimination power of dependency structures in record linkage. Survey Methodology, 19, 31-38.

Thibaudeau, Y. (1999) Model explicit item imputation for demographic categories for Census 2000. U.S. Bureau of the Census, Statistical Research Report Series, No. RR1999/04. Washington, DC: U.S. Bureau of the Census.

Thompson, J.R. and Tapia, R.A. (1990) Nonparametric Function Estimation, Modeling and Simulation. Philadelphia, Pennsylvania: SIAM.

Thompson, M.L. and Zucchini, W. (1989) On the statistical analysis of ROC curves. Statistics in Medicine, 8, 1277-1290.

Titterington, D.M., Murray, G.D., Murray, L.S., Spiegelhalter, D.J., Skene, A.M., Habbema, J.D.F. and Gelpke, G.J. (1981) Comparison of discrimination techniques applied to a complex data set of head injury patients. Journal of the Royal Statistical Society, A144, 145-175.

Todeschini, R. (1989) k-nearest neighborhood method: the influence of data transformations and metrics. Chemometrics Intelligence Laboratory Systems, 6, 213-220.

Tokuyama, T. and Nakano, J. (1992) Efficient algorithms for the Hitchcock transportation problem. Proceedings of the Third ACM-SIAM Symposium on Discrete Algorithms, Orlando, Florida. New York Association for Computing Machinery.

Tokuyama, T. and Nakano, J. (1995) Efficient algorithms for the Hitchcock transportation problem. SIAM Journal on Computing, 24(3), 563-578.

Tomke, I. (1976) An experiment with the edited nearest-neighbor rule. IEEE Transactionson Systems, Man and Cybernetics, 6, 448-452.

Totschek, R. and Wood, R.C. (1961) An investigation o fthe real-time solution of the transportation problem. Journal of the ACM, 8(2), 230-239.

Toussaint, G.T. (1974) Bibliography on estimation of misclassification. IEEE Transactions on Information Theory, IT-20, 472-479.

Toussaint, G.T. (1975) An efficient method for estimating the probability of misclassication applied to a problem in medical diagnosis. Computers in Biology and Medicine, 4, 269-278.

Trawinski, I.M. and Bargmann, R.E. (1964) Maximum likelihood estimation with incomplete multivariate data. Annals of Mathematical Statistics, 35, 647-657.

Tresp, V. and Taniguchi, M. (1995) Combining estimators using non-constant weighting functions.In Advances in Neural Information Processing Systems (Tesauro, G., Touretzky, D. and Leen, T. (eds.)), 7, 419-426. Cambridge, MA:. MIT Press.

Tutz, G. (1986) An alternative choice of smoothing for kernel-based density estimates in discrete discriminant analysis. Biometika, 73,405-411.

Tutz, G. (1988) Smoothing for discrete kernels in discrimination. Biometrical Journal, 6, 729-739.

67

Page 68: THE DATA FUSION BIBLIOGRAPHY

Tutz, G. (1989) On cross-validation for discrete kernel estimates in discrimination. Communications in Statistics --- Theory and Methods, 18, 4145-4162.

Twyman, T. and Mundy, C. (2001) Mixing research techniques for a mixed economy. ESOMAR Worldwide Radio Conference, Athens, Greece.

Ullmann, J.R. (1974) Automatic selection of reference data for use in a nearest-neighbor method of pattern classification. IEEE Transactions on Information Theory, IT-20,541-543.

U.S. Department of Commerce (1980) Report on exact and statistical matching techniques. Statistical Policy Working Paper 5. Washington, DC: Federal Committee on Statistical Methodology.

Urbakh, V. Yu. (1971) Linear discriminant analysis: loss of discriminant power when a variate is omitted. Biometrics, 27, 531-534.

Ury, H. (1975) Efficiency of case-control studies with multiple controls per case: continuous or dichotomous data. Biometrics, 31, 643-649.

Utgoff, P.E., Berkman, N.C. and Clouse, J.A. (1997) Decision tree induction based on efficient tree restructuring. Machine Learning, 29, 5-44.

Vacek, P.M. and Ashikaga, R. (1980) An examination of the nearest neighbor rule for imputing missing values. Proceedings of the Statistical Computing Section, American Statistical Association, 421-425. Washington, DC: American Statistical Association.

Valiant, L. (1984) A theory of the learnable. Communications of the ACM, 27, 1134-1142.

Vsn Buuren, S. (2002) Practical issues in multiple imputation for surveys. IMPUTE: Symposium on Nonresponse, Questionnaire Split and Multiple Imputation, Nuremberg, Germany, September 25.

van Buuren, S., Boshuizen, H.C. and Knook, D.L. (1999) Multiple imputation of missing blood pressure covariates in survival analysis. Statistics in Medicine, 18, 681-694.

van Buuren, S., van Mulligen E.M. and Brand, J.P.L. (1994) Routine multiple imputation in statistical data bases. In Proceedings of the Seventh International Working Conference on Scientific and Statistical Database Management, (J.C. French and H. Hinterberger (eds.)), Charlottesville, VA, Sept. 28-30, 1994, 74-78. IEEE Computer Society Press, Los Alamitos, CA.

van Buuren, S. and van Rijckevorsel, J.L.A. (1992) Imputation of missing categorical data by maximizing internal consistency. Psychometrika, 57, 567-580.

van der Laan, P. (2000) Integrating administrative registers and household surveys. Netherlands Official Statistics, 15, 7-15.

van der Putten, P. (2000) Data fusion: a way to provide more data to mine in? Proceedings of the 12th Belgian-Dutch Artificial Intelligence. Conference BNAIC '2000, November 2000. De Efteling, Kaatsheuvel, The Netherlands. 

van der Putten, P. (2000) No mining without data: the data fusion solution. Intelligent Marketing and Finance Colloquium, Erasmus Research Institute of Management, November 27, 2000.

van der Putten, P. and Kok, J.N. (2001) Fusing data sets using IDA algorithms. Leiden Institute of Advanced Computer Science, The Netherlands.

68

Page 69: THE DATA FUSION BIBLIOGRAPHY

van der Putten, P., Kok, J.N. and Gupta, A. (2002) Why the information explosion can be bad for data mining and how data fusion provides a way out. Proceedings of the Second SIAM International Conference on Data Mining. SIAM: Philadelphia, PA.

van der Putten, P., Kok, J.N. and Gupta, A. (2002) Data fusion through statistical matching. Paper 185, Center for eBusiness@MIT, MIT Sloan School of Management.

van Dyk, D.A. and Meng, X.-L. (2001) The art of data augmentation. Journal of Computational and Graphical Staistics, 10(1), 1-111.

van Pelt, X. (2001) The Fusion Factory: A Constrained Data Fusion Approach. Master of Science Thesis, Leiden Institute of Advanced Computer Science, The Netherlands.

Vapnik, V. (1998) Statistical Learning Theory. New York: John Wiley & Sons Inc.

Vapnik, V. (2000) The Nature of Statistical Learning Theory. New York: Springer-Verlag New York.

Vapnik, V.N. and Bottou, L. (1993) Local algorithms for pattern recognition and dependencies estimation. Neural Computing, 6, 851-876.

Velila, S. and Barrio, J.A. (1994) A discriminant rule under transformation. Technometrics, 36, 348-353.

Venes, V. and Stepan, J. (1997) Distributions with Given Marginals and Moment Problems. Dordrecht: Kluwer Academic Publishers.

Vidal, E. (1986) An algorithm for finding nearest neighbors in (approximately) constant average times. Pattern Recognition Letters, 4, 145-157.

Vidale, M.L. (1956) A graphical solution of the transportation problem. Operations Research, 4(2), 193-203.

Vignaux, G. A. and Michalewicz, Z. (1989) Genetic algorithms for the transportation problem. In Methodologies for Intelligent Systems (Ras, Z. (ed.), 4, 252-259. Holland: North-Holland.

Vignaux, G.A. and Michalewicz, Z. (1991) A genetic algorithm for the linear transportation problem. IEEE Transactions on Systems, Man, and Cybernetics, 21(2), 445-452.

Vinod, H. (1969) Integer programming and the theory of grouping. Journal of the American Statistical Association, 64, 506-517.

Vlachonikolos, I. (1990) Predictive discrimination and classification with mixed binary and continuous variables. Biometrika, 77, 657-662.

von Mises, R. (1945) On the classification of observation data into distinct groups. Annals of Mathematical Society, 16, 68-73.

von Winterfeldt, D. and Edwards, W. (1982) Costs and payoffs in perceptual research. Psychological Bulletin, 83, 213-217.

Vriens, M. and Melton, E. (2002) Managing missing data: multiple imputation can improve data quality. Marketing Research, 14, 12-17.

69

Page 70: THE DATA FUSION BIBLIOGRAPHY

Wagner, H.M. (1959) On a class of capacitated transportation problems. Management Science, 5, 304-318.

Wagner, T. (1971) Convergence of the nearest neighborhood rule. IEEE Transactions on Information Theory, 19, 696-699.

Wainer, H. (1976) Estimating coefficients in linear models: It doesn’t make no nevermind. Psychological Bulletin, 83, 213-217.

Wainer, H. (1978) On the sensitivity of regression and regressors. Psychological Bulletin, 85, 267-273.

Wald, A. (1944) On a statistical problem arising in the classification of an individual into one of two groups. Annals of Mathematical Statistics, 15, 145-163.

Walker, S.B. and Duncan, D.B. (1967) Estimation of the probability of an event as a function of several independent variables. Biometrika, 54, 167-179.

Walkup, D. (1979) On the exepcted value of a random assignment problem. SIAM Journal on Computing, 8, 440-442.

Walsh, J.E. (1963) Simultaneous confidence intervals for differences of classification probabilities. Biometrie Zeitschrifte, 5, 231-234.

Walsh, P. (2001) Multibasing: Data integration without regression to the mean. 10th Worldwide Readership Research Symposium, Venice, 57-67.

Wand, M.P. and Jones, M.J. (1995) Kernel Smoothing. London: Chapman and Hall.

Wang, N. and Robins, J.M. (1998) Large-sample theory for parametric multiple imputation procedures. Biometrika, 85, 935-948.

Wang, R., Sedransk, J. and Jinn, J.H. (1992) Secondary data analysis when there are missing observations. Journal of the American Statistical Association, 87, 952-961.

Walsh, P. and Zack, B. (2003) Data integration: validating Telmar’s Multibasing technique. Worldwide Readership Research Symposium, Cambridge, MA, USA.

Webb, G.I. (2000) MultiBoosting: A technique for combining boosting and wagging. Machine Learning, 40(2), 159-196.

Wedel, M. and Kamakura, W.A. (1999) Market Segmentation: Conceptual and Methodological Foundations. Kluwer Academic Publishers: Boston, MA.

Wei, G. and Tanner, M.A. (1990) A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithm. Journal of the American Statistical Association, 85, 699-705.

Weogamd, J. (1986) Combining different media surveys: the Germany partnership model and fusion experiments. Journal of the Market Research Society, 28(2), 189-208.

Weinberg, C.R. and Wacholder, S. (1900) The design and analysis of case-control studies with biased sampling. Biometrics, 46, 953-975.

Weiner, J.M. and Dunn, O.J. (1966) Elimination of variates in linear discriminant problems. Biometrics, 22, 268-275.

70

Page 71: THE DATA FUSION BIBLIOGRAPHY

Weiss, S.M. and Indurkhya, N. (1995) Rule-based machine learning methods for functional prediction. Journal of Artificial Intelligence Research, 3, 383-403.

Weiss, S.M. and Indurkhya, N. (1998) Predictive Data Mining: A Practical Guide. San Francisco, California: Morgan Kaufmann Publishers Inc.

Weiss, S.M. and Kulikowski, C.A. (1991) Computer Systems That Learn: Classification and Prediction Methods from Statistics, Neural Networks, Machine Learning and Expert Systems. San Mateo, California: Morgan Kaufmann.

Welch, B.L. (1939) Note on discriminant functions. Biometrika, 31, 218-220.

Welniak, E.J. and Coder. J.F. (1980) A measure of the bias in the March CPS earnings imputation system. Proceedings of the Section on Survey Research Methods, American Statistical Association, 421,425. Washington, DC: American Statistical Association.

Wendt, F. (1976) Beschriebung einer Fusion. Hamburg: Gruner und Jahr, Schriftenreihe 21.

Wendt, F. (1983) The AG.MA model. Proceedings of the Readership Symposium II. Montreal, Canada.

Wendt, F. (1979) Bericht über ein Fusionsexperiment mit Daten der Niederländerischen Midiauntersuchung NOP 77. Hamburg-Puidoux: Bureau Wendt.

Wendt, F. (1980) Zusammenführung von Daten – Voraussetzungen und Grenzen. Hamburg: Gruner und Jahr, Schriftenreihe 28.

Wendt, F. (1986) Einige Gedanken zur Fusion, Auf dem Wege zum Partnerschaftsmodell, Arbeitsgemeinscharft Media-Analyse e.V., Media-Micro-Census GmbH, Frankfurt, 109-140.

Wendt, F. (2000) Bericht über die Durchführung der Fusionen zur MA 2000, MA 2000, Arbeitsbemeinschaft Media-Analyse e.V., Media-Micro-Census GmbH, Frankfurt, 1-61.

Wendt, F. and Wendt-Maeder, I. (1979) Vom Leser pro Nummer zur Nutzungswahrscheinlichkeit, Teil 1. Arbeitsgemeinschaft Media-Analyse e.V. Schriften Band 5, Media-Micro-Census GmbH, Frankfurt.

Wendt, F. and Wendt-Maeder, I. (19830 Vom Leser pro Nummer zur Nutzungswahrscheinlichkeit, Teil 2. Arbeitsgemeinschaft Media-Analyse e.V. Schriften Band 5, Media-Micro-Census GmbH, Frankfurt.

Wendt-Maeder, I. (1986) Fusionen in AG.MA-Bereich Bedingungen und Möglichkeiten. Arbeitsgemeinschaft Media-Analyse e.V., Media-Micro-Census GmbH, Frankfurt.

Wesler, O. (1959) A classification problem involving multinomials. Annals of Mathematical Statistics, 30, 128-133.

Wettschereck, D. and Dietterich, T.D. (1994) Locally adaptive nearest neighbor algorithms. In Advances of Neural Information Processing Systems 6. San Francisco, California: Morgan Kaufmann Publishers, Inc.

Wicken, G. and Elms, S. (1999) Demystifying data fusion — the why?, the how? and the wow! Advertising Research Foundation Week of Workshops, New York, October 7, 1999.

71

Page 72: THE DATA FUSION BIBLIOGRAPHY

White, A.P. and Liu, W.Z. (1994) Bias in information-based measures in decision tree induction. Machine Learning, 15, 321-329.

White, A.P. and Liu, W.Z. (1999) Statistical properties of tree-based approaches to classification. In Nakhaeizadeh, G. and Taylor, C.C. (eds.) (1997) Machine Learning and Statistics: The Interface. New York: Wiley-Interscience.

Wiedenbeck, M. (1999) Fusion of data and estimation by entropy maximisation. In Symposium 99: Combining Data from Different Sources, Statistics Canada, Ottawa, Canada.

Wiedey, G., and Zimmermann H. J. (1978) Media selection and fuzzy linear programming. Journal of Operational Research Society, 31, 342-349.

Wiegand, J. (1986) Combining different media surveys: the German partnership model and fusion experiments. Journal of the Market Research Society, 28(2), 189-208.

Wigbout, M. (1994) To match or not to match, that is the question. In 30th Annual Conference of the Operational Research Society of New Zealand/45th Annual Conference of the New Zealand Statistical Association, 297-301. Massey University, Palmerston North, New Zealand.

Wilcox, S. and Johnson, H. (1997) Multi-media reach and frequency analysis. Proceedings of the Vancouver Worldwide Readership Symposium.

Wilcox, S. and Read. S. (1990) The marriage of BARB and G-Track: a response to criticism. Admap, January 1990.

Wildner, R. (1994) Die Grenzen von Single Source, planung and analyse, 1, 22-27.

Wildner, R. (2000) Measuring advertising effectiveness in a sample matched in a consumer panel and a television panel. Proceedings of the Fifth Internatioanl Conference on Logic and Methodology, Cologne.

Willenborg, L.C.R.J. (1995) Data editing and imputation from a computational point of view. Statistica Neerlandica, 49, 139-151.

Williams, A.C. (1962) A treatment of transportation problems by decomposition. Journal of the Society for Industrial and Applied Mathematics, 10, 35-48.

Williams, A.C. (1963) A stochastic transportation problem. Operations Research, 11, 759-770.

Williams, E.J. (1961) Tests for discriminant functions. Journal of the Australian Mathematical Society, 2, 243-252.

Williams, G. J. (1955) Significance tests for discriminating functions and linear functional relationships. Biometrika, 42, 360-381.

Williams, T. (2001) Flexible matching imputation: combining hot-deck imputation with model-based methodology. Section on Survey Methods Research, Joint Statistical Meetings, Atlanta, GA.

Williams, W.H. and Mallows, C.L. (1970) Systematic biases in panel surveys due to differential nonresponse. Journal of the American Statistical Association, 65, 1338-1349.

72

Page 73: THE DATA FUSION BIBLIOGRAPHY

Williamson, D.A. (2001) Buyers demand more audience data: Evolving new techniques for measuring media. AdAge.com, Summer 2001 special report.

Winkler, W. E. (1984) Exact Matching Using Elementary Techniques. Energy Information Administration Technical Report, U. S. Dept. of Energy, Washington DC.

Winkler, W. E. (1988) Using the EM algorithm for weight computation in the Fellegi-Sunter Model of record linkage. Proceedings of the Section on Survey Research Methods, American Statistical Association, 667-671. Washington DC: American Statistical Association.

Winkler, W.E. (1989) Frequency-based matching in the Fellegi-Sunter model of record linkage. Proceedings of the Survey Research Methods Section, American Statistical Association, 778-783. Washington DC: American Statistical Association.

Winkler, W.E. (1989) Methods for adjusting for lack of independence in the Fellegi-Sunter model of record linkage. Survey Methodology, 15, 101-117.

Winkler, W. E. (1990) String comparator metrics and enhanced decision rules in the Fellegi-Sunter model of record linkage. Proceedings of the Section on Survey Research Methods, American Statistical Association, 354-359. Washington, DC: American Statistical Association.

Winkler, W. E. (1991) Error model for analysis of computer linked files. Proceedings of the Section on Survey Research Methods, American Statistical Association, 472-477. Washington, DC: American Statistical Association.

Winkler, W. E. (1992) Comparative analysis of record linkage decision rules. Proceedings of the Section on Survey Research Methods, American Statistical Association, 829-834. Washington, DC: American Statistical Association.

Winkler, W.E. (1993) Improved decision rules in the Fellegi-Sunter model of record linkage. Proceedings of the Survey Research Methods Section, American Statistical Association, 274-279. Washington, DC: American Statistical Association.

Winkler, W.E. (1994) Advanced methods for record linkage. Proceedings of the Survey Research Methods, American Statistical Association, 467-472. Washington, DC: American Statistical Association.

Winkler, W. E. (1997) Producing public-use microdata that are analytically valid and confidential. Proceedings of the Section on Survey Research Methods, American Statistical Association, 41-50. Washington, DC: American Statistical Association.

Winkler, W.E. (1999) The state of record linkage and current research problems. U.S. Bureau of the Census, Statistical Research Report Series, No. RR1999/04. Washington, DC: U.S. Bureau of the Census.

Winkler, W.E. (2000) Using the EM-algorithm for weight computation in the Fellegi-Sunter model of record linkage. U.S. Bureau of the Census, Statistical Research Report Series, No. RR2000/05. Washington, DC: U.S. Bureau of the Census.

Winkler, W. E. and Scheuren, F. (1991) How Matching Error Effects Regression Analysis: Exploratory and Confirmatory Results. Unpublished report, Washington DC: Statistical Research Division Technical Report, U.S.

Winter, F. (1980) Match target markets to media audiences. Journal of Advertising Research, 20(1), 61-65.

73

Page 74: THE DATA FUSION BIBLIOGRAPHY

Wolberg, W.H. and Mangasarian, O.L. (1990) Cancer diagnosis via linear programming. SIAM News, 23(5), 1-18.

Wolfson, M.C. (1995) Socio-Economic Statistics and Public Policy: A New Role for Microsimulation Modeling. 50th Session of the International Statistical Institute, Beijing, China.

Wolfson, M., Gribble, S., Bordt, M., Murphy, B. and Rowe, G. (1989) The social policy simulation database and model: an example of survey and administrative data integration. Survey of Current Business, 69, 36-41.

Wood, L. (2002) Addressing the needs of buyers and planners in a multimedia world. Staying Tuned 2002. February 11, Toronto, Canada.

Wood, L. (2002) The Fusion Lab – Report to the Industry. Do Fusions Track/Trend Year-to-Year? ARF Week of Workshops. October 10, New York City.

Wolpert, D. (1992) Stacked generalization. Neural Networks, 5, 241-259.

Wolpert, D. (1992) On the connection between in-sample testing and generalization error. Complex Systems, 6(1), 47-94.

Wolpert, D.H. and MacReady, W.G. (1997) No free lunch theorem for optimization. IEEE Transaction on Evolutionary Computation, 1, 67-82.

Wolverton, C.T. (1973) Strong consistency of an estimate of the asymptotic error probability of the nearest neighbor rule. IEEE Transactions on Information Theory, IT-19, 119-120.

Wolverton, C.T. and Wagner, T.J. (1969) Asymptotically optimal discriminant functions for pattern classification. IEEE Transactions on Information Theory, IT-15, 258-265

Woodbury, M. (1983) Statistical record matching for files. In Incomplete Data in Sample Surveys, Volume 3, Proceedings of the Symposium. W.G. Madow and I. Olkin (eds.). New York: Academic Press, 203-212.

Wyman, F.J., Young, D.M. and Turner, D.W. (1990) A comparison of asymptotic error rate for the sample linear discriminant functions. Pattern Recognition, 23(7), 775-785.

Xie, F. and Paik, M.C. (1997) Multiple imputation methods for the missing covariates in generalized estimating equation. Biometrics, 53, 1538-1546.

Yancey, W.E. (2000) Frequency-dependent probability measures for record linkage. U.S. Bureau of the Census, Statistical Research Report Series, No. RR2000/07. Washington, DC: U.S. Bureau of the Census.

Yoshizoe, Y. and Araki, M. (1999) Use of statistical matching for household surveys in Japan. In 52nd Session of the International Statistical Institute, Helsinki, Finland.

Young, R. (1997) Minimizing ascription. Proceedings of the Magazine Audience Measurement Research. New York: Advertising Research Foundation.

Yunck, T. (1976) A technique to identify nearest neighbors. IEEE Transactions on Systems, Man and Cybernetics, 6, 678-683.

74

Page 75: THE DATA FUSION BIBLIOGRAPHY

Zanakas, S.H. and Gupta, S.K. (1985) A categorized bibliographic survey of goal programming. Omega, 13, 211-222.

Zangwill, W.I. (1965). Media selection by decision programming. Journal of Advertising Research, 5, 30-36.

Zanutto, E.L. (1998) Modeling matched substitutes to create multiple imputations for unit nonrespondents. Proceedings of the Section on Government Statistics, American Statistical Association. Washington, DC: American Statistical Association.

Zavanella, B. (1990) Statistical matching of different sources. In Income and Wealth Distribution, Inequality and Poverty (C. Dagum and M. Zenga (eds)), 361-374. Berlin: Springer-Verlag.

Zeleny, M. (1982) The pros and cons of goal programming. Computers and Operations Research, 8, 357-359.

Zhang, S. (1991) On anti-cycling pivoting rules for the simplex method. Operations Research Letters, 10, 189-192.

Zornig, P. (1991) Degeneracy Graphs and Simplex Cycling. Lecture Notes in Economics and Mathematical Systems. Berlin: Springer-Verlag.

Zweig, M.H. and Campbell, G. (1993) Receiver-operating characteristic (ROC) plots. Clinical Chemistry, 29, 561-577.

75


Recommended