首页 | 本学科首页   官方微博 | 高级检索  
     

THE EFFECT OF MISLABELED SAMPLES ON THE PERFORMANCE OF THE LINEAR LEARNING MACHINE
引用本文:BARRY K.LAVINE ANTHONY J.I.WARD JIAN HWA HAN Department of Chemistry,Clarkson University,Potsdam,NY,U.S.A.ROY-KEITH SMITH Georgia Department of Agriculture,Capitol Square,Atlant,GA,U.S.A.ORLEY R.TAYLOR Department of Entomology,University of Kansas,Lawrence,KS,U.S.A.. THE EFFECT OF MISLABELED SAMPLES ON THE PERFORMANCE OF THE LINEAR LEARNING MACHINE[J]. 地理学报(英文版), 1990, 0(1)
作者姓名:BARRY K.LAVINE ANTHONY J.I.WARD JIAN HWA HAN Department of Chemistry  Clarkson University  Potsdam  NY  U.S.A.ROY-KEITH SMITH Georgia Department of Agriculture  Capitol Square  Atlant  GA  U.S.A.ORLEY R.TAYLOR Department of Entomology  University of Kansas  Lawrence  KS  U.S.A.
作者单位:BARRY K.LAVINE ANTHONY J.I.WARD JIAN HWA HAN Department of Chemistry,Clarkson University,Potsdam,NY 13676,U.S.A.ROY-KEITH SMITH Georgia Department of Agriculture,Capitol Square,Atlanta,GA 30334,U.S.A.ORLEY R.TAYLOR Department of Entomology,University of Kansas,Lawrence,KS 66045,U.S.A.
摘    要:Over the past 15 years the linear learning machine has been applied to a large number of chemicalproblems.The learning machine approach is conceptually simple and does not require knowledge aboutthe statistical distribution of the data.However,there are problems associated with this approach.Oneproblem which has not been investigated is the influence of mislabeled samples on the positioning of thehyerplane in feature space.If a few samples in a data set are incorrectly tagged prior to training(i.e.thesamples are labeled as members of class 2 even though they are actually members of class 1),it is stilIpossible using the linear learning machine to achieve a classification success rate of 100% for the trainingset.However,unfavorable results will be obtained for the prediction set.The magnitude of this effect andits potential implications regarding the proper use of the linear learning machine are discussed.


THE EFFECT OF MISLABELED SAMPLES ON THE PERFORMANCE OF THE LINEAR LEARNING MACHINE
BARRY K.LAVINE ANTHONY J.I.WARD JIAN HWA HAN. THE EFFECT OF MISLABELED SAMPLES ON THE PERFORMANCE OF THE LINEAR LEARNING MACHINE[J]. Journal of Geographical Sciences, 1990, 0(1)
Authors:BARRY K.LAVINE ANTHONY J.I.WARD JIAN HWA HAN
Affiliation:BARRY K.LAVINE ANTHONY J.I.WARD JIAN HWA HAN Department of Chemistry,Clarkson University,Potsdam,NY,U.S.A.ROY-KEITH SMITH Georgia Department of Agriculture,Capitol Square,Atlant,GA,U.S.A.ORLEY R.TAYLOR Department of Entomology,University of Kansas,Lawrence,KS,U.S.A.
Abstract:Over the past 15 years the linear learning machine has been applied to a large number of chemical problems.The learning machine approach is conceptually simple and does not require knowledge about the statistical distribution of the data.However,there are problems associated with this approach.One problem which has not been investigated is the influence of mislabeled samples on the positioning of the hyerplane in feature space.If a few samples in a data set are incorrectly tagged prior to training(i.e.the samples are labeled as members of class 2 even though they are actually members of class 1),it is stilI possible using the linear learning machine to achieve a classification success rate of 100% for the training set.However,unfavorable results will be obtained for the prediction set.The magnitude of this effect and its potential implications regarding the proper use of the linear learning machine are discussed.
Keywords:Classification  Pattern recognition  Preprocessing
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号