eCommons

 

A Theory of Term Importance in Automatic Text Analysis

dc.contributor.authorSalton, Gerarden_US
dc.contributor.authorYang, C. S.en_US
dc.contributor.authorYu, C. T.en_US
dc.date.accessioned2007-04-19T19:08:10Z
dc.date.available2007-04-19T19:08:10Z
dc.date.issued1974-07en_US
dc.description.abstractMost existing automatic content analysis and indexing techniques are based on word frequency characteristics applied largely in an ad hoc manner. Contradictory requirements arise in this connection, in that terms exhibiting high occurence frequencies in individual documents are often useful for high recall performance (to retrieve many relevant items), whereas terms with low frequency in the whole collection are useful for high precision (to reject nonrelevant items).en_US
dc.format.extent1419909 bytes
dc.format.extent820751 bytes
dc.format.mimetypeapplication/pdf
dc.format.mimetypeapplication/postscript
dc.identifier.citationhttp://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR74-208en_US
dc.identifier.urihttps://hdl.handle.net/1813/6048
dc.language.isoen_USen_US
dc.publisherCornell Universityen_US
dc.subjectcomputer scienceen_US
dc.subjecttechnical reporten_US
dc.titleA Theory of Term Importance in Automatic Text Analysisen_US
dc.typetechnical reporten_US

Files

Original bundle
Now showing 1 - 2 of 2
Loading...
Thumbnail Image
Name:
74-208.pdf
Size:
1.35 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
74-208.ps
Size:
801.51 KB
Format:
Postscript Files