A Theory of Term Importance in Automatic Text Analysis
dc.contributor.author | Salton, Gerard | en_US |
dc.contributor.author | Yang, C. S. | en_US |
dc.contributor.author | Yu, C. T. | en_US |
dc.date.accessioned | 2007-04-19T19:08:10Z | |
dc.date.available | 2007-04-19T19:08:10Z | |
dc.date.issued | 1974-07 | en_US |
dc.description.abstract | Most existing automatic content analysis and indexing techniques are based on word frequency characteristics applied largely in an ad hoc manner. Contradictory requirements arise in this connection, in that terms exhibiting high occurence frequencies in individual documents are often useful for high recall performance (to retrieve many relevant items), whereas terms with low frequency in the whole collection are useful for high precision (to reject nonrelevant items). | en_US |
dc.format.extent | 1419909 bytes | |
dc.format.extent | 820751 bytes | |
dc.format.mimetype | application/pdf | |
dc.format.mimetype | application/postscript | |
dc.identifier.citation | http://techreports.library.cornell.edu:8081/Dienst/UI/1.0/Display/cul.cs/TR74-208 | en_US |
dc.identifier.uri | https://hdl.handle.net/1813/6048 | |
dc.language.iso | en_US | en_US |
dc.publisher | Cornell University | en_US |
dc.subject | computer science | en_US |
dc.subject | technical report | en_US |
dc.title | A Theory of Term Importance in Automatic Text Analysis | en_US |
dc.type | technical report | en_US |