doi: 10.6017/ital.v30i1.3040

A Simple Scheme for Book Classification Using Wikipedia

Andromeda Yelton

Abstract


Because the rate at which documents are being generated outstrips librarians’ ability to catalog them, an accurate, automated scheme of subject classification is desirable. However, simplistic word-counting schemes miss many important concepts; librarians must enrich algorithms with background knowledge to escape basic problems such as polysemy and synonymy. I have developed a script that uses Wikipedia as context for analyzing the subjects of nonfiction books. Though a simple method built quickly from freely available parts, it is partially successful, suggesting the promise of such an approach for future research.

This work is licensed under a Creative Commons Attribution 3.0 License.


ISSN: 2163-5226