






Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
The use of text classification techniques to extract information from web documents and generate knowledge bases. Docsity.com is a system that trains machine-learning subsystems to predict classes and relations, populates the knowledge base with data collected from the web, and provides ontology and training examples as inputs. The document also covers knowledge extraction, which consists of assigning a new web page to a class and filling in class attributes by extracting relevant information. Various classification methods, including naive bayes, are applied to different datasets, such as news stories and email filtering.
Typology: Slides
1 / 10
This page cannot be seen from the preview
Don't miss anything!






