A Matter of Words: NLP for Quality Evaluation of Wikipedia Medical Articles

Vittoria Cozza, Marinella Petrocchi, Angelo Spognardi

Research output: Chapter in Book/Report/Conference proceeding, Article in proceedings, Research, peer-review

Abstract

Automatic quality evaluation of Web information is a task with many fields of application and of great relevance, especially in critical domains such as the medical one. We start from the intuition that the quality of the content of medical Web documents is affected by features related to the specific domain: first, the usage of a specific vocabulary (Domain Informativeness); then, the adoption of specific codes (like those used in the infoboxes of Wikipedia articles) and the type of document (e.g., historical and technical ones). In this paper, we propose to leverage specific domain features to improve the results of the evaluation of Wikipedia medical articles, relying on Natural Language Processing (NLP) and dictionary-based techniques. The results of our experiments confirm that, by considering domain-oriented features, it is possible to improve on existing solutions, mainly for those articles that other approaches have classified less correctly.
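
As an illustration of what a dictionary-based vocabulary feature could look like, the following is a minimal sketch only. The abstract does not define how Domain Informativeness is computed, so the ratio used here (share of tokens found in a domain dictionary) is an assumption, and the names `MEDICAL_TERMS`, `tokenize`, and `domain_term_ratio` are hypothetical, not the authors' implementation.

```python
import re

# Hypothetical toy dictionary (assumption); a real system would load
# a proper medical vocabulary instead of this hand-picked set.
MEDICAL_TERMS = {"diagnosis", "symptom", "therapy", "infection", "chronic"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def domain_term_ratio(text, vocabulary=MEDICAL_TERMS):
    """Return the share of tokens that appear in the domain dictionary.

    This is an assumed stand-in for a vocabulary-based feature,
    not the definition used in the paper.
    """
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in vocabulary)
    return hits / len(tokens)
```

A score like this could serve as one input feature among others to a quality classifier; on its own it only measures how much of the text draws on the chosen dictionary.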
Original language: English
Title of host publication: Web Engineering: 16th International Conference, ICWE 2016, Lugano, Switzerland, June 6-9, 2016. Proceedings
Number of pages: 9
Volume: 9671
Publisher: Springer
Publication date: 2016
Pages: 448-456
ISBN (Print): 978-3-319-38790-1
ISBN (Electronic): 978-3-319-38791-8
Publication status: Published - 2016
Event: The 16th International Conference on Web Engineering - USI Lugano, Switzerland
Duration: 6 Jun 2016 to 9 Jun 2016
Conference number: 16

Conference

Conference: The 16th International Conference on Web Engineering
Number: 16
Country/Territory: Switzerland
City: USI Lugano
Period: 06/06/2016 to 09/06/2016
Series: Lecture Notes in Computer Science
ISSN: 0302-9743

Keywords

  • Information Systems Applications (incl. Internet)
  • Information Storage and Retrieval
  • Software Engineering
  • Computer Appl. in Administrative Data Processing
  • User Interfaces and Human Computer Interaction
  • Artificial Intelligence (incl. Robotics)
