World Bank Knowledge for Change Program – Full Proposal ... | Summaries Machine Learning

World Bank Knowledge for Change Program – Full Proposal Template

Page 1 of 14

Basic Data:

Title

Structuring 50 years of knowledge on development –

Applying natural language processing (NLP) models to a

corpus of 500,000+ documents

Linked Project ID

N.A.

Product Line

Applied Amount ($)

60,000

Est. Project

Period

02/15/2020

-06/15/2021

Team Leader(s)

Olivier Dupriez

Managing

Unit

DECAT

Contributing

unit(s)

DECAT

Funding Window

Innovation in Data Production, Analysis and Dissemination

Regions/Countries

World

General:

1. What is the Development Objective (or main objective) of this Grant?

LOCATING KNOWLEDGE. The design of economic and social development policies and programs, and

research on development issues, must start with a review and understanding of existing knowledge on the

subject(s) of interest. A common problem that policy makers, program managers, and researchers face when

doing such exploratory work is the identification of the most relevant information available. Discovery

methods provided by data and documents repositories—through filtering or exact keyword matching—limit

the relevance of the results returned to the user; such methods will return the materials that literally match

the query (lexical search), not those are semantically or conceptually satisfying it (semantic or conceptual

search). For example, a search for “malnutrition” may fail to return data or documents related to stunting,

wasting, or obesity. Also, the search functionalities in data and documents catalogs do not provide adequate

solutions to identify documents and datasets based on the combination and relative importance of the topics

they address. In summary, search functionalities in data and documents catalogs may be effective at helping

users who know precisely what they need to find and who are able to formulate queries accordingly, but

they perform poorly as recommender systems.

IDENTIFYING KNOWLEDGE GAPS. Another problem is the identification of gaps in the available resources.

Assessing how extensively certain topics (or combination of topics) have been addressed in a knowledge

repository, and tracking the coverage of emerging themes, can lead to a better understanding of the

dynamics of development policies and to a better alignment of research work to operational priorities.

Such problems can be largely solved by exploiting machine learning algorithms—natural language processing

(NLP) in particular—designed to discover information in vast amounts of data and documents that cannot

possibly be processed manually.

World Bank Knowledge for Change Program – Full Proposal ..., Summaries of Machine Learning

Related documents

Partial preview of the text

Download World Bank Knowledge for Change Program – Full Proposal ... and more Summaries Machine Learning in PDF only on Docsity!

Basic Data:

DECAT

DECAT

General:

Disbursement Projection