Credits:
4
Hours:
42
Area:
Big Data Sensing & Procurement
Description:
The module provides the description of a search engine structure and of Text Mining tools, by analyzing their characteristics and limits with respect to the computational cost, the precision/recall/F1 parameters, and the expressivity of the supported queries. The module is also based on hands-on activities that will present well-known open-source Python tools for the crawling and analysis of web pages, the semantic annotation of texts (TagMe), and the indexing of text data collections (ElasticSearch).
Technics and tools:
TagMe, Rake, Python