Automated metadata Extraction supported by artificial intelligence


METADATA capabilities

pre processing icon.png

Content Pre-Processing

During pre-processing tokenization (a process of dividing your text into words) and part-of-speech tagging (classifying each word on its grammatical category) take place. This gets your text ready for further content analysis and metadata tagging.


entity extraction icon.png

Entity Extraction

EDIA’s entity extractor collects valuable keywords and named entities from your text. Keywords are words that have a meaning on their own such as force in physics or liver in biology. Named entities are proper names of people, organisations or products, such as Apple or Peter Smith.


readability icon.png

Readability Analysis

EDIA’s readability analyzer provides an overview of the reading difficulty level of your texts. The reading difficulty of a text can be analysed through various difficulty scales such as CEFR, Flesch-Kincaid and others.


topic classification icon.png

Topic Classification

EDIA’s Topic Classifier identifies what your text is about from a taxonomy of predefined topical categories. EDIA supports IAB topical taxonomy. Other topical taxonomies such as curricula, learning objective taxonomies, and others are possible.


Get Started

Our capabilities are delivered in the form of APIs or CMS (content management system) integrations that automatically extract valuable metadata from your educational content.

Get more insights. Review our collection of case studies, reports, and more: