Vis enkel innførsel

dc.contributor.advisorLi-Chun Zhang
dc.contributor.advisorKathrine Frey Frøslie
dc.contributor.authorBauer-Nilsen, Louise Risholm
dc.date.accessioned2023-08-24T16:27:10Z
dc.date.available2023-08-24T16:27:10Z
dc.date.issued2023
dc.identifierno.nmbu:wiseflow:6839610:54592628
dc.identifier.urihttps://hdl.handle.net/11250/3085703
dc.description.abstractThis thesis proposes a generative approach to COICOP classification using entity resolution and maximum entropy classification as a formal framework. The current limitations in COICOP classification are related to the corpus of item descriptions and lack of data. I propose a new perspective on the classification task at hand, as I argue that the underlying problem in classification is the data itself. Therefore, corpus and feature engineering are crucial when improving classification. The proposed approach aims to engineer the corpus to construct an entity forest from the item descriptions, where terms in the description are mapped to the roots and branches of trees in the entity forest. The results of the proposed approach are illustrated by a proof-of-concept with data from Statistics Norway. This thesis provides insight into the problems with previous approaches to COICOP classification and shows how we potentially can achieve true resolution and more accurate classification.
dc.description.abstract
dc.languageeng
dc.publisherNorwegian University of Life Sciences
dc.titleMaximum Entropy COICOP Classification using Entity Forest
dc.typeMaster thesis


Tilhørende fil(er)

Thumbnail

Denne innførselen finnes i følgende samling(er)

Vis enkel innførsel