Home | Activities | Resources & Tools | Portal | Publications | Events | People | Links |
The coreference system has been developed to decide wether two mentions refer to the same entity or not.
The input of the systems consists of a list of Named Entities of type Person, i.e. Person Names (PNs), that have been automatically recognized in a document collection. Its output consists of a number of clusters of PNs, where each cluster is interpreted as the set of PNs that refer to the same entity.
The system has three main modules, depicted in Figure 1, corresponding to three steps of analysis: Person Name Splitter, Local Coreference module and Global Coreference module.
Fig. 1: Coreference System Architecture. |
Fig. 2: Last_Name Perceptron. |
maxL p(l , L , l_L , t) = maxL p(t) p(L | t) p(l | L, t) p(l_L | l , L , t)
Fig. 3: Probability formula, where l stands for a first name, L for last name, l_L for first last name combination and t for topic.
Enter the Ontotext Portal
(access is restricted to the partners of the project)
Maintainer: bentivofbk.eu
Last modified: Tue Aug 28 2007