Sie befinden Sich nicht im Netzwerk der Universität Paderborn. Der Zugriff auf elektronische Ressourcen ist gegebenenfalls nur via VPN oder Shibboleth (DFN-AAI) möglich. mehr Informationen...
Ergebnis 25 von 5418
Journal of computer science and technology, 2017-05, Vol.32 (3), p.644-662
2017
Volltextzugriff (PDF)

Details

Autor(en) / Beteiligte
Titel
EntityManager: Managing Dirty Data Based on Entity Resolution
Ist Teil von
  • Journal of computer science and technology, 2017-05, Vol.32 (3), p.644-662
Ort / Verlag
New York: Springer US
Erscheinungsjahr
2017
Quelle
Alma/SFX Local Collection
Beschreibungen/Notizen
  • Data quality is important in many data-driven applications, such as decision making, data analysis, and data mining. Recent studies focus on data cleaning techniques by deleting or repairing the dirty data, which may cause information loss and bring new inconsistencies. To avoid these problems, we propose EntityManager, a general system to manage dirty data without data cleaning. This system takes real-world entity as the basic storage unit and retrieves query results according to the quality requirement of users. The system is able to handle all kinds of inconsistencies recognized by entity resolution. We elaborate the EntityManager system, covering its architecture, data model, and query processing techniques. To process queries efficiently, our system adopts novel indices, similarity operator and query optimization techniques. Finally, we verify the efficiency and effectiveness of this system and present future research challenges.

Weiterführende Literatur

Empfehlungen zum selben Thema automatisch vorgeschlagen von bX