Academic Paper

Midas: integrating public financial data
Document Type
Conference
Source
Proceedings of the 2010 ACM SIGMOD International Conference on Management of Data, pp. 1187-1190
Subject
data cleansing
financial data
information extraction
information integration
Language
English
Abstract
The primary goal of the Midas project is to build a system that enables easy and scalable integration of unstructured and semi-structured information present across multiple data sources. As a first step in this direction, we have built a system that extracts and integrates information from regulatory filings submitted to the U.S. Securities and Exchange Commission (SEC) and the Federal Deposit Insurance Corporation (FDIC). Midas creates a repository of entities, events, and relationships by extracting, conceptualizing, integrating, and aggregating data from unstructured and semi-structured documents. This repository enables applications to use the extracted and integrated data in a variety of ways including mashups with other public data and complex risk analysis.
