학술논문

Mining DEOPS Records: Big Data's Insights into Dictatorship
Document Type
Conference
Source
2014 IEEE 10th International Conference on e-Science e-Science (e-Science), 2014 IEEE 10th International Conference on. 2:67-70 Oct, 2014
Subject
Computing and Processing
Data mining
Crowdsourcing
Character recognition
Smart phones
Strips
Optical character recognition software
Presses
artificial intelligence
data mining
machine learning
image processing
text processing
crowdsourcing
big data
escience
DEOPS
Brazilian military dictatorship
Language
Abstract
Historical data provide valuable information for the nderstanding of human interactions through time. However, mining this data is challenging as the available records are generally noise digitized handwritten, typewritten or press printed documents. In this research proposal, we plan to develop tools and techniques for pre-processing and extracting information from documents of the military dictatorship period that ruled Brazil from 1964 to 1985. The data to be analyzed consists of digitized images of records from DEOPS/SP (São Paulo State Department of Political and Social Order), an emblematic police agency which have monitored (and in some cases, harassed and tortured) hundreds of thousands Brazilian citizens during that period. The idea is to use state-of-the-art powerful artificial intelligence algorithms in conjunction with crowd sourcing techniques to pre-process and extract information from this important period of the Brazilian History.