Read our feature in the December 2012 Sigmod Record.
The Data Analytics group at QCRI has built expertise focused on three core data management challenges that will enable the effective use of this growing asset class: extraction from its natural digital habitat, integration from a large and evolving number of sources, and robust cleaning processes to assure data quality and validation.
The Data Trio: Extraction, Integration, and Cleaning: Institutions and industries at a national level deal with large scale, heterogeneous data collected from large number of sources. The main challenge is a judicious use of the information within and across organizations to make informed decisions and to run operations effectively.
At QCRI, we are focusing on the interaction among three core data management challenges that will enable effective use of the continuously growing data: Information Extraction, Data and Schema Integration, and Data Cleaning.
Going beyond traditional ETL approaches, we are investigating multiple new directions, including: handling unstructured data; interleaving extraction, integration, and cleansing tasks in a more dynamic and interactive process that responds to evolving data sets and real-time decision-making constraints; and leveraging the power of human cycles to solve hard problems such as data cleaning and information integration.
Scalable Knowledge Models: Grand challenges mean big data. ‘Knowledge base’ is the term commonly used to refer to data, along with the rules and the logic that describe the information within this data. Large-scale knowledge management is a core-computing challenge due to the expensive process involved in reasoning about the data and inferring the facts and the various semantics embedded within. We focus on developing efficient knowledge representation models and semantic-aware query languages and processing engines that bring semantics to real applications. Main applications domains include media and health, where current approaches are either too expensive or fall short in delivering user needs.
QCRI is at an exciting stage of growth with world-class researchers. I am excited about the opportunity to establish QCRI at the forefront of data mining research.
In the News
QCRI's MicroMappers used for Typhoon Haiyan
QCRI's MicroMappers helps to radically change how we respond to disasters like Supertyphoon Haiyan
QCRI's MicroMappers software provides teams with data-driven map of what they should be doing and where
QCRI and Boeing are organizers of the first annual Machine Learning and Data Analytics Symposium. Submissions are due January 24, 2014.
QCRI's Dr. Halima Bensmail, Computational Science and Engineering, will be a keynote speaker at this event.
The Qatar Foundation Annual Research Conference (ARC ’13) will bring together distinguished researchers and experts to discuss the growing dependence on computing and networking technologies by the world’s economies, and the unprecedented security concerns and risk this has to a nation’s cyber infrastructure. For the first time at the annual conference, organised by Qatar Foundation Research and Development (QF R&D), cyber security as a large-scale research challenge will be addressed in-depth through discussions and debates on risks, emerging threats and opportunities.
In an effort to make Arabic more accessible to those unfamiliar with the language, Qatar Foundation International (QFI) and Qatar Computing Research Institute (QCRI) recently launched Madar Al-Huruf ...
Qatar Computing Research Institute (QCRI) commemorated the end of a successful 2013 summer internship programme on Wednesday with a closing ceremony and speech from guest speaker Dr. Tarek El Fouly, from the College of Engineering at Qatar University.