Data Analytics

DA group photo

Read our feature in the December 2012 Sigmod Record.

The Data Analytics group at QCRI has built expertise focused on three core data management challenges that will enable the effective use of this growing asset class: extraction from its natural digital habitat, integration from a large and evolving number of sources, and robust cleaning processes to assure data quality and validation.

The Data Trio: Extraction, Integration, and Cleaning:  Institutions and industries at a national level deal with large scale, heterogeneous data collected from large number of sources. The main challenge is a judicious use of the information within and across organizations to make informed decisions and to run operations effectively.

At QCRI, we are focusing on the interaction among three core data management challenges that will enable effective use of the continuously growing data: Information Extraction, Data and Schema Integration, and Data Cleaning.

Going beyond traditional ETL approaches, we are investigating multiple new directions, including: handling unstructured data; interleaving extraction, integration, and cleansing tasks in a more dynamic and interactive process that responds to evolving data sets and real-time decision-making constraints; and leveraging the power of human cycles to solve hard problems such as data cleaning and information integration.

Scalable Knowledge Models: Grand challenges mean big data. ‘Knowledge base’ is the term commonly used to refer to data, along with the rules and the logic that describe the information within this data. Large-scale knowledge management is a core-computing challenge due to the expensive process involved in reasoning about the data and inferring the facts and the various semantics embedded within. We focus on developing efficient knowledge representation models and semantic-aware query languages and processing engines that bring semantics to real applications. Main applications domains include media and health, where current approaches are either too expensive or fall short in delivering user needs.


For technical or informational questions, please send an email to 
QCRI Careers with the name of the group to whom you’re directing your question, e.g. ALT, CS&E, Cyber Security, Data Analytics, Distributed Systems or Social Computing, in the subject line.

Principal Scientist

genericImage.jpg

Dr. Mourad Ouzzani

To be part of something different than what I had been used to at Purdue University and contribute to the first computing research institution in the region.
Read more

Principal Scientist

Dr. Sanjay Chawla

QCRI provides an ideal environment to conduct high-impact research which can transcend disciplinary boundaries.
Read more
our-research/data-analytics
Open Source Release:
  • NADEEF.  A semi-automatic extensible data cleaning system.
Meet us at the following conferences:
Learn more about our us:
default

Follow Us

  • YouTube
  • Twitter
  • Facebook
  • RSS Feed
  • Linkedin
  • github-web.png
Back to Top

In the Media

The FOundation.jpg

A Digital Companion

03/02/2016

v\:* {behavior:url(#default#VML);} o\:* {behavior:url(#default#VML);} w\:* {behavior:url(#default#VML);} .shape {behavior:url(#default#VML);} QCRI's Jalees e-book platform is changing how Arabic ...

Read More

Huff Post.jpg

How Digital Humanitarians Are Closing the Gaps In Worldwide Disaster Response

01/02/2016

It is now commonplace for people around the world to use social media during emergencies, and the volume of online information coupled with its rapid arrival is becoming increasingly overwhelming to ...

Read More

Peninsulalogoforweb.jpg

CMU-Q to host regional 24-hour hackathon

21/01/2016

DOHA: Carnegie Mellon University in Qatar (CMU-Q) will host its first regional 24-hour hackathon from Friday at the university’s campus in Education City.  The CarnegieApps Hackathon is an annual ...

Read More

Upcoming Events

2016

Default Thumbnail

Machine Learning and Data Analytics Symposium - MLDAS 2016

Download ICS File 14/03/2016  - 15/03/2016 ,

Machine Learning and Data Analytics Symposium - MLDAS 2016 Building on the success of MLDAS 2015 and MLDAS 2014 , The Third Machine Learning and Data Analytics (MLDAS) Symposium , will be held on ...

Read More

CSAIL Logo 226.png

QCRI-MIT CSAIL Annual Meeting 2016

Download ICS File 20/03/2016 ,

Open invitation to attend the annual research project review meeting by QCRI and MIT- CSAIL. Executive overview sessions will highlight our eight main collaborative projects: Understanding Health ...

Read More

Rus QCRI WEB.JPG

Self-Driving Cars Are Coming - Public Talk by Daniela Rus

Download ICS File 20/03/2016 ,

Abstract: We spend a lot of time in out cars, yet this is a part of our lives where we have been vulnerable to the world's leading cause of bodily harm. Now, the digitization of practically ...

Read More

News Releases

HBKU-logo-final-(2).jpg

QCRI Humanitarian Technology Becomes First from the Middle East to Win the Open Source Software World Challenge Grand Prize

07/12/2015

Doha, December 6, 2015 – Qatar Computing Research Institute, one of Hamad bin Khalifa University’s three specialized national research institutes, recently won the esteemed Open Source Software World...

Read More

1 Lunch and Learn.jpg

Qatar Computing Research Institute Welcomes New Batch of Students to Summer Internship Programme

02/06/2015

Hands-On Programme Offers Undergraduate Students An Opportunity To Conduct Research And Gain Real-World Experience Doha, Qatar, 02 June 2015 - Enjoying its fourth consecutive year of success, the ...

Read More

Farnam Jahanian - Copy.JPG

Dr Farnam Jahanian joins Qatar Computing Research Institute's Scientific Advisory Committee

27/05/2015

The Provost of Carnegie Mellon University Brings A Wealth Of Knowledge And Expertise To Qatar Foundation-Based Research Institute

Read More