Data Analytics

DA group photo

Read our feature in the December 2012 Sigmod Record.

The Data Analytics group at QCRI has built expertise focused on three core data management challenges that will enable the effective use of this growing asset class: extraction from its natural digital habitat, integration from a large and evolving number of sources, and robust cleaning processes to assure data quality and validation.

The Data Trio: Extraction, Integration, and Cleaning:  Institutions and industries at a national level deal with large scale, heterogeneous data collected from large number of sources. The main challenge is a judicious use of the information within and across organizations to make informed decisions and to run operations effectively.

At QCRI, we are focusing on the interaction among three core data management challenges that will enable effective use of the continuously growing data: Information Extraction, Data and Schema Integration, and Data Cleaning.

Going beyond traditional ETL approaches, we are investigating multiple new directions, including: handling unstructured data; interleaving extraction, integration, and cleansing tasks in a more dynamic and interactive process that responds to evolving data sets and real-time decision-making constraints; and leveraging the power of human cycles to solve hard problems such as data cleaning and information integration.

Scalable Knowledge Models: Grand challenges mean big data. ‘Knowledge base’ is the term commonly used to refer to data, along with the rules and the logic that describe the information within this data. Large-scale knowledge management is a core-computing challenge due to the expensive process involved in reasoning about the data and inferring the facts and the various semantics embedded within. We focus on developing efficient knowledge representation models and semantic-aware query languages and processing engines that bring semantics to real applications. Main applications domains include media and health, where current approaches are either too expensive or fall short in delivering user needs.


For technical or informational questions, please send an email to 
QCRI Careers with the name of the group to whom you’re directing your question, e.g. ALT, CS&E, Cyber Security, Data Analytics, Distributed Systems or Social Computing, in the subject line.

Principal Scientist

genericImage.jpg

Dr. Mourad Ouzzani

To be part of something different than what I had been used to at Purdue University and contribute to the first computing research institution in the region.
Read more

Principal Scientist

Dr. Sanjay Chawla

QCRI provides an ideal environment to conduct high-impact research which can transcend disciplinary boundaries.
Read more
our-research/data-analytics
Open Source Release:
  • NADEEF.  A semi-automatic extensible data cleaning system.
Meet us at the following conferences:
Learn more about our us:
default

Follow Us

  • YouTube
  • Twitter
  • Facebook
  • RSS Feed
  • Linkedin
  • github-web.png
Back to Top

In the Media

World Economic Forum.png

Six Ways Social Media is Changing the World

11/04/2016

Around the world, billions of us use social media every day, and that number just keeps growing. In fact, it’s estimated that by 2018, 2.44 billion people will be using social networks, up from ...

Read More

New Scientist.JPG

AI helps answer thousands of health queries in Zambia via SMS

10/04/2016

For many people in Zambia with health queries, sending a text message is the best way to get it answered. U-report, a free SMS-based service set up by UNICEF and run by volunteers, receives many ...

Read More

Poynter.JPG

What Does the Future of Automated Fact-Checking Look Like?

07/04/2016

DURHAM, N.C. — There’s nothing new about trying to correct the record in real time. Even a couple of decades ago, campaign aides would walk through the press sections at debates, circulating freshly ...

Read More

Events

Past Events

2016

ARC2016icon.jpg

Qatar Foundation Annual Research Conference 2016 (ARC'16)

Download ICS File 22/03/2016  - 23/03/2016 ,

The Qatar Foundation Annual Research Conference 2016 (ARC’16) will be held on 22nd and 23rd March,2016 at the Qatar National Convention Center. The conference aims to advance Qatar’s ambitious ...

Read More

Coding is Cool Website Photo.png

Coding is Cool 2016

Download ICS File 21/03/2016 , QNCC Room 215-217

Organized by Qatar Computing Research Institute and MIT-Computer Science and Artificial Intelligence Laboratory

Read More

Rus QCRI WEB.JPG

Self-Driving Cars Are Coming - Public Talk by Daniela Rus

Download ICS File 20/03/2016 , QNCC Room 215-217

Abstract: We spend a lot of time in out cars, yet this is a part of our lives where we have been vulnerable to the world's leading cause of bodily harm. Now, the digitization of practically ...

Read More

News

Default Thumbnail

QCRI’s ‘guilt by association’ tool targets suspicious domains

18/04/2016

A group of scientists at the Hamad bin Khalifa University’s Qatar Computing Research Institute (QCRI) has invented a new tool to identify unknown malicious domains by using a real-life “...

Read More

viral social media.jpeg

QCRI research on viral social media events shows people ‘flock like birds’

07/04/2016

New research developed by QCRI has shown that people behave like schools of fish or flocks of birds when events go viral on social media. The research, led by Dr Javier Borge-Holthoefer, with ...

Read More

Default Thumbnail

Messi makes rare moves but Ronaldo ‘just another player’, QCRI research finds

07/04/2016

Lionel Messi is an extraordinary footballer whose on-field manoeuvres are rare, whereas Cristiano Ronaldo moves in similar ways to scores of other players, new QCRI research has found. The findings ...

Read More