Data Analytics

DA group photo

Read our feature in the December 2012 Sigmod Record.

The Data Analytics group at QCRI has built expertise focused on three core data management challenges that will enable the effective use of this growing asset class: extraction from its natural digital habitat, integration from a large and evolving number of sources, and robust cleaning processes to assure data quality and validation.

The Data Trio: Extraction, Integration, and Cleaning:  Institutions and industries at a national level deal with large scale, heterogeneous data collected from large number of sources. The main challenge is a judicious use of the information within and across organizations to make informed decisions and to run operations effectively.

At QCRI, we are focusing on the interaction among three core data management challenges that will enable effective use of the continuously growing data: Information Extraction, Data and Schema Integration, and Data Cleaning.

Going beyond traditional ETL approaches, we are investigating multiple new directions, including: handling unstructured data; interleaving extraction, integration, and cleansing tasks in a more dynamic and interactive process that responds to evolving data sets and real-time decision-making constraints; and leveraging the power of human cycles to solve hard problems such as data cleaning and information integration.

Scalable Knowledge Models: Grand challenges mean big data. ‘Knowledge base’ is the term commonly used to refer to data, along with the rules and the logic that describe the information within this data. Large-scale knowledge management is a core-computing challenge due to the expensive process involved in reasoning about the data and inferring the facts and the various semantics embedded within. We focus on developing efficient knowledge representation models and semantic-aware query languages and processing engines that bring semantics to real applications. Main applications domains include media and health, where current approaches are either too expensive or fall short in delivering user needs.


For technical or informational questions, please send an email to 
QCRI Careers with the name of the group to whom you’re directing your question, e.g. ALT, CS&E, Cyber Security, Data Analytics, Distributed Systems or Social Computing, in the subject line.

Research Director

 

Dr. Divy Agrawal

Read more

Principal Scientist

Dr.Moahammed_Zaki

Dr. Mohammed J. Zaki

QCRI is at an exciting stage of growth with world-class researchers. I am excited about the opportunity to establish QCRI at the forefront of data mining research.
Read more

Principal Scientist

 

Dr. Prasenjit Mitra

Read more
our-research/data-analytics
Open Source Release:
  • NADEEF.  A semi-automatic extensible data cleaning system.
Meet us at the following conferences:
Learn more about our us:
default

Follow Us

  • YouTube
  • Twitter
  • Facebook
  • RSS Feed
  • Linkedin
  • github-web.png
Back to Top

In the News

bq doha.jpg

Strife in the Arab world driving humanitarian assistance

21/07/2014

Middle East countries are set to redefine the international humanitarian aid sector, and it isn't just because their contributions to global funds are increasing.

Read More

logo_peninsula_qatar_100x235_hb.jpg

QU signs deal for PhD research programme

13/07/2014

Doha: Qatar University (QU) and Qatar Foundation Research & Development (QF R&D) yesterday signed a cooperation agreement to establish an interdisciplinary PhD programme in all fields of research.  ...

Read More

logo_peninsula_qatar_100x235_hb.jpg

Boeing, QCRI to develop new product

08/07/2014

QCRI and Boeing working on project related to health maintenance services for aircraft.

Read More

Upcoming Events

2014

EMNLP website.jpg

EMNLP 2014: Conference on Empirical Methods in Natural Language Processing

Download ICS File 25/10/2014 - 29/10/2014, Renaissance Hotel, Doha, Qatar

QCRI is the local host for EMNLP 2014. EMNLP is organized by SIGDAT, the Association for Computational Linguistics special interest group for linguistic data and corpus-based approaches to natural language processing.

Read More

Default Thumbnail

11th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 2014)

Download ICS File 10/11/2014 - 13/11/2014, Doha, Qatar

Annual ACS/IEEE Conference will be held in Doha, Qatar. Organized by Qatar University.

Read More

Past Events

2014

QITCOM.jpg

QITCOM Conference 2014: Innovating Today for the Future of Qatar

Download ICS File 26/05/2014 - 28/05/2014, Qatar National Convention Centre

The QITCOM 2014 Conference will be held on 26th, 27th and 28th of May, 2014, at the Qatar National Convention Centre.  The conference epitomizes QITCOM's aim of tackling local and regional concerns ...

Read More

Press Releases

Default Thumbnail

To Make Your Mark in Computer Science Visit the QCRI Summer Internship Programme Open House

03/04/2014

Qatar Computing Research Institute to host student information session showcasing unique summer internship opportunities

Read More

Tim Berners Lee still.jpg

QCRI Organises Talk by the Inventor of the World Wide Web

17/03/2014

Doha, Qatar, 17 March 2014:  Just a quarter of a century since the birth of the World Wide Web, Qatar Computing Research Institute (QCRI) has invited its inventor, Sir Tim Berners-Lee, to deliver a ...

Read More

MLDAS poster

World’s Top Machine Learning and Data Analytics Experts Come to Qatar

02/03/2014

Joint Boeing and QCRI research symposium to highlight new approaches to valuable data extraction

Read More