Data Analytics

Read our feature in the December 2012 SIGMOD Record.

The Data Analytics group at QCRI has built expertise in three core data management challenges whose solutions will enable the effective use of data, a growing asset class: extracting data from its natural digital habitat, integrating it from a large and evolving number of sources, and cleaning it robustly to assure its quality and validity.

The Data Trio: Extraction, Integration, and Cleaning. Institutions and industries at a national level deal with large-scale, heterogeneous data collected from a large number of sources. The main challenge is to use this information judiciously, within and across organizations, to make informed decisions and to run operations effectively.

At QCRI, we are focusing on the interaction among three core data management challenges that will enable effective use of the continuously growing data: Information Extraction, Data and Schema Integration, and Data Cleaning.

Going beyond traditional ETL approaches, we are investigating multiple new directions, including: handling unstructured data; interleaving extraction, integration, and cleansing tasks in a more dynamic and interactive process that responds to evolving data sets and real-time decision-making constraints; and leveraging the power of human cycles to solve hard problems such as data cleaning and information integration.

Scalable Knowledge Models: Grand challenges mean big data. ‘Knowledge base’ is the term commonly used to refer to data, along with the rules and the logic that describe the information within this data. Large-scale knowledge management is a core computing challenge because reasoning about the data and inferring the facts and semantics embedded within it is expensive. We focus on developing efficient knowledge representation models, semantic-aware query languages, and processing engines that bring semantics to real applications. The main application domains include media and health, where current approaches are either too expensive or fall short of meeting user needs.


For technical or informational questions, please send an email to QCRI Careers with the name of the group to which you are directing your question (e.g., ALT, CS&E, Cyber Security, Data Analytics, Distributed Systems, or Social Computing) in the subject line.

Research Director

 

Dr. Divyakant (Divy) Agrawal

QCRI provides a unique opportunity to be a part of building a high-impact research center in a geographical area that can have long-term effects in terms of transforming Qatar and the region as a whole. It’s this long-term vision and related investments from Qatari leadership that I find so exciting and believe will forever change the future of the people of the Middle East for the better.

Principal Scientist

Dr. Mohammed J. Zaki

QCRI is at an exciting stage of growth with world-class researchers. I am excited about the opportunity to establish QCRI at the forefront of data mining research.

Principal Scientist

Dr. Prasenjit Mitra

Storing, managing, retrieving, and mining Big Data is one of the most difficult computing challenges of our times. Along with my colleagues in the Data Analytics Group at QCRI, I am interested in enabling end-users to utilize large datasets to the fullest by designing infrastructure and algorithms, and applying data and text mining techniques.

Principal Scientist

Dr. Sanjay Chawla

QCRI provides an ideal environment to conduct high-impact research which can transcend disciplinary boundaries.
Open Source Release:
  • NADEEF: a semi-automatic, extensible data cleaning system.

In the News

#newsHACK III: The Winners

21/12/2014

Hello, I'm Basile, hacker-journalist with BBC News Labs over in Euston, and I'm just recovering from the third edition of NewsHACK, which took place on December 15 and 16 in London. 50 participants ...

Why Media Bias Has Nowhere to Run and Hide from Data Science

21/12/2014

When you want to see the face of biased reporting in online news, you may not have to go further than the satirical news site The Onion. Titles such as “Media Reports of Bear Attacks May Be Biased”...

Real-Time Disaster Relief

16/12/2014

The Philippines last week topped international headlines as a typhoon ripped through the island nation, claiming dozens of lives and leaving a swath of destruction. The story has a ring of ...

Upcoming Events

2015

4th International Conference and Exhibition on Metabolomics & Systems Biology

27/04/2015 - 29/04/2015

Metabolomics is an emerging field which combines strategies to identify and quantify cellular metabolites using sophisticated analytical technologies with the application of statistical and ...

24th International World Wide Web Conference

18/05/2015 - 22/05/2015, Florence, Italy

The annual World Wide Web Conference is the premier international forum to present and discuss progress in research, development, standards, and applications of the topics related to the Web. WWW ...

The 3rd International Workshop on Social Web for Disaster Management (SWDM'15)

18/05/2015, Florence, Italy

Co-located with the WWW'15 conference (May 2015, Florence, Italy). Proceedings published by ACM. QCRI's Dr. Carlos Castillo is a co-organizer.

Press Releases

Families Visit Qatar Foundation's Tent at Darb El-Saai National Day Celebrations

11/12/2014

Fun Filled Activities Designed To Raise Awareness Of Local Culture And The Foundation’s Many Initiatives

Qatar Foundation's Annual Research Conference'14 Leads Discussions on Cyber Security and Computing for Social Good to Address Nation's Grand Challenges

09/11/2014

Doha, Qatar, 8 November 2014: Qatar Foundation’s Annual Research Conference (ARC’14) has announced the ARC’14 agenda to address Qatar’s Cyber Security Grand Challenge, focusing on computing ...

Greater Road Safety Subject of Collaborative Research by Qatar Computing Research Institute and Qatar Mobility Innovations Center

29/10/2014

Results From Joint Research Efforts To Bring Benefits To Local Residents

Doha, 28 October 2014: Qatar Computing Research Institute (QCRI) and the Qatar Mobility Innovations Center (QMIC) have signed...
