
Projects and Demos

Visit the Data Analytics site for more information about the team's work.  Some of our more recent projects and demos include:
  • NADEEF (which means "clean" in Arabic) is an extensible, generalized data cleaning system.  Released as open source, NADEEF allows users to implement their own data repair algorithms to replace the default NADEEF data repair implementation.
  • Rayyan: The Systematic Reviews Web App.  Rayyan aims to build tools to support the process of creating, analyzing, and maintaining systematic reviews, in terms of data extraction, cleaning, integration, and mining of published clinical trials and journal articles. A production system is available here and a demo here.
  • KATARA aims to perform trusted data cleaning by using reliable knowledge bases, augmented with crowdsourcing for validation.
  • Analytics on Data Anomalies. Oftentimes users face errors in the results of a query. We introduce DBRx, a system for discovering concise explanations of data anomalies. 
  • Web Data Integration. Web data is a great opportunity, but using it in analytics requires new solutions to overcome its variety and volatility. In this project we exploit web data for data integration tasks.
  • The World Bank's Auto Geotagger is a prototype that automatically identifies locations in documents from the World Bank Projects Data API using the Stanford Named Entity Recognizer (NER) and Alchemy, geocodes them with the Google Geocoder, Yahoo! Placefinder, and Geonames, and visualizes them on a map.
  • Tamr, a start-up founded in 2013, is based on technology developed at QCRI that enables scalable data curation and integration and deduplicates the resulting composite dataset.  Link to the paper here.
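
To make the idea of a replaceable repair step more concrete, here is a minimal Python sketch. It is not NADEEF's actual API or implementation; every name in it (`repair_fd`, `majority_vote`, the sample rows) is hypothetical and chosen only for illustration. It assumes a single rule of the form "rows with the same `zip` must have the same `city`" and lets the caller swap the default repair strategy for their own.

```python
from collections import Counter, defaultdict
from typing import Callable, Dict, List

Row = Dict[str, str]
# A repair strategy maps the conflicting values seen for one key to the value to keep.
RepairStrategy = Callable[[List[str]], str]


def majority_vote(values: List[str]) -> str:
    """Default strategy (hypothetical): keep the most frequent value."""
    return Counter(values).most_common(1)[0][0]


def repair_fd(rows: List[Row], lhs: str, rhs: str,
              strategy: RepairStrategy = majority_vote) -> List[Row]:
    """Enforce the rule lhs -> rhs: rows sharing an lhs value must share an rhs value."""
    groups: Dict[str, List[str]] = defaultdict(list)
    for row in rows:
        groups[row[lhs]].append(row[rhs])
    # Only groups with more than one distinct rhs value violate the rule.
    fixes = {key: strategy(vals) for key, vals in groups.items() if len(set(vals)) > 1}
    # Return new rows; the input list is left untouched.
    return [{**row, rhs: fixes.get(row[lhs], row[rhs])} for row in rows]


rows = [
    {"zip": "10001", "city": "New York"},
    {"zip": "10001", "city": "New York"},
    {"zip": "10001", "city": "NYC"},
    {"zip": "60601", "city": "Chicago"},
]

# Default repair: the minority value "NYC" is replaced by "New York".
cleaned = repair_fd(rows, "zip", "city")

# User-supplied repair: prefer the shortest spelling instead.
shortest = repair_fd(rows, "zip", "city", strategy=lambda vals: min(vals, key=len))
```

Passing the strategy as a plain function keeps violation detection and repair separate, so a different repair policy can be tried without touching the detection logic.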

Principal Scientist


Dr. Mourad Ouzzani

To be part of something different than what I had been used to at Purdue University and contribute to the first computing research institution in the region.

Principal Scientist


Dr. Prasenjit Mitra

Storing, managing, retrieving, and mining Big Data is one of the most difficult computing challenges of our times. Along with my colleagues in the Data Analytics Group at QCRI, I am interested in enabling end-users to utilize large datasets to the fullest by designing infrastructure and algorithms, and applying data and text mining techniques.

Principal Scientist

Dr. Sanjay Chawla

QCRI provides an ideal environment to conduct high-impact research which can transcend disciplinary boundaries.



