SMS-Based FAQ Retrieval

FIRE 2012 Task


Home
Dataset
Important Dates
People
Submission
Attendance
Resources
Contact
Results
Joint Task Coordinators
COER
and
IBM Research

Dataset

The FAQs have been collected from online sites - both government and private. The FAQs are in three languages:

  • English
  • Hindi
  • Malayalam

The FAQs are from several domains, including:

  • Railway Enquiry
  • Telecom
  • Health
  • Banking

The SMSes have been generated by asking college students to write down their information need using a mobile phone.

To participate register by sending an email to firesmstask@gmail.com and download the dataset and open it using the password that will be provided to you upon registering.

The following datasets have been released till date:

Training Dataset: Release Date: Aug 4, 2012

Test Dataset: Release Date: Oct, 7 2012

Test Dataset with Matches: Release Date: Nov, 24 2012