The dataset contains the following four directories:
	1. Train_docs
	2. Train_catches
	3. Test_docs
	4. Test_catches

The first folder ("Train_docs") contains 100 case statements.

The second folder ("Train_catches")  contains the gold standard catchwords for each of these 100 train case statements.

The third folder ("Test_docs") contains 300 test case statements. For each of these 300 statements, one should produce a set of catchphrases.

The fourth folder ("Test_catches") contains the gold standard catchwords for each of these 300 test case statements.
