Small dataset for testing:
A very small dataset, constructed for testing the code; it gives results within a few seconds.
Reference
G. Tibély et al.: Extracting tag hierarchies. PLoS ONE 8(12): e84133 (2013).
Files:
File name: List_of_test_objects.txt
  Description: List of objects with tags for testing, where each row corresponds to an object with tags. For simplicity each object and tag is represented by a number.
  Format: plain text file
    rows starting with "#" are skipped
    columns are separated by spaces
    1st column: object
    rest of the columns: tags
  Size: 777 bytes

File name: List_of_test_objects_tag_hierarchy_A.txt
  Description: Result for the dataset by Algorithm A
  Format: directed graph, where
    1st column: source
    2nd column: target
  Size: 16 bytes

File name: List_of_test_objects_tag_hierarchy_B.txt
  Description: Result for the dataset by Algorithm B
  Format: directed graph, where
    1st column: source
    2nd column: target
  Size: 16 bytes
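
The file formats described above can be read with a few lines of Python. The following is a minimal sketch, not part of the dataset or of the published code: the function names parse_tagged_objects and parse_hierarchy are hypothetical, and the sample rows are made up for illustration. It assumes that blank lines carry no data, that an object appearing on several rows should have its tags merged, and that the "#" comment rule also applies to the result files.

```python
from typing import Dict, Iterable, List, Tuple


def parse_tagged_objects(lines: Iterable[str]) -> Dict[str, List[str]]:
    """Map each object to its list of tags.

    Rows starting with "#" are skipped, columns are separated by spaces,
    the 1st column is the object and the rest of the columns are tags.
    """
    objects: Dict[str, List[str]] = {}
    for line in lines:
        if line.startswith("#"):
            continue
        columns = line.split()
        if not columns:  # assumption: blank lines carry no data
            continue
        obj, tags = columns[0], columns[1:]
        # Assumption: an object listed on several rows gets its tags merged.
        objects.setdefault(obj, []).extend(tags)
    return objects


def parse_hierarchy(lines: Iterable[str]) -> List[Tuple[str, str]]:
    """Return the directed edges (source, target) of a result file."""
    edges: List[Tuple[str, str]] = []
    for line in lines:
        # Assumption: result files use the same "#" comment rule.
        if line.startswith("#"):
            continue
        columns = line.split()
        if len(columns) < 2:  # skip blank or incomplete rows
            continue
        edges.append((columns[0], columns[1]))
    return edges


# Made-up sample rows, NOT taken from the dataset.
sample_objects = ["# object tag tag ...", "1 10 11", "2 10"]
sample_edges = ["# source target", "10 11"]

objects = parse_tagged_objects(sample_objects)
edges = parse_hierarchy(sample_edges)
```

Objects and tags are kept as strings here so that no assumption is made about how the numbers are written. Both functions accept any iterable of lines, so an open file handle for List_of_test_objects.txt can be passed in directly.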
The header of each file contains instructions for processing the data with the Hierarchy Extracting Algorithms.
Lines end with the Unix end-of-line character; under Windows, use a text viewer that handles it, such as WordPad.
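
If a Windows tool requires Windows line endings, the files can also be converted. The sketch below is only an illustration under that assumption; the helper name to_windows_line_endings is hypothetical and not part of the dataset. Note that Python itself needs no conversion for reading, since open() in text mode translates line endings by default.

```python
def to_windows_line_endings(data: bytes) -> bytes:
    """Convert Unix (LF) line endings to Windows (CRLF) line endings."""
    # Normalise existing CRLF pairs first so they are not doubled.
    unix_data = data.replace(b"\r\n", b"\n")
    return unix_data.replace(b"\n", b"\r\n")


# Made-up sample content, NOT taken from the dataset.
converted = to_windows_line_endings(b"1 10 11\n2 10\n")
```

To convert a real file, read it in binary mode ("rb"), pass the bytes through the helper, and write the result to a new file in binary mode ("wb"), so that the original stays untouched.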

Contact
hiertags@hal.elte.hu