sos-coauth-Geology dataset
This dataset is a collection of sequences of sets. Each sequence is
the time-ordered list of coauthor sets from one researcher's
publications, with one set per publication. Publication data comes
from the Microsoft Academic Graph, restricted to papers labeled with
the "Geology" subject. Every sequence contains at least 10 sets, and
only sets of size at most 5 are included. Some basic statistics of
this dataset are:
- number of sequences: 57,294
- number of unique elements appearing in sets: 525,348
- number of sets: 1,438,652
- number of unique sets: 1,090,485
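
The selection rules and the statistics above can be illustrated with a minimal Python sketch. It works on in-memory toy data only and does not read the dataset's files. The function names `filter_sequences` and `summarize`, the integer element IDs, and the order of filtering (drop oversized sets first, then count the remaining sets) are assumptions made for illustration, not the authors' pipeline.

```python
from typing import Dict, FrozenSet, List

# A sequence is a time-ordered list of sets; elements are hypothetical integer IDs.
Sequence = List[FrozenSet[int]]


def filter_sequences(
    sequences: List[Sequence], min_sets: int = 10, max_set_size: int = 5
) -> List[Sequence]:
    """Drop sets larger than max_set_size, then keep only sequences
    that still contain at least min_sets sets (assumed order of steps)."""
    kept = []
    for seq in sequences:
        small = [s for s in seq if len(s) <= max_set_size]
        if len(small) >= min_sets:
            kept.append(small)
    return kept


def summarize(sequences: List[Sequence]) -> Dict[str, int]:
    """Compute the four statistics listed in the dataset description."""
    all_sets = [s for seq in sequences for s in seq]
    elements = set().union(*all_sets) if all_sets else set()
    return {
        "sequences": len(sequences),
        "unique_elements": len(elements),
        "sets": len(all_sets),
        "unique_sets": len(set(all_sets)),  # frozensets are hashable
    }


# Toy data with small thresholds so the effect of the filter is visible.
toy = [
    [frozenset({1, 2}), frozenset({2, 3}), frozenset({1, 2})],
    [frozenset({4}), frozenset({4, 5, 6, 7})],
    [frozenset({8, 9}), frozenset({1, 2}), frozenset({9})],
]
filtered = filter_sequences(toy, min_sets=2, max_set_size=3)
stats = summarize(filtered)
print(stats)
```

In the toy run, the second sequence is dropped because only one of its sets survives the size limit. Sets are stored as `frozenset` so that repeated sets can be counted once when computing the number of unique sets.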

References:
- Sequences of sets.
  Austin R. Benson, Ravi Kumar, and Andrew Tomkins.
  Proceedings of KDD, 2018.
- Simplicial closure and higher-order link prediction.
  Austin R. Benson, Rediet Abebe, Michael T. Schaub, Ali Jadbabaie, and Jon Kleinberg.
  Proceedings of the National Academy of Sciences (PNAS), 2018.
- An overview of Microsoft Academic Service (MAS) and applications.
  Arnab Sinha, Zhihong Shen, Yang Song, Hao Ma, Darrin Eide, Bo-June Hsu, and Kuansan Wang.
  Proceedings of WWW, 2015.