The network Geom.net
is based on the file geombib.bib
that contains Computational Geometry Database, version February 2002.
The authors collaboration network in computational geometry was produced from the BibTeX bibliography [Beebe, 2002] obtained from the Computational Geometry Database geombib
, version February 2002 [Jones, 2002].
Two authors are linked with an edge, iff they wrote a common work (paper, book, ...). The value of an edge is the number of common works. Using a simple program written in programming language Python, the BibTeX data were transformed into the corresponding network, and output to the file in Pajek format.
The obtained network has 9072 vertices (authors) and 22577 edges (common papers or books) / 13567 edges as a simple network - multiple edges between a pair of authors are replaced with a single edge.
The problem with the obtained network is that, because of non standardized writing of the author's name, it contains several vertices corresponding to the same author. For example:
R.S. Drysdale, Robert L. Drysdale, Robert L. Scot Drysdale, R.L. Drysdale, S. Drysdale, R. Drysdale, and R.L.S. Drysdale;or:
Pankaj K. Agarwal, P. Agarwal, Pankaj Agarwal, and P.K. Agarwalthat are easy to guess; but an 'insider' information is needed to know that Otfried Schwarzkopf and Otfried Cheong are the same person. Also, no provision is made in the database to discern two persons with the same name. We manually produced the name equivalence partition and then shrank (in Pajek) the network according to it.
The reduced simple network contains 7343 vertices and 11898 edges. It is a sparse network - its average degree is 2m/n = 3.24.
Geom.bib
transformed in Pajek format and 'cleaned' by V. Batagelj and M. Zaveršnik.起点 | 终点 | 数值 |
---|
社团 | 大小 | 节点 |
---|
N and E are the number of nodes and links. 〈k〉 and 〈d〉 are the average degree and the average distance, respectively. C and r are the average clustering coefficient and the assortative coefficient. H is the degree heterogeneity. βc is the epidemic threshold of the SIR model.
N | 7343 |
---|---|
E | 11898 |
<k> | 3.2406 |
<d> | 0.026 |
<C> | 0.0872 |
r | 0.1802 |
H | 4.7062 |
beta_c | 0.0702 |
社团个数:
模块度 (Q)
运行时间(秒)
AUC:
准确率
召回率
F值
准确率