TY - GEN
T1 - Massive Multi-agent Data-Driven Simulations of the GitHub Ecosystem
AU - Blythe, Jim
AU - Bollenbacher, John
AU - Huang, Di
AU - Hui, Pik Mai
AU - Krohn, Rachel
AU - Pacheco, Diogo
AU - Muric, Goran
AU - Sapienza, Anna
AU - Tregubov, Alexey
AU - Ahn, Yong Yeol
AU - Flammini, Alessandro
AU - Lerman, Kristina
AU - Menczer, Filippo
AU - Weninger, Tim
AU - Ferrara, Emilio
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - Simulating and predicting planetary-scale techno-social systems poses heavy computational and modeling challenges. The DARPA SocialSim program set the challenge to model the evolution of GitHub, a large collaborative software-development ecosystem, using massive multi-agent simulations. We describe our best performing models and our agent-based simulation framework, which we are currently extending to allow simulating other planetary-scale techno-social systems. The challenge problem measured participant’s ability, given 30 months of meta-data on user activity on GitHub, to predict the next months’ activity as measured by a broad range of metrics applied to ground truth, using agent-based simulation. The challenge required scaling to a simulation of roughly 3 million agents producing a combined 30 million actions, acting on 6 million repositories with commodity hardware. It was also important to use the data optimally to predict the agent’s next moves. We describe the agent framework and the data analysis employed by one of the winning teams in the challenge. Six different agent models were tested based on a variety of machine learning and statistical methods. While no single method proved the most accurate on every metric, the broadly most successful sampled from a stationary probability distribution of actions and repositories for each agent. Two reasons for the success of these agents were their use of a distinct characterization of each agent, and that GitHub users change their behavior relatively slowly.
AB - Simulating and predicting planetary-scale techno-social systems poses heavy computational and modeling challenges. The DARPA SocialSim program set the challenge to model the evolution of GitHub, a large collaborative software-development ecosystem, using massive multi-agent simulations. We describe our best performing models and our agent-based simulation framework, which we are currently extending to allow simulating other planetary-scale techno-social systems. The challenge problem measured participant’s ability, given 30 months of meta-data on user activity on GitHub, to predict the next months’ activity as measured by a broad range of metrics applied to ground truth, using agent-based simulation. The challenge required scaling to a simulation of roughly 3 million agents producing a combined 30 million actions, acting on 6 million repositories with commodity hardware. It was also important to use the data optimally to predict the agent’s next moves. We describe the agent framework and the data analysis employed by one of the winning teams in the challenge. Six different agent models were tested based on a variety of machine learning and statistical methods. While no single method proved the most accurate on every metric, the broadly most successful sampled from a stationary probability distribution of actions and repositories for each agent. Two reasons for the success of these agents were their use of a distinct characterization of each agent, and that GitHub users change their behavior relatively slowly.
KW - Collaborative platforms
KW - GitHub
KW - Massive scale simulations
UR - http://www.scopus.com/inward/record.url?scp=85068685792&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-24209-1_1
DO - 10.1007/978-3-030-24209-1_1
M3 - Conference contribution
AN - SCOPUS:85068685792
SN - 9783030242084
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 3
EP - 15
BT - Advances in Practical Applications of Survivable Agents and Multi-Agent Systems
A2 - Demazeau, Yves
A2 - Matson, Eric
A2 - Corchado, Juan Manuel
A2 - De la Prieta, Fernando
PB - Springer Verlag
T2 - 17th International Conference on Practical Applications of Agents and Multi-Agent Systems, PAAMS 2019
Y2 - 26 June 2019 through 28 June 2019
ER -