LAGOON: An Analysis Tool for Open Source Communities

At the Mining Software Repositories (MSR2022) conference in May, we presented our LAGOON tool resulting from the DARPA SocialCyber AIE, and led a discussion session on reducing complexity of machine learning. LAGOON provides a comprehensive platform for analyzing and investigating open-source software (OSS) communities for potentially malicious contributors. This is accomplished by ingesting multiple types of artifacts produced by OSS communities and fusing entities across these disparate data sources, resulting in a sociotechnical, spatiotemporal graph database which can be analyzed with machine learning. LAGOON is a reusable, open-source tool available on GitHub at  In the short video below, we give an overview of LAGOON and some of its features, which include a UI facilitating data analysis and exploration. For more information about the LAGOON platform, check out this paper.