Contributor Location
Question: What is the location of contributors?
Overview
Contributor Location measures the geographic locations from which contributors contribute, where they live, or where they work. Tracking contributor location provides insights into the project's global reach, work practices, and time zone considerations. It also helps identify where contributions do not come from, so that engagement in those areas can be improved. A project with contributors from diverse geographic locations may indicate a more inclusive and equitable environment.
Want to Know More?
Data Collection Strategies
Different approaches can be used to collect information about location:
- Collect the location information from a contributor’s profile in the system of engagement.
- Use IP address geolocation to find the locations from which a contributor most frequently contributes.
- Infer geographical location from the timestamp in contributions.
- Survey contributors.
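As an illustration of the timestamp-based approach in the list above, here is a minimal Python sketch. It assumes contribution timestamps are available as ISO 8601 strings that include a UTC offset (as Git author dates typically do); the function name `most_common_utc_offset` and the sample data are hypothetical. Note that a UTC offset only narrows the location to a band of longitudes and shifts with daylight saving time, so the result is a coarse hint rather than a location.

```python
from collections import Counter
from datetime import datetime, timedelta
from typing import Iterable, Optional


def most_common_utc_offset(timestamps: Iterable[str]) -> Optional[timedelta]:
    """Return the UTC offset seen most often, or None if no timestamp carries one."""
    offsets = Counter()
    for raw in timestamps:
        offset = datetime.fromisoformat(raw).utcoffset()
        if offset is not None:  # naive timestamps say nothing about location
            offsets[offset] += 1
    if not offsets:
        return None
    return offsets.most_common(1)[0][0]


# Hypothetical author dates for one contributor.
sample_timestamps = [
    "2023-05-01T14:22:10+02:00",
    "2023-05-03T09:05:44+02:00",
    "2023-11-20T18:40:02+01:00",
]
print(most_common_utc_offset(sample_timestamps))  # 2:00:00
```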
The key challenge for collecting data is determining the location of the contributor. Best practice is to leverage any profile information available from the system of engagement and, if that is not available, to use IP geolocation to determine the most frequent location of contribution for that individual. Note that contributors may enter false or nonsensical locations in their profile (e.g., “Earth” or “Internet”), and that IP geolocation can produce many incorrect locations because of VPNs or other IP masking tools.
An additional consideration is the use of external data sources, such as community surveys or event registration data, that could be cross-referenced with system of engagement profiles. Contributor location data could be collected in line with event attendee demographics and speaker demographics.
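The fallback order described above (profile first, then most frequent IP-derived location) could be sketched roughly as follows. This is only an illustration: the `NON_LOCATIONS` set, both function names, and the idea that IP lookups have already been reduced to country codes are assumptions, not part of the metric definition.

```python
from collections import Counter
from typing import Optional, Sequence

# Hypothetical list of profile values that carry no usable location.
# "earth" and "internet" come from the text; the rest are illustrative.
NON_LOCATIONS = {"", "earth", "internet", "world", "everywhere"}


def clean_profile_location(raw: Optional[str]) -> Optional[str]:
    """Return a trimmed profile location, or None if missing or nonsensical."""
    if raw is None:
        return None
    value = raw.strip()
    if value.lower() in NON_LOCATIONS:
        return None
    return value


def resolve_location(profile_location: Optional[str],
                     ip_countries: Sequence[str]) -> Optional[str]:
    """Prefer the profile location; otherwise use the most frequent IP-derived country."""
    cleaned = clean_profile_location(profile_location)
    if cleaned is not None:
        return cleaned
    if ip_countries:
        return Counter(ip_countries).most_common(1)[0][0]
    return None


print(resolve_location("Earth", ["DE", "DE", "US"]))  # DE
print(resolve_location(" Omaha, NE ", ["DE"]))        # Omaha, NE
```

The IP-derived value remains unreliable when a contributor uses a VPN, so a real pipeline would likely record which source produced each location.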
Filters
Filter contributions by:
- Location. Attempt to group locations into regions to allow multiple levels of reporting. Location is a purposely ambiguous term in this context, and could refer to region, country, state, locale, or time zone.
- Period of time. Start and finish date of the period during which contributions are counted. Default: forever.
- Type of contributor, for example:
  - Repository authors
  - Issue authors
  - Code review participants
  - Mailing list authors
  - Event participants
  - IRC authors
  - Blog authors
- Release cycle
- Programming languages of the project
- Role or function in project
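A few of these filters (location level, period of time, and type of contribution) can be illustrated with a short Python sketch. The `Contribution` record, the `REGION_OF` mapping, and `count_by_location` are hypothetical names chosen for this example, and the sketch assumes each contribution has already been assigned a country code.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date
from typing import Iterable, Optional

# Hypothetical country-to-region mapping; a real one would cover every country.
REGION_OF = {"DE": "Europe", "ES": "Europe", "NG": "Africa", "US": "North America"}


@dataclass
class Contribution:
    contributor: str
    kind: str      # e.g. "commit", "issue", "review"
    country: str   # ISO 3166-1 alpha-2 code, assumed to be resolved already
    day: date


def count_by_location(contributions: Iterable[Contribution],
                      level: str = "country",
                      start: Optional[date] = None,
                      end: Optional[date] = None,
                      kind: Optional[str] = None) -> Counter:
    """Count contributions per country or region, within an optional period and kind."""
    if level not in ("country", "region"):
        raise ValueError("level must be 'country' or 'region'")
    counts = Counter()
    for c in contributions:
        if start is not None and c.day < start:
            continue
        if end is not None and c.day > end:
            continue
        if kind is not None and c.kind != kind:
            continue
        key = c.country if level == "country" else REGION_OF.get(c.country, "Unknown")
        counts[key] += 1
    return counts


# Hypothetical sample data.
sample_contributions = [
    Contribution("a", "commit", "DE", date(2023, 1, 10)),
    Contribution("b", "commit", "ES", date(2023, 2, 5)),
    Contribution("c", "issue", "US", date(2023, 2, 7)),
    Contribution("d", "commit", "NG", date(2022, 12, 1)),
]
by_region = count_by_location(sample_contributions, level="region", start=date(2023, 1, 1))
```

Leaving `start` and `end` unset corresponds to the default period of "forever".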
Visualizations
References
- Gonzalez-Barahona, J. M., Robles, G., Andradas-Izquierdo, R., & Ghosh, R. A. (2008). Geographic origin of libre software developers. Information Economics and Policy, 20(4), 356-363.
Contributors
- Matt Germonprez
- Kevin Lumbard
- Peculiar C. Umeh
Additional Information
To edit this metric, please submit a Change Request.
To reference this metric in software or publications, please use this stable URL: https://chaoss.community/?p=3468
The usage and dissemination of health metrics may lead to privacy violations. Organizations may be exposed to risks. These risks may flow from compliance with the GDPR in the EU, with state law in the US, or with other laws. There may also be contractual risks flowing from terms of service for data providers such as GitHub and GitLab. The usage of metrics must be examined for risk and potential data ethics problems. Please see the CHAOSS Data Ethics document for additional guidance.