2i2c-org · choldgraf · May 22, 2025 · Jun 2, 2025 · yuvipanda · May 22, 2025
diff --git a/content/blog/2025/good-citizen/index.md b/content/blog/2025/good-citizen/index.md
@@ -0,0 +1,117 @@
+---
+title: "The difference between upstream-led vs. stakeholder-led open source contributions"
+date: 2025-05-21
+authors:
+  - Yuvi Panda
+  - Chris Holdgraf
+tags:
+  - open source
+categories:
+  - impact
+---
+
+Over the past year, the 2i2c team has been thinking more deeply about our relationship with our _upstream_ communities. Here are a few questions we've been grappling with:
+
+- How do we tie general upstream maintenance to value delivered to our user communities?
+- How can we scope upstream support so that it doesn't detract from our service needs and product strategy?
+- How can we encourage team members to work on the most impactful aspects of upstream support?
+- How can we intentionally and equitably support open source communities as a team, rather than a collection of individuals?
+
+Along the way, we realized there are **two very different kinds of upstream contributions**:
+
+1. **Stakeholder-led contributions**: A contribution driven by the needs of a stakeholder (e.g., a 2i2c team member implementing a feature for our member communities).
+2. **Upstream-led contributions**: A contribution driven by the needs of the upstream community (e.g., a 2i2c team member reviewing a newcomer's pull request in an upstream repository).
+
+And there's a special case of each type:
+
+3. **Individual-led contributions**: A contribution driven by the needs of an individual person (e.g., a person scratching an itch by contributing to an upstream project).
+
+Historically we have conflated these types of contributions, but we think it's key that we treat them differently. Here's our current thinking on the difference between `stakeholder-led` and `upstream-led` contributions in upstream communities.
+
+## What is a stakeholder in an open source project?
+
+Open source teams[^inclusive] are usually two kinds of teams that overlap heavily:
+
+[^inclusive]: By "open source" we are focusing on multi-stakeholder open source projects with participatory and inclusive leadership and contributions. This wouldn't apply to an organization- or person-specific open source project.
+
+1. A collection of _stakeholders working together_ on the open source project, each with their own goals and interests.
+2. An _open source team_ with a shared goal and strategy for the open source project.
+
+In this case, stakeholders can be individuals or companies. They use and contribute to the open source project because it advances their own interests. For example, an enthusiast contributing to a project because it brings them joy, or a company contributing to a project because they build a product that depends on the open source technology.
+
+However, for open source projects to be successful they also need their own unique identity, goals, strategy for impact, and system of work. This allows a diverse collection of stakeholders to _work together effectively_ and _create impactful technology_. This team is made up of the same stakeholders described above, but with a responsibility to lead and support the open source team, rather than just serve their individual interests as stakeholders.
+
+Thus, any open source stakeholder has two hats: they are both _representatives of a stakeholder_ and _members of an open source team_. While it's possible to _align the interests_ of these two groups, we think it's still important to _distinguish between them_.
+
+## What is a stakeholder-led open source contribution?
+
+A stakeholder-led contribution is _primarily_ driven by the needs of a stakeholder in an open source project. To use 2i2c as an example, let's take a quote from 2i2c's value proposition:
+
+> 2i2c serves a global network of community hubs for interactive learning and discovery
+
+*Community* here does **not** refer to open source upstream software provider communities (like JupyterHub or Kubernetes), but instead to downstream _user communities_ (like [CryoCloud](https://cryointhecloud.com/) or [Openscapes](https://openscapes.org)).
+
+When 2i2c makes a stakeholder-led contribution, it usually means we are trying to _deliver value to one or more of our member communities_ by making an upstream contribution.
+
+Satisfying community needs often involves directly working on the software they use. Driven by our [right to replicate](https://2i2c.org/right-to-replicate/) principles, this means we mostly work on software that is not proprietary to 2i2c nor solely owned by us permanently - but by contributing to an upstream software community. These are all stakeholder-led contributions.
+
+Some illustrative examples:
+
+1. [Allow login to be gated on OAuth2 granted scopes](https://github.com/jupyterhub/oauthenticator/pull/719) was a feature we added to support one of our communities' auth flow (EarthScope)
+2. [Changing how `.pyc` files are kept in images](https://github.com/pangeo-data/pangeo-docker-images/pull/426) was work we did as a result of a support ticket investigating spawn timeout issues in the [LEAP](https://leap.columbia.edu/) hub.
+3. [Adding landing pages functionality to Jupyter Book and MyST](https://github.com/jupyter-book/myst-theme/pull/531) was work we did to support member communities like [CryoCloud](https://cryointhecloud.org) and [Project Pythia](https://cookbooks.projectpythia.org/).
+
+The fact that these are open source contributions is *incidental*. We are making open source contributions, but we are _primarily_ doing this work to deliver value to _our community network_. 
+
+### How 2i2c wants to plan team work around stakeholder-led contributions
+
+Stakeholder-led contributions naturally align with 2i2c's overall goals and strategy, so we can re-use our pre-existing product processes for planning and delivering on them. However, we also want to provide transparency to upstream communities so that they understand who is driving the contributions that we're making.
+
+With that in mind, here are a few ways that stakeholder-led contributions relate to our practices:
+
+- Stakeholder-led contributions should be defined by our product roadmap and prioritization processes.
+- We should allocate engineering time to making these upstream contributions as part of our product lifecycle, and consider the extra coordination and communication work needed to work at the pace of the upstream community.
+- We should cross-link 2i2c product initiatives to upstream issues and pull-requests wherever we can to provide transparency about why we're making a contribution.
+
+## What is an upstream-led open source contribution?
+
+However, contributions can't always be driven by a stakeholder's needs or the open source team will not have an identity or support structure of its own. Here's another excerpt from our value proposition:
+
+> We need infrastructure services that are driven by community needs and values, that follow the same open source science practices we wish to see in others, and that believe in the power of shared community resources and knowledge.
+
+Being a "healthy upstream citizen" is core to 2i2c's mission, and is also a way to help communities we rely on remain healthy. This means that we need to allow some of our contributions be _upstream-led_ rather than _stakeholder-led_. This means doing things that keep the overall ecosystem healthy even if it does not *directly* address a specific member community need. The *presence* of a healthy open source ecosystem is a value to our member communities in-and-of itself.
+
+Defining "upstream-led" needs is often difficult, because open source teams tend to have less structure and formally-stated goals and needs than most organizations. In 2i2c's case, we focus our upstream-led contributions around *maintaining the health of the open source ecosystem*. It includes things like:
+
+1. Help making releases
+2. Provide code review
+3. Help onboard new contributors to the project
+4. Fix broken CI
+5. Write documentation and tutorials
+6. Manage and run meetings
+7. Align open source teams on goals and strategy
+
+However, the real point is that these actions need to be driven by _the upstream project's goals and needs_, not by 2i2c's needs.
+
+Here are a few common examples of contributions that are _not considered_ upstream-led for our team:
+
+1. Opening a PR to add a major feature to an upstream project.
+2. Creating a brand new project in an open source organization in order to scratch your own itch.
+3. Engaging in reactive open-source work that isn't driven by a clear strategy or goal (e.g., randomly responding to the last few GitHub issue comments you happened to notice)
+
+### How 2i2c plans work around upstream-led contributions
+
+Upstream-led contributions are important to 2i2c both for strategic and tactical reasons. However, when left as unstructured time (as we have historically), it runs into all the problems of unstructured work - it happens in non-strategic ways, it isn't evenly balanced across team members, it is more or less accessible depending on your personal comfort level and skills, etc.
+
+With that in mind, here are a few ways that upstream-led contributions relate to our practices:
+
+- We need to _own upstream-led contributions as a team_, rather than asking individuals to identify and do this work on their own.
+- We need to define team _goals_ and _strategy_ to define the impact we want to have, and what kind of work leads to that impact.
+- We need a team system for identifying and prioritizing the most impactful upstream-led contributions to perform.
+- This system must spread the responsibility of upstream-led contributions across our whole product team.
+- It means we need to give people support and training to do this effectively. For example, helping team members grow into roles that involve upstream work, rotating certain types of contributions across team members, etc.
+- If a team member does significant upstream contribution work _outside_ of this system, it'll be on "their own" time rather than on 2i2c time.
+
+## What's next
+
+This post laid out the big ideas that are driving 2i2c's approach to upstream contributions. We'll share some more specific experiments and team policies that we've designed to help us do our work impactfully and efficiently. More to come!