-
I am a postdoctoral fellow at the Laboratory for Innovation Science at Harvard. I am an applied microeconomist with research interests at the intersection of digital economics, labor and productivity, industrial organization, and socio-technical networks. Specifically, my work has centered around the private provision of public goods, productivity in open collaboration, and welfare effects within the context of open source software (OSS) ecosystems.
-
Working Papers
Generative AI and the Nature of Work (with Manuel Hoffmann, Frank Nagle, Sida Peng, and Kevin Xu)
[SSRN] [Harvard Business School Working Paper 25-021][abstract]
Recent advances in artificial intelligence (AI) technology demonstrate considerable potential to complement human capital intensive activities. While an emerging literature documents wide-ranging productivity effects of AI, relatively little attention has been paid to how AI might change the nature of work itself. How do individuals, especially those in the knowledge economy, adjust how they work when they start using AI? Using the setting of open source software, we study individual level effects that AI has on task allocation. We exploit a natural experiment arising from the deployment of GitHub Copilot, a generative AI code completion tool for software developers. Leveraging millions of work activities over a two year period, we use a program eligibility threshold to investigate the impact of AI technology on the task allocation of software developers within a quasi-experimental regression discontinuity design. We find that having access to Copilot induces such individuals to shift task allocation towards their core work of coding activities and away from non-core project management activities. We identify two underlying mechanisms driving this shift - an increase in autonomous rather than collaborative work, and an increase in exploration activities rather than exploitation. The main effects are greater for individuals with relatively lower ability. Overall, our estimates point towards a large potential for AI to transform work processes and to potentially flatten organizational hierarchies in the knowledge economy.
Labor Competition and Open Innovation (with Manuel Hoffmann and Frank Nagle)[abstract]
Innovation, a key driver of competitive advantage, increasingly occurs through informal partnerships between firms collaborating on public goods. However, competitive forces may influence contribution patterns. Existing studies primarily consider the role competition plays in closed and private innovation. Therefore, we utilize data on millions of firm contributions to open source software, a crowdsourced public good, to measure the relationship between the level of competition a firm faces and its participation in informal innovation partnerships. We further create a novel measure of the labor competition a firm faces using Lightcast data on job postings. While the relationship between labor and product market competition is nuanced and warrants further investigation, labor competition is particularly important in innovative and technology-focused contexts. We find that labor market competition and open source contributions exhibit an inverted U-relationship. The current findings hint at open innovation being a potentially strategic choice in hiring decisions during times of low labor market power of firms.
Quid Pro Code: Peer Effects and Productivity in Open Source Software
[working paper] [slides][abstract]
We empirically examine the extent to which peer effects influence the private provision of public goods. In the case of public information goods, peer contribution may facilitate or otherwise incentivize further contribution from others, effectively subsidizing private provision. Using the setting of Open Source Software (OSS) contribution, we first utilize a reduced form approach to derive causal estimates of net peer effects in public goods contribution by exploiting a peers-of-peers identification strategy. We next develop a structural model of peer-influenced public good provision that both (1) separates extensive and intensive margin contribution decisions and (2) decomposes contribution into marginal private benefits and costs. We apply these methodologies using a sample of peer contribution histories for 2,287 OSS projects hosted on the GitHub collaboration platform. Both reduced form and structural approaches suggest peer effects are much stronger along the extensive margin than the intensive margin. Contemporaneous intensive margin effects, while heterogenous across time and projects, are small and centered around zero, suggesting that strategic complementarity and substitution in peer contribution likely offset one another. Our counterfactual analysis suggests (extensive margin) peer effects account for nearly 56% of cumulative aggregate contribution in the sample, which translates to a value-added of 1-1.5 million software developer labor hours. These results support the notion that OSS is largely developed by disproportionate efforts from smaller groups of dedicated core maintainers, who integrate incremental contributions from the wider community, and casts doubt on the promise for peer effects alone to deliver sustained maintenance labor to individual projects.
No Free Lunch For Programmers: Digital Supply Chains and the Economics of Software Dependency Management
[working paper] [slides][abstract]
Developers of software projects can leverage the functionality of existing open source projects. This practice can potentially lower the cost of development albeit at the inherent risk of relying on external components. A “downstream” project maintainer can choose to “import” elements of an “upstream” project to outsource functionality, but is uncertain how future changes in this dependency project may expose her own project to software faults or vulnerabilities. Software dependency networks therefore represent a “digital supply chain”, an ecosystem of interdependent public goods that confer an intricate set of both positive and negative externalities for project maintainers and end users. Focusing on microeconomic fundamentals of the dependency management problem faced by the risk averse project maintainer, we use both reduced form and structural approaches to study how dependency networks create value, what forces shape their formation, and how individual behavior can influence the robustness of equilibrium network structure. We use a sample of open source software projects from the Node.js JavaScript packaging ecosystem for which contribution and dependency formation decisions are observed in real-time. Finally, we consider several policy interventions that can improve equilibrium welfare. In particular, we find that removing less that 1% of core projects can reduce aggregate project quality by more than 5% for the remaining peers.
In Progress
2024 Survey of Open Source Software Funding (with GitHub, Inc. and the Linux Foundation) -
[pdf]
Education
Univerity of Southern California 2022 Ph.D. Economics Tufts University 2016 M.S. Economics UCLA 2013 B.A. Economics, cum laude Employment
Harvard Business School 2023 - present Postdoctoral Fellow (Laboratory for Innovation Science at Harvard) USC Public Exchange 2018 - 2022 Researcher City of Los Angeles Department of Finance 2018 - 2019 Researcher University of Chicago 2016 - 2017 Research Assistant Awards and Honors
Daniel and Mary James Fellowship 2018 - 2019 USC Union Bank Fellowship 2017 - 2018 USC Linda Datcher Loury Award in Economics for outstanding master’s thesis 2016 Tufts Department of Economics Graduate Research Fellowship 2015 - 2016 Tufts Skills
Python, SQL, R, Julia, Bash, Linux system administration, git / GitHub, reproducible research, geospatial analysis, social network analysis.
-
Resources
awesome-oss-research-data: A (curated) list of empirical research and datasets in the space of open source software. Contributions welcome! [repository]project-template: A simple template for empirical research projects [repository]Software
retrograde: Rewind a git repository through its development history [repository]librariesio-postgres: a set of scripts to load the Libraries.io dataset into PostgreSQL [repository]ghtorrent-postgres: a set of scripts to load the GHTorrent dataset into PostgreSQL [repository]Other Projects
"Female Political Participation in Sub-Saharan Africa". Involvement: lead research assistant, oversaw data collection and processing, built project website and interative visualizations.How do Employment Verification Mandates Impact U.S. Agriculture? (master's thesis) [working paper][abstract]
Employment verification mandates require firms to prove each new hire’s legal right to work in the U.S. and sanction employers who knowingly hire undocumented workers. The effects of such policies are of particular concern to the agricultural sector, which draws from a labor pool in which undocumented workers have been historically overrepresented. This paper empirically estimates the impact of state-level mandates on farm outcomes. By exploiting geographic variation in statewide E-Verify mandates implemented between 2007 and 2012, I use an identification strategy that combines both differences-in-differences and regression discontinuity techniques to isolate the effect of E-Verify on agricultural labor patterns. While addressing a number of identification and estimation challenges, I find that worker density falls by roughly 35% at the policy-change border for states with stringent mandates. Furthermore, the average farm spends less per worker within counties adjacent to this same set of E-Verify states, suggesting some presence of spillover effects.