Introducing the ERW Data Quarry: A Data Sharing System for Enhanced Rock Weathering
Enhanced Rock Weathering (ERW) has the potential to deliver meaningful carbon removal alongside agriculture benefits. Yet there remains a number of open scientific questions related to carbon removal quantification↗ and safety that will be critical to address to unlock this potential.
Without addressing these open questions, the ERW field risks being stuck in a limited-demand future: one in which substantial uncertainty discounting, expensive, measurement-intensive quantification, field heterogeneity, and an incomplete health and safety regulatory framework limits ERW’s ability to scale. We don’t want this future for ERW. We want to quickly, transparently, and collaboratively answer these open questions to help ERW reach its full climate and agronomic impact potential.
But answering these questions requires data. Granular, real-world data. And lots of it.
The Opportunity
Unlocking commercial ERW datasets to answer the most pressing scientific questions facing the field can help to jumpstart a virtuous cycle of deployment-led learning. In a deployment-led learning scenario, real-world deployments help further scientific understanding, which builds trust in the process, unlocking more deployments, and more carbon and farmer benefits.
Why tap into commercial datasets?
First, commercial datasets are big. The majority of academic research draws from just a handful of ERW trial locations. In contrast, commercial ERW deployments are expected to top 20,000 hectares this year, representing over 200x more area than all academic field trials combined.1
Second, commercial datasets are diverse, set across a wide range of soil, climate, and feedstock types. This diversity is crucial to help understand ERW’s effectiveness across a broad suite of contexts, and not just in the Global North where much of the academic research has taken place to date.
Third, commercial datasets are detailed. Many ERW project developers are robustly sampling their ERW deployments to characterize the netCDR of these projects. Maximizing learning from these datasets will unlock the scientific understanding needed to scale the field.
The Challenge:
In practice, commercial data sharing is hard for three major reasons. First, ERW companies have invested millions of dollars into data collection. Data can be seen as proprietary; for example, helping companies identify optimal deployment locations or feedstocks. Second, these companies have responsibilities around farmer data privacy which need to be taken seriously, as ERW relies on the trust and collaboration of farmers around the world. Finally, there is a meaningful collective action issue for market participants: “Why should I share? How do I know others will do the same?”
Layer these challenges together, and it’s easy for the field to remain trapped in the status quo of limited researcher access to commercial data. Many ERW companies are engaged in individual research collaborations with external partners, leading to important, albeit slow, steps forward on specific research topics in specific regions. But to respond to the climate emergency, we need the power of multiple datasets, across multiple contexts, in the hands of multiple researchers to rapidly advance the state of ERW science.
The Action
We are incredibly energized to announce the ERW Data Quarry–the first-ever ERW data sharing system–together with 10 leading ERW companies who have proactively committed to sharing datasets as a part of this system. This is one of the first systems of its kind to proactively share commercial data for the benefit of broad scientific learning in the CDR space. Each of the following companies have committed to sharing specific dataset(s) tied to individual deployments, within a set time frame, through this system.
Frontier, as a leading catalytic buyer coalition in the space, has also committed to requiring dataset contributions to the Data Quarry as part of recent pre-purchases and future ERW offtakes. MilkyWire, another catalytic ERW buyer, will also incorporate data sharing requirements in their next major ERW purchase. These buyers are helping to signal the importance of community-wide data sharing in growing a quality ERW market.
More specifically, these leading companies are committing to sharing data through the following system:
The ERW Data Quarry Solution
A data sharing system that protects commercial and farmer interests while accelerating scientific progress. Finding the right balance between transparency, speed, and commercial interests is a delicate balancing act. Through dozens of stakeholder listening sessions and iterative designs, Cascade has developed the ERW Data Quarry to navigate these tensions by building around the following core principles:
Focus on priority questions: Building on the “Foundations”↗ process, we have identified four priority research topics that commercial datasets can help address. These questions informed the variables collected through the Data Quarry system.
Segment initial access: Health and safety data will be made fully public to build trust in the safety of ERW. Additional data will be made available to researchers through a proposal-based request system.
Fair use: A commitment to use data shared for advancing scientific learning–not for commercial purposes.
Build on existing data norms: Consistent with farmer data sharing norms, ensure farmers’ consent to data sharing and anonymize location data to protect farmer privacy. Consistent with academic norms, make datasets used for peer-reviewed articles public upon publication.
Complement to existing market infrastructure: The Data Quarry focuses on advancing the state of ERW science, and not on validating carbon claims. It is in no way a replacement for the critical role that registries, validation and verification bodies (VVBs), and other carbon market integrity initiatives play in the ecosystem.
A detailed description of how we will be putting these principles into action is available in the post-script below.
Next Steps
Rocks take time to dissolve. And thus, datasets take time to collect. We will be announcing the formal launch of the system in the coming months. In the meantime, reach out to [email protected] if:
ERW Companies: You’d like to be part of the movement to build a healthy ERW market.
Buyers: You are interested in playing a catalytic role in advancing the ERW field.
Researchers: You are interested in accessing commercial ERW data and/or collaborating with leading ERW companies. Please share your ERW-related research interests and expertise, and we will reach out with opportunities to engage.
Commercial datasets are just one part of the puzzle. More complete and accessible datasets from non-commercial field trials can also help advance the state of the field. If you have data from an ERW field trial and would like to be part of advancing the state of ERW science, please reach out.
For Anyone Else: You are interested in following along in this journey, sign up for our newsletter here↗.
We hope that this data sharing system is just the beginning. The organizations committing to the Data Quarry have recognized that the field will grow faster through collaboration and shared learning than if research is conducted in silos–and that they can tap into additional scientific resources as world class researchers build on their work. As researchers begin to connect with commercial datasets, new collaborations between partners proliferate, and scientific understanding of ERW advances, collectively, we’ll learn a lot more than we would have otherwise.
At Cascade, we are committed to stewarding this data sharing process to support the ERW community in advancing the most pressing scientific questions. We expect to learn a few things about unlocking deployment-led learning that can help other climate pathways along the way.
In the meantime, tip your hat to the leading companies who have rolled up their sleeves, moved from talk to action, and are committing real data to advance the state of ERW science.
Post Script: Additional Details
Data Flow
The data system will be comprised of two components:
A Publicly Accessible Data Source: This will include: a) all health and safety data, and b) datasets that have been used for academic peer reviewed articles, released at the time of article publication. CarbonPlan is leading development on CDRXIV, a new centralized hub for open academic and commercial CDR preprints and data, which will be used as the public repository.
A Permissioned Data Catalog: Cascade will also host the ERW Data Quarry that provides a description of the datasets and variables available. Researchers will be able to request access to individual datasets by submitting a brief research proposal and a financial conflict of interest statement. Cascade will facilitate a neutral governing board to determine dataset release based on: a) if there are any credible conflicts of interest, and b) if the research proposal relates to advancing ERW science. Research proposals will be shared with the Data Quarry community to facilitate collaboration across research groups.
See Figure 1 for a summary of the data flow described above.
Figure 1: How datasets and information will flow across organizations
Data Variables:
Whenever possible, datasets in the Data Quarry will share a common set of core variables, as listed in the pdf below. These variables were selected based on their importance to answering the highest priority research questions that overlap with data collected in commercial deployments.
Building on learnings from the “Foundations”↗ process, these research themes are:
Health and safety
Time lags in carbon removal
Dissolution rates
Soil organic carbon fluxes
Data will be provided at the individual sample level, anonymized to the administrative district to protect farmer anonymity. Quality metadata, consistent with FAIR principles, will be enforced to ensure datasets are usable for research purposes.
Outputs:
Scientific understanding will be advanced primarily through peer-reviewed article publications to build our collective understanding of ERW processes. In addition, Cascade is committed to an annual ‘State of the Science’ white paper to facilitate rapid dissemination of insights drawn from the full set of data available.
How We Got Here:
Cascade hosted over 50 listening sessions with academics, project developers, and buyers to develop a data sharing system that addresses each community’s concerns. We also reviewed over twenty analog data sharing systems across a range of fields, from cancer research to ocean exploration to soil organic carbon.
Footnotes
Based on a survey conducted with 16 project developers, analysis of publicly announced deployments and a scan of published ERW field trials.
↩