<?xml version="1.0" encoding="utf-8" standalone="yes"?><feed xmlns="http://www.w3.org/2005/Atom"><title xml:lang="en-us">Research Organization Registry (ROR)</title><subtitle xml:lang="en-us">The Research Organization Registry (ROR) is a global, community-led registry of open persistent identifiers for research and funding organizations.</subtitle><author><name>Research Organization Registry (ROR)</name><uri>https://ror.org/</uri></author><id>https://ror.org/feed.xml</id><link rel="self" type="application/atom+xml" href="https://ror.org/feed.xml" hreflang="en-us"/><link rel="alternate" type="application/rss+xml" href="https://ror.org/index.xml" hreflang="en-us"/><link rel="alternate" type="text/html" href="https://ror.org/" hreflang="en-us"/><generator uri="https://gohugo.io">Hugo 0.101.0</generator><icon>https://ror.org/img/favicon.ico</icon><logo>https://ror.org/img/ror-logo-800.png</logo><rights>All ROR IDs and metadata are provided under the Creative Commons CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/. All other content on ror.org is licensed under the Creative Commons Attribution 4.0 International License https://creativecommons.org/licenses/by/4.0/.</rights><updated>2026-04-16T00:19:41+00:00</updated><entry xml:lang="en-us"><title xml:lang="en-us">Strengthening ROR’s Commitment to the Principles of Open Scholarly Infrastructure</title><id>https://doi.org/10.71938/2xy9-em92</id><link rel="alternate" href="https://ror.org/blog/2026-03-27-strengthening-commitment-to-posi/"/><published>2026-03-27T00:00:00+00:00</published><updated>2026-03-27T13:31:40-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<!-- Commonly used content 

<figure class="featured-figure"><img src="/img/blog/"/>
</figure>


<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span></span></div>
	</div>
</div>

<figure class="blog-figure"><img src="/img/blog/"/>
</figure>



<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/XXX" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video"></iframe>
</div>


<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div> 
-->
<figure class="featured-figure"><img src="/img/lions-fountain.jpg"
         alt="Fontaine la rotonde -- Aix en Provence -- France, by lecreusois on Pixabay."/>
</figure>

<p>Since its founding, ROR has been committed to providing open infrastructure for the global community. As part of this commitment, in December 2020, ROR released a <a href="/blog/2020-12-16-aligning-ror-with-posi/">public self-assessment</a> of our alignment with the <a href="https://openscholarlyinfrastructure.org/">Principles of Open Scholarly Infrastructure</a> (POSI).</p>
<p>At the time, the POSI framework, originally articulated in 2015, had been <a href="https://openscholarlyinfrastructure.org/posi-v1.0/">recently (re)codified</a> as a set of concrete guidelines, inspiring a small <a href="https://openscholarlyinfrastructure.org/adopters/">group</a> of open scholarly infrastructure initiatives to assess how they were aligned with these principles.</p>
<p>For ROR, this context was particularly relevant as it was in the early stages of formalizing an operating model and exploring pathways to long-term sustainability. POSI provided a helpful roadmap to inform ROR’s strategic decision-making and a useful framework for contextualizing and organizing its operations.</p>
<p>A revised version of the Principles of Open Scholarly Infrastructure (<a href="https://openscholarlyinfrastructure.org/POSI_compare_october2025_rationale_3.pdf">v2.0</a>) was released in October 2025 (following a minor <a href="https://openscholarlyinfrastructure.org/posi-v1.1-revisions.pdf">v1.1 revision</a> in November 2023). As an early POSI adopter and a member of the POSI adopters group that formed after the 2020 recodification, ROR was involved in shaping the revision alongside other community collaborators. The v2.0 revision does not alter the spirit or intent of the principles, but rather clarifies and expands on some key points and nuances that emerged through community consultations and also in response to ongoing changes in the scholarly infrastructure landscape.</p>
<p>In the wake of the POSI v2.0 release, in recognition of the five years since our initial self-assessment, and having recently <a href="https://ror.org/blog/2026-03-03-ror-annual-meeting-highlights/">marked the milestone of ROR’s seventh anniversary</a>, we wanted to take this opportunity to reflect on the current state of ROR’s alignment with the POSI framework and re-assert our commitment to providing open scholarly infrastructure for the global community.</p>


<h2 id="evaluation-process">Evaluation process 
</h2>
<p>The POSI framework focuses on three key areas: <strong>accountability (governance)</strong>, <strong>resourcing (sustainability)</strong>, and <strong>protection of community interests (insurance)</strong>. Twenty principles are articulated within these three areas. The principles are intended to provide guidance; they are not intended to function as a set of rules or a checklist, and it is not expected or required that all of the principles will be relevant for every organization or initiative.</p>
<p>For ROR’s revised self-assessment, we reviewed the updates from POSI version 1.1 and version 2.0 and evaluated ROR’s current and planned activities against these newer versions of the principles.</p>
<p>Similar to the 2020 assessment, we evaluated degrees of alignment with the principles using a “traffic light” ranking, where green indicates strong alignment, yellow indicates partial alignment or alignment in process, and red indicates weak alignment or non-alignment. There is no standard POSI assessment metric; some organizations have adopted the traffic light framework while others have taken different approaches. We present the results of the new assessment alongside the results from the 2020 assessment to indicate where alignment has remained consistent or evolved.</p>
<p>As ROR is a jointly operated initiative and not a standalone organization by design, some of the principles are less applicable than others. If an item is marked as green, this doesn’t necessarily mean ROR does this perfectly or that our work in this area is done; it simply indicates that we feel confident about our level of alignment and continued ability to maintain this alignment.</p>
<p>While we believe that the assessment indicators are helpful for understanding the state of ROR’s alignment at a glance, we also take seriously the details behind the indicators, and care about documenting and understanding the specific actions and decisions make alignment possible, so that we can integrate this work as a daily practice and continue to seek opportunities for growth and improvements over time.</p>


<h2 id="governance">Governance 
</h2>
<figure><img src="/img/blog/posi/posi-table-governance.png"
         alt="ROR self-evaluation of compliance with POSI governance principles."/>
</figure>

<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Coverage across the scholarly enterprise</strong> – Research transcends disciplines, geography, institutions, and stakeholders. Organisations and the infrastructure they run need to reflect this.</span></div>
	</div>
</div>
<p>ROR is a global registry that aims to represent all research organizations in the world and to be adopted and used all over the world. ROR IDs and accompanying metadata records encompass a diverse and global set of research organizations and organization types. The registry continues to be updated to include new organizations, particularly from regions historically underrepresented in research infrastructure, and to reflect metadata updates that support global adoption, such as including name variants in multiple languages and character sets.</p>
<p>ROR infrastructure is freely and openly available and can be used in any type of system and anywhere in the world without restrictions.</p>
<p>ROR community activities aim to reflect and support global usage and engagement. Community meetings and case studies feature a range of users and implementations. ROR’s Steering Group, Curation Advisory Board, and community forums similarly encompass and reflect diverse national and regional perspectives as well as a mix of organization types.</p>
<p>While ROR communications and community activities are primarily conducted in English and meetings are primarily conducted during Western Hemisphere working hours, we aim to broaden the linguistic and geographic scope of our communications and activities by producing translations of key community resources, using video translations and captions, scheduling some meetings and events in alternate time zones, and conducting meetings and communications in non-English languages when possible.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Stakeholder governed</strong> – A board-governed organisation drawn from the stakeholder community builds confidence that the organisation will make decisions driven by community consensus and a balance of interests.</span></div>
	</div>
</div>
<p>ROR is not a standalone organization by design and therefore has no official mechanism for board governance. ROR governance operations are carried out by three nonprofit organizations that are representative of ROR’s key stakeholder communities and specified in an official Memorandum of Agreement.</p>
<p>ROR also engages stakeholder input and guidance on strategic directions through its Steering Group, open community calls, Curation Advisory Board, and public calls for feedback.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Non-discriminatory participation or membership</strong> – We see the best option to be an “opt-in” approach with principles of non-discrimination and inclusivity, where any relevant group may express an interest and should be welcome. Representation in governance must reflect the character of the community or membership.</span></div>
	</div>
</div>
<p>As a multi-organization initiative, ROR’s governance model is implemented by its three operating organizations, which are nonprofit organizations that collectively represent key stakeholder communities for ROR and whose commitment to ROR has been approved by their respective governing bodies.</p>
<p>ROR is designed to be a community-driven open infrastructure initiative that has no membership or service model by design and that encourages active community engagement to inform its ongoing activities. ROR infrastructure can be accessed and used by anyone without a membership or a license or any other type of restriction. ROR community activities, such as community meetings and events, calls for feedback, bug reports and feature suggestions, and curation requests are openly available for anyone to opt-in.</p>
<p>As described in more detail elsewhere in this assessment, while ROR’s community activities and communications are conducted primarily in English during Western Hemisphere working hours, we take steps to support multilingual, global, multi-modal, and asynchronous engagement, to convey that all community members are welcome to participate regardless of where they work or how they prefer to communicate. We also seek to highlight users and adopters from a range of organization types and industries.</p>
<p>The registry’s inclusion criteria for organizations is based on adherence to ROR’s scope, which is established by community experts and stakeholders. Any organization can be included in the registry within this scope; there are no fees or licenses or other permissions required to add an organization to the registry or to maintain the metadata associated with the organization’s record.</p>
<p>ROR convenes a Curation Advisory Board and Steering Group made up of members of the community that are invited to provide specific guidance and advice on ROR activities. These groups do not have formal governance authority over ROR. Members are currently selected through an invitation process that is overseen by ROR’s operating organizations and core staff members; anyone is welcome to express interest in participating. Group operations are documented on GitHub and the ROR website.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Transparent governance</strong> – To foster trust, the processes and policies for governing the organisation and selecting representatives to governance groups should be transparent (within the constraints of privacy laws).</span></div>
	</div>
</div>
<p>ROR is operated as a collaborative initiative by California Digital Library, Crossref, and DataCite. The three ROR governing organizations collectively assume and share responsibility for ROR governance, operations, resourcing, and decision-making. These responsibilities are defined in a Memorandum of Agreement (MOA) that is approved by the governance bodies of each organization, renewed on a regular basis, and published on the ROR website. The MOA outlines how ROR governing operations are conducted and also stipulates specific principles of governance, for example, that ROR cannot be governed by, purchased by, controlled by, sold to, or otherwise transferred to a commercial entity.</p>
<p>ROR’s Memorandum of Agreement was first signed in 2020. Since then, the ROR operating organizations have renewed and revised the agreement multiple times to provide more clarity and transparency about ROR’s governance model and to keep the agreement aligned with current operations. The Memorandum of Agreement is now also made available on the ROR website.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Cannot lobby</strong> &ndash; Infrastructure organisations should not lobby for regulatory change to cement their own positions or narrow self-interest. However, an infrastructure organisation’s role is to support its community, and this can include advocating for policy changes.</span></div>
	</div>
</div>
<p>ROR’s primary focus is to provide useful infrastructure for the community in the form of data and services that support a variety of use cases and needs. ROR does not lobby nor include regulatory change as part of its remit.</p>
<p>ROR operates on a core set of values and principles, including but not limited to POSI, based on the premise that scholarly infrastructure should be open. These values and principles are not motivated by self-interest but by the belief that open infrastructure provides a collective benefit.</p>
<p>ROR team members and community stakeholders participate in numerous activities and initiatives to advance ROR’s objectives in the name of the common good. Some of these activities and initiatives may directly or indirectly play a role in influencing policy decisions, such as in the emergence of national persistent identifier policies in multiple countries that may specifically mention ROR, but that are focused on addressing the needs of the global research community.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Living will</strong> &ndash; To build trust, organisations should establish and communicate clear commitments regarding their long-term stewardship responsibilities, including the principles by which assets, data, resources, services, and staff would be responsibly transferred to a successor or the organisation or service wound down. The commitments should address future governance, with defined criteria for acceptable successor organisations. This should include continued alignment with POSI and any legal or structural constraints.</span></div>
	</div>
</div>
<p>ROR operations and governance structures have accounted for long-term stewardship as well as future scenarios in which the service might be wound down.</p>
<p>ROR data is openly available under a CC0 waiver by default, allowing access to persist beyond the life of the initiative. ROR code is openly licensed on GitHub and can be forked by anyone. ROR data files are published on Zenodo and archived through Zenodo’s processes.</p>
<p>ROR’s operating organizations have a Memorandum of Agreement, which is published on the ROR website, that describes how ROR would be shut down and how relevant responsibilities could be transferred to responsible successors, if applicable. The succession plan also accounts for assets such as code repositories, cash funds, and technical infrastructure.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Regular review of purpose and community value</strong> &ndash; Organisations and services should regularly review their relevance, effectiveness, and the level of community support to determine whether their continued operation is necessary. If no longer needed, they should take responsible steps to transition or wind down operations in consultation with the community and in alignment with their living will.</span></div>
	</div>
</div>
<p>ROR’s operating organization and core team members regularly assess ROR’s strategic objectives, level of adoption and usage, and ongoing progress in order to ensure that ROR’s offering is aligned with community needs.</p>
<p>In the event that ROR’s operating organizations identify evidence that ROR is no longer needed, they will take the necessary steps to wind down the initiative in accordance with the process outlined in the ROR operating organizations’ Memorandum of Agreement.</p>


<h2 id="sustainability">Sustainability 
</h2>
<figure><img src="/img/blog/posi/posi-table-sustainability.png"
         alt="ROR self-evaluation of compliance with POSI sustainability principles."/>
</figure>

<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Transparent operations</strong> &ndash; To enable organisational accountability and openness, the operating policies and procedures, detailed financials, sustainability models, fees, strategic and product roadmaps, organisational charts, and other appropriate operational information should be made openly available (within the constraints of privacy laws). Information should be available for investigation and reuse by the community.</span></div>
	</div>
</div>
<p>While ROR has always maintained transparency in terms of code and technical development, other aspects of its operations have not always been transparent and we therefore have taken deliberate steps to increase transparency on various fronts.</p>
<p>ROR’s curation process began transitioning to GitHub in 2020 and was made fully transparent and operational in 2022. Anyone with or without a GitHub account can access registry curation requests in the pipeline, view decisions that are made, and track when updates are implemented.</p>
<p>ROR’s product development roadmap is also publicly available on GitHub and users are invited to follow progress and submit issues for consideration.</p>
<p>The Memorandum of Agreement that outlines ROR’s operating and governance model is publicly available on the ROR website. The ROR website includes additional documentation about ROR operations, such as information about current team members and a timeline of how ROR first emerged. ROR has also begun posting annual financial reports on the website that go back to 2023 following the establishment of its current operating model in 2022. We do not currently make detailed financial information publicly available, such as staff salaries, or the amounts that individual organizations contribute to ROR.</p>
<p>Organizations and community members that participate actively in ROR operations are acknowledged on the website, including financial contributors, Curation Advisory Board members, and Steering Group members. We do not publicize the names of community members on our mailing list or those who attend our community meetings.</p>
<p>We continue to identify and consider future opportunities to make operations more transparent to ensure trust and also provide useful information for the community. Such areas could include documentation about how we run community meetings and structure community groups, incident response operations, and communication guides.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Time-limited funds are used only for time-limited activities</strong> &ndash; Operations should be supported by sustainable revenue sources, whereas time-limited funds are used only for time-limited activities. Depending on grants to fund ongoing and/or long-term operations fully makes organisations fragile and distracts from maintaining core infrastructure.</span></div>
	</div>
</div>
<p>When ROR was first being established, it was resourced through a mixed and unpredictable funding model that combined in-kind support, grant funding, and community investments. As of 2022, ROR’s core expenses, including personnel, are currently funded through in-kind support from ROR’s operating organizations and are not dependent on any time-limited funds. ROR receives additional ad hoc contributions from community supporters but this is not our primary source of funding.</p>
<p>ROR does not charge for access and usage and does not intend to do so in the future.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Goal to generate surplus</strong> &ndash; It is not enough to merely survive; organisations and services have to be able to adapt and change. Organisations and services that define long-term sustainability based only on recovering costs risk becoming brittle and stagnant. To weather economic, social and technological volatility, organisations and services need financial resources beyond immediate operating costs.</span></div>
	</div>
</div>
<p>ROR’s core operating costs have stable funding from its operating organizations, plus additional funds from organizations that provide ad hoc financial support for ROR. We maintain reserves to carry forward 1-2 years. We review our budget throughout the year to anticipate any volatility.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Establish and maintain financial reserves guided by policy</strong> &ndash; Organisations and services should have a clear policy on maintaining financial reserves, including the purpose, minimum and maximum level, and governance of these funds. The actual level of reserves should be determined and periodically reviewed by the governing body, ensuring that resources are available to support Living Will implementation, including an orderly wind-down, transition to a successor, or response to major unforeseen events. A financial reserve policy might include how funds will be held, under what circumstances they will be used, and how much would be necessary for an adequate wind-down or transfer of assets, given the complexity of the organisation’s infrastructure.</span></div>
	</div>
</div>
<p>In addition to outlining support for ROR’s ongoing operations, ROR’s Memorandum of Agreement specifies procedures for winding down ROR operations and transferring assets as needed.</p>
<p>ROR’s financial planning and modeling processes are carried out by representatives of its operating organizations. ROR does annual and mid-year forecasting to anticipate any unforeseen expenses.</p>
<p>ROR aims to keep costs as low as possible while effectively and sustainably supporting core expenses and operations. ROR adheres to an informal policy of maintaining reserves to carry forward 1-2 years and we are working to codify this policy and make it more transparent and durable.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Mission-consistent revenue generation</strong> – Revenue sources should be evaluated against the infrastructure’s mission and not run counter to the aims of the organisation or service.</span></div>
	</div>
</div>
<p>ROR has no revenue model and no current plans to offer a paid service; our objective is to continue offering ROR data and services freely available to everyone.</p>
<p>While ROR explored different potential revenue models during its startup phase, the ROR operating organizations made a decision in 2022 to fund ROR’s core operating expenses on a long-term basis so that ROR could continue to operate as a free and open service for everyone without being dependent on time-limited funding and without needing to develop a paid service model or distinct revenue stream.</p>
<p>ROR receives additional ad hoc funds from contributor organizations that believe in our mission. These are not directed contributions and there is no reward or “pay-to-play” option offered in exchange for these contributions. Such contributions help to offset ROR’s operating expenses and support time-limited activities. ROR does not depend on these contributions to be operational.</p>
<p>ROR has received grant funding in the past for time-limited activities. In the future, we may pursue additional grant funding for time-limited projects in accordance with our strategy and mission.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Revenue generated from services, not data</strong> &ndash; Data related to the running of the scholarly infrastructure should be community property. Appropriate revenue sources might include value-added services, consulting, API Service Level Agreements, or membership fees.</span></div>
	</div>
</div>
<p>ROR makes both its data and services openly available for free. Data and code are made available under a CC0 waiver and MIT license, respectively. Our aim is to continue operating ROR as a free and open dataset and service.</p>
<p>While a value-added service model was considered after ROR’s initial launch, we have no current plans to develop any paid services. We may consider some service access constraints in the future such as API credentials to ensure overall stability and performance of our services, but such changes would not involve a fee model.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Volunteer labour</strong> &ndash; Organisations that rely on volunteers and their labour should recognise this as a valuable resource for the organisation’s long-term viability, and factor it into sustainability planning and risk management.</span></div>
	</div>
</div>
<p>ROR has a dedicated team of paid staff members that provide support for ROR’s core operations. While ROR operates as a community-driven initiative, and regularly seeks community input and engagement to inform its operations and strategic directions, it does not rely on community volunteers for essential operations.</p>
<p>In the early days of the initiative, ROR relied on some community volunteers to advise on curation processes. We identified when this level of involvement was no longer viable and needed to be replaced with dedicated resourcing. We have not otherwise relied on volunteers for any core activities.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Transition planning</strong> &ndash; Organisations that are heavily dependent on a limited number of individuals should take steps to reduce their dependence on these individuals, including via transition and succession planning, so that the organisation is not at risk of collapse in the event of their departure.</span></div>
	</div>
</div>
<p>ROR’s Memorandum of Agreement supports continued staffing of core personnel roles so that we can ensure ongoing support for key functions. We have processes and logins documented and centralized to support continuity of institutional knowledge, and we have experienced staff transitions to test the quality of these safeguards, both in terms of staff members switching organizations and also leaving entirely. We maintain comprehensive onboarding and offboarding checklists for joining and departing team members, and documentation for new hires. Our technical processes are documented and regularly reviewed with backup support.</p>
<p>Like any small, mission-driven, community-oriented initiative, ROR depends on a limited number of passionate individuals who dedicate time and energy to their work and who have established trusted relationships with advisors, users, and stakeholders. While we have put several measures in place to make sure that we are not dependent on specific individuals such that we would be at risk of collapse, any departure of key personnel will always have an impact, and it would be naive to assume otherwise.</p>


<h2 id="insurance">Insurance 
</h2>
<figure><img src="/img/blog/posi/posi-table-insurance.png"
         alt="ROR self-evaluation of compliance with POSI insurance principles."/>
</figure>

<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Open source</strong> &ndash; All software and non-physical assets required to run the infrastructure should be available under an open-source licence. This does not include other software that may be involved with running the organisation.</span></div>
	</div>
</div>
<p>All of ROR’s code and software processes are openly stored and documented on GitHub. Code is published under a fully permissible MIT License. We rely on additional tools and software to support our development work, such as hosting and logging. Wherever possible, we leverage open source components. We provide extensive documentation about how to access and query ROR data, and how to run the REST API locally, to enable other projects to use our data and code.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Ensure open and secure data accessibility within legal and ethical constraints</strong> &ndash; To support potential forking or replication, infrastructure should aim to make all relevant data openly available, following best practices such as applying a CC0 waiver where appropriate. This must be balanced with compliance with privacy, data protection, and security requirements. Organisations should have a clear policy outlining how private or sensitive data will be handled—particularly in the event of a transfer to another organisation—to ensure continuity, legal compliance, and responsible stewardship.</span></div>
	</div>
</div>
<p>The ROR dataset is made openly available under the terms of a CC0 waiver and can be accessed via ROR’s data file or by querying the ROR API. ROR code is openly available and forkable under the terms of a MIT License.</p>
<p>No private or privileged information is included in the ROR dataset. ROR has a privacy policy published on our website that outlines how we handle private information, such as personal details of individuals on our community mailing list.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Available and preserved</strong> &ndash; It is not enough that content, data, and software be “open” if there is no practical way to obtain them. These resources should be made easily available with clear public documentation about where they are and how to access them, as well as an open licence where possible. It is not enough that “open” resources are available. In line with the Living Will, it is essential to deposit content, data, and software with at least one trusted third-party digital archive.</span></div>
	</div>
</div>
<p>ROR data is openly available under a CC0 waiver and can be accessed in multiple ways, all of which are publicly documented. The ROR data file is published on the Zenodo digital repository and is subject to Zenodo’s preservation practices, which include regular backup, replication, and fixity processes, a retention policy of at least 20 years, and succession planning in the event of repository closure.</p>
<p>ROR code is available on GitHub under the terms of a MIT License and code repositories are automatically archived in Software Heritage. The archiving schedule is determined by Software Heritage. We may consider other automated or proactive mechanisms for code preservation moving forwards.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Patent non-assertion</strong> &ndash; The organisation should commit to a patent non-assertion policy or covenant. The organisation may obtain patents to protect its own operations, but not use them to prevent the community from replicating the infrastructure.</span></div>
	</div>
</div>
<p>ROR is a fully open and public registry of factual information about research organizations. The metadata practices and curatorial stewardship that we apply to ROR registry data are not considered to be intellectual property of any ROR-related entity. Due to its factual nature, the information stored in the registry cannot be copyrighted or patented. In addition, ROR asserts no claims to registry data via the application of the CC0 1.0 public domain dedication.</p>
<div class='callout dark-grey'>
	<div class="row">
		<div class="col-md-12"><span><strong>Prioritise interoperability and open standards to ensure continuity and resilience</strong> &ndash; Infrastructures should adopt and support widely accepted open standards—both formal and de facto—to ensure that systems, data, and services can be replicated, migrated, or integrated with minimal disruption without the use of proprietary extensions or software. Where relevant, organisations should document dependencies on standards.</span></div>
	</div>
</div>
<p>ROR data can be accessed easily without proprietary extensions or software. For example, the ROR data file is available in both json and csv formats, and any integration can retrieve metadata via the REST API.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Comments? Write us at <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">With the release of v2 of the Principles of Open Scholarly Infrastructure (POSI), we take this opportunity to reflect on ROR’s alignment with the POSI framework and re-assert our commitment to providing open scholarly infrastructure for the global community.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Enriching Europe PMC with SciLifeLab's ROR ID</title><id>https://doi.org/10.71938/swyy-1w10</id><link rel="alternate" href="https://ror.org/blog/2026-03-23-europepmc-scilifelab-ror/"/><published>2026-03-23T00:00:00+00:00</published><updated>2026-03-25T09:55:53-04:00</updated><author><name>Melissa Harrison</name><uri>https://orcid.org/0000-0003-3523-4408</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<!-- Commonly used content 

<figure class="featured-figure"><img src="/img/blog/"/>
</figure>


<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span></span></div>
	</div>
</div>

<figure class="blog-figure"><img src="/img/blog/"/>
</figure>



<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/XXX" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video"></iframe>
</div>


<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div> 
-->
<figure class="featured-figure"><img src="/img/blog/europepmc/Europe-PMC-Extended-Logo.png"
         alt="Europe PMC logo."/>
</figure>

<p>Europe PMC has <a href="https://ror.org/tags/europe-pmc">long been at the forefront of ROR adoption</a>. In their latest ROR-related project, they&rsquo;ve used ROR in publication metadata to help their partner SciLifeLab, a major research facility and funder, make almost 17,000 linked research publications discoverable on Europe PMC. Read on to learn more about this important open metadata initiative.</p>
<hr>


<h2 id="tell-us-about-europe-pmc-and-scilifelab">Tell us about Europe PMC and SciLifeLab. 
</h2>
<p><a href="https://europepmc.org">Europe PMC</a> is hosted by <a href="https://www.ebi.ac.uk/">EMBL’s European Bioinformatics Institute (EMBL-EBI),</a> one of the six sites of the <a href="https://www.embl.org/">European Molecular Biology Laboratory (EMBL)</a>, an intergovernmental organization dedicated to molecular biology research, and Europe’s flagship laboratory for life sciences. <a href="https://www.scilifelab.se/">SciLifeLab</a> is a Swedish national center for large-scale research and one of the largest molecular biology research laboratories in Europe.</p>
<p>Building on an existing collaborations, the two organisations established a joint Memorandum of Understanding (MoU). As part of this the Europe PMC team has been working closely with SciLifeLab on opportunities to jointly strengthen support of open science.</p>


<h2 id="how-are-you-using-ror-ids">How are you using ROR IDs? 
</h2>
<p>Too often, research outputs aren’t linked to the organisations that fund them. Even when ROR IDs exist, they aren’t always connected to the relevant research, making it challenging to identify which organisations supported the work. This is the exact challenge SciLifeLab worked with Europe PMC to tackle.</p>
<p>SciLifeLab identified, through human curation, numerous relevant articles that they supported, which weren’t linked to them. SciLifeLab shared those article identifiers with Europe PMC, which linked their ROR ID to the relevant articles. This added ROR IDs to over over 16,500 items in the Europe PMC corpus, now openly available via the <a href="https://europepmc.org/RestfulWebService#!/Europe32PMC32Articles32RESTful32API/search">Europe PMC Articles API</a> and through search on the <a href="https://europepmc.org/search?query=ORG_ID%3Aror.org%2F04ev03g22">website interface</a> using the syntax <code>ORG_ID:ror.org/04ev03g22</code>.</p>
<figure><img src="/img/blog/europepmc/ror-search-scilifelab-europe-pmc.png"
         alt="Screenshot of Europe PMC search for SciLifeLab&#39;s ROR ID."/><figcaption>
            <p>Users can now find research supported by SciLifeLab by searching on SciLifeLab&rsquo;s ROR ID <a href="http://ror.org/04ev03g22">ror.org/04ev03g22</a>.</p>
        </figcaption>
</figure>



<h2 id="how-did-you-add-ror-ids">How did you add ROR IDs? 
</h2>
<p>The team at SciLifeLab have been developing ways to identify and link their research outputs such as software and data to publications. An initial step in this process is to identify their publication output, which is no small feat! SciLifeLab is not always included as an author affiliation so current advances in machine learning and AI would not identify this. SciLifeLab has an internal archive of publications that are produced by, or affiliated with, SciLifeLab, which is available via an <a href="https://publications-affiliated.scilifelab.se/publications/table">published online web interface</a>.
 In order to generate this list, human curation/reporting is required. This is a similar challenge shared by many institutions, the EMBL open science team included. Both EMBL and SciLifeLab will continue to send periodic updates to Europe PMC to keep the data current.</p>


<h2 id="what-else-would-you-like-to-say">What else would you like to say? 
</h2>
<p>Complete, open metadata is fundamental to how science is communicated, discovered and built upon. When the link between research outputs and the organisations behind them is missing, that chain breaks down. This work with SciLifeLab has strengthened the Europe PMC corpus with trusted, human-verified ROR ID metadata, and is a step towards a more complete and transparent scholarly record.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Comments? Questions? Write us at <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Europe PMC has long been at the forefront of ROR adoption. In their latest ROR-related project, they've used ROR in publication metadata to help their partner SciLifeLab, a major research facility and funder, make almost 17,000 linked research publications discoverable on Europe PMC. Read on to learn more about this important open metadata initiative.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">From User Stories to High-Quality Data: Implementing ROR on the Janeway Platform</title><id>https://doi.org/10.71938/e3fb-v153</id><link rel="alternate" href="https://ror.org/blog/2026-03-17-janeway-and-ror/"/><link rel="related" href="https://www.openlibhums.org/news/926/"/><published>2026-03-17T00:00:00+00:00</published><updated>2026-03-17T07:55:19-04:00</updated><author><name>Joseph Muller</name><uri>https://orcid.org/0000-0003-3230-6090</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/janeway/olh-janeway-logo.png"
         alt="Open Library of Humanities and Janeway logo."/>
</figure>

<p>Joe Muller, Senior Publishing Technologies Developer at the <a href="https://www.openlibhums.org/">Open Library of Humanities</a>, tells us in this case study how and why the <a href="https://www.openlibhums.org/site/janeway/">Janeway</a> platform uses ROR, ensuring unambiguous author affiliations by retrieving ROR IDs from ORCID profiles as well as allowing authors to find their institution with a ROR-powered search.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;In the diamond open-access (OA) funding landscape, ROR &hellip; provides a key to understanding fair funding streams.&rdquo;</p>
<p>&ldquo;Use the ORCID option to register on Janeway, and we will pull your latest public affiliation. If the affiliation has a ROR ID on ORCID, which most of them do, it will have a ROR ID on Janeway.&rdquo;</p>
<p>&ldquo;If we make it easy for authors to include ROR-linked affiliations in their Janeway affiliations, they will.&rdquo;</p>
<p>&ndash; Joe Muller, Senior Publishing Technologies Developer at the Open Library of Humanities</p>
</span></div>
	</div>
</div>
<hr>
<p>As an open-access, scholar-led publisher, the Open Library of Humanities (OLH) should do everything we can to make sure metadata from our platform Janeway works seamlessly across the wide spectrum of places where published works might be discovered, read, and archived. The Research Organization Registry (ROR) provides the key to identifying the author affiliations attached to those works unambiguously, through the magic of uniform resource identifiers (URIs).</p>
<p><strong>In the diamond open-access (OA) funding landscape, ROR also provides a key to understanding fair funding streams.</strong> Do the employers of Janeway authors contribute to Janeway’s development via membership in the OLH? Or are there imbalances, where institutions with a lot of Janeway authors and a lot of wealth do not contribute to a part of the diamond OA ecosystem they benefit from? We can crunch the data to find out, if we have unambiguous ways to link up a majority of articles in Janeway with their authors’ institutions.</p>


<h2 id="how-janeway-makes-it-easy-for-authors-to-use-ror">How Janeway makes it easy for authors to use ROR 
</h2>
<p>On Janeway, you interact with ROR when you record your institutional affiliation(s). Most often, that’s when you submit an article to be published on Janeway. You can search for the name or acronym of your institution to choose the correct ROR record. We display the institution’s name variants, location, website, and a link to the original record on the ROR website to help with tricky cases of identifying the right organization. You can put in multiple affiliations, and you can also enter a job title, department, and start and end dates, for most journals.</p>
<figure class="blog-figure"><img src="/img/blog/janeway/janeway-ror-search.png"
         alt="ROR-powered search for an organization in Janeway."/><figcaption>
            <p>You can search for the name or acronym of your institution and choose the correct ROR record.</p>
        </figcaption>
</figure>

<p>It’s even simpler if you have an ORCID with a public profile. <strong>Use the ORCID option to register on Janeway, and we will pull your latest public affiliation. If the affiliation has a ROR ID on ORCID, which most of them do, it will have a ROR ID on Janeway.</strong></p>
<figure class="blog-figure"><img src="/img/blog/janeway/orcid-login.png"
         alt="ORCID login screen."/><figcaption>
            <p>Logging in with ORCID saves you time and effort in Janeway.</p>
        </figcaption>
</figure>

<figure><img src="/img/blog/janeway/janeway-author-information.png"
         alt="Screenshot of author information pulled from ORCID in Janeway."/><figcaption>
            <p>If you’re a first-time user, Janeway brings your latest public affiliation over from ORCID, including ROR IDs, and then gives you the option to edit it.</p>
        </figcaption>
</figure>

<p>After publication, the ROR ID will be included in structured metadata streams such as Crossref DOI records, JATS XML files, and OAI-PMH feeds.</p>
<figure><img src="/img/blog/janeway/janeway-ror-crossref-doi.png"
         alt="JSON metadata from Crossref API."/><figcaption>
            <p>The Crossref DOI metadata for an article in the Walt Whitman Quarterly Review includes ROR IDs for author affiliations. See <a href="https://api.crossref.org/works/https://doi.org/10.17077/0737-0679.34889">https://api.crossref.org/works/https://doi.org/10.17077/0737-0679.34889</a>.</p>
        </figcaption>
</figure>

<figure><img src="/img/blog/janeway/janeway-ror-article-display.png"
         alt="Web page for article in the Walt Whitman Review."/><figcaption>
            <p>A link to the ROR ID is also displayed next to the author&rsquo;s affiliation on the landing page for the article at <a href="https://doi.org/10.17077/0737-0679.34889">https://doi.org/10.17077/0737-0679.34889.</a>.</p>
        </figcaption>
</figure>

<p>Scholars have to wade through so many digital systems to get their work done, funded, published, and recognized, that they have understandable fatigue with learning new systems.</p>
<p><strong>If we make it easy for authors to include ROR-linked affiliations in their Janeway affiliations, they will.</strong> On the other hand, if we expect them to do careful data entry for a purpose that is only really clear in light of some abstract library science concepts (URIs and metadata interoperability), they may skip it, or potentially introduce errors. This is why we not only disallow manual entry of ROR IDs, but also bring ROR IDs over automatically from public ORCID profiles.</p>
<p>Of course, once the ROR-linked affiliation is in Janeway, we give authors ample time to see what it looks like in Janeway and make manual changes if needed. Most authors are very happy that it is there&ndash;they have already made the decision to make it public on their ORCID profile&ndash;and it is an extra bonus that they did not have to manually enter it again.</p>
<figure><img src="/img/blog/janeway/janeway-affiliations.png"
         alt="Author affiliations screen in Janeway."/><figcaption>
            <p>You can edit affiliations, add and remove new ones, and re-synchronize with ORCID.</p>
        </figcaption>
</figure>

<figure class="blog-figure"><img src="/img/blog/janeway/janeway-affiliation-details.png"
         alt="Author affiliation details screen in Janeway."/><figcaption>
            <p>You can change job title, department, whether the affiliation is primary, and the start and end dates. Journal editors can also choose to turn off some of these fields when ORCID profiles are linked that show more detailed information.</p>
        </figcaption>
</figure>



<h2 id="how-janeway-integrated-ror">How Janeway integrated ROR 
</h2>
<p>Our design process started as usual with user stories. Janeway editors talked to us about the features they wanted to see, and how they wanted them to work, as did librarians who manage Janeway installations at their institutions.</p>
<figure><img src="/img/blog/janeway/janeway-wireframes.png"
         alt="Mockups of author affiliation user interfaces in Janeway."/><figcaption>
            <p>We went through <a href="https://github.com/openlibhums/janeway/issues/4519">several rounds of wireframes</a> to work out usability details of the interface the authors would see when submitting articles.</p>
        </figcaption>
</figure>

<p>In this case, once we had a sense of user requirements, developer Steph Driver and technology director Andy Byers mocked up a few new possibilities for the author submission page. These went through several design reviews: our technology team, which included our UX researcher Katherine Parker-Hay and our publishing technologies librarian Siobhan Haime, thought through the implications for usability, accessibility, systems design, security, and maintainability.</p>
<p>I then got stuck in on actual <a href="https://github.com/openlibhums/janeway/pull/4483">implementation</a>, beginning with porting our old data to the new data model, writing an interface for backwards compatibility, and creating a command-line interface for server administrators to load the ROR data dump into their Janeway installation and set it up to automatically update every month, with several engineering reviews by Andy and senior developer Mauro Sanchez. I then wrote the <a href="https://github.com/openlibhums/janeway/pull/4697">new user interface</a>, incorporating a feature to implement the Contributor Roles Taxonomy (CRediT) started by Martin Paul Eve at the same time. I also wrote tools for <a href="https://github.com/openlibhums/janeway/pull/4651">auto-matching existing affiliations</a> with ROR records. Finally, we made sure ROR data in Janeway sees the light of day, by updating our DOI deposits, <a href="https://github.com/openlibhums/janeway/pull/4726">metadata streams</a>, reader interface, and plugins.</p>
<figure class="blog-figure"><img src="/img/blog/janeway/olh-institutions-chart.png"
         alt="Chart of institutions represented in OLH journals."/><figcaption>
            <p>We were only able to auto-match part of our dataset, but we can already see the spread of institutions represented in OLH journals. Universities in the middle of the list, with 25-30 affiliations, include the University of Antwerp, the National University of Singapore, and the University of Leicester.</p>
        </figcaption>
</figure>

<p>In the end, implementing ROR was absolutely worthwhile, and also a lot of work. This was not because of ROR &ndash; the ROR data and documentation are exemplary &ndash; but rather because of the need to balance an expanded Janeway data model with backwards compatibility for existing bulk import tools. We also needed to let any user enter and edit custom institution names as before, in case they do not yet exist in ROR, but keep them from changing the ROR-linked records in Janeway, since those names are provided by the ROR authority records automatically loaded in.</p>
<p>The rollout of the integration has gone well, beginning with OLH journals in mid-2025, and then going to our installations at universities in Darmstadt, Ghent, Huddersfield, London, Wales, Dublin, Massachusetts, and Virginia. Our development partner California Digital Library (CDL) also began their rollout by testing and contributing bug reports. The new user interface has been well received, with editors at <em>Glossa</em> and other journals chiming in with suggestions that we were able to implement quickly.</p>
<p>Since October 2025, 74% of articles published have ROR-linked affiliations. For records that we could auto-match in the OLH backlist, the number is 43%. Future plans include improving the auto-matching capabilities using ROR’s latest tools, adding support for translation of organization names in the user interface, and adding features for coauthors.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Joe Muller of the Open Library of Humanities tells us in this case study how and why the Janeway platform took the time to integrate ROR in a way that reduces the burden of data entry for authors, ensuring unambiguous author affiliations by retrieving ROR IDs from ORCID profiles as well as allowing authors to find their institution with a ROR-powered search.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Highlights from the 2026 ROR Annual Community Meeting</title><id>https://doi.org/10.71938/A764-9G57</id><link rel="alternate" href="https://ror.org/blog/2026-03-03-ror-annual-meeting-highlights/"/><published>2026-03-03T00:00:00+00:00</published><updated>2026-03-17T09:09:46-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<!-- Commonly used content 

<figure class="featured-figure"><img src="/img/blog/"/>
</figure>


<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span></span></div>
	</div>
</div>

<figure class="blog-figure"><img src="/img/blog/"/>
</figure>



<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/XXX" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video"></iframe>
</div>


<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div> 
-->
<figure class="featured-figure"><img src="/img/sasreu-lions.jpg"
         alt="Image by Sascha Reuen from Pixabay"/>
</figure>

<p>In the ROR Annual Community Meeting in February, we celebrated seven years of ROR and the ROR community that makes it all possible in three terrific online sessions featuring ROR team members and ROR enthusiasts from all over the world. If you missed any of the sessions, read the highlights and access slides and recordings below, and be sure set yourself a reminder to attend live next year in early 2027!</p>


<h2 id="reflections-on-where-ror-is-now">Reflections on where ROR is now 
</h2>
<p>The annual <a href="https://ror.org/events/2026-02-03-annual-meeting-community-update/">ROR Community Update</a> is always an opportunity for reflection on how far we&rsquo;ve come, but in this seventh edition of the meeting we also took the opportunity to reflect on what has remained the same. In the last seven years, ROR has changed a lot in terms of how we&rsquo;re used, how we operate, and what we offer, but we&rsquo;ve remained true to our guiding vision and values. And of course, one thing that will never change is our perpetual gratitude to all of you around the world who support ROR in so many different ways: from submitting registry updates to integrating ROR in your systems and from telling colleagues and friends about ROR to investing in our sustainability.</p>
<p>Highlights from the community update:</p>
<ul>
<li>Key ROR system integrations in 2025 include <strong>Symplectic, RAID, DOCID, RDA, and OJS 3.5</strong>.</li>
<li>Crossref also added support for <strong>ROR IDs as funder identifiers</strong>, and entities such as Fonds de recherche du Québec and the Worldwide Protein Data Bank registered thousands of new Crossref records with ROR IDs.</li>
<li><strong>The number of Crossref records with ROR IDs more than tripled in 2025</strong>, and the rate of ROR use in DataCite and ORCID records also remains very high.</li>
<li>In 2025, <strong>ROR&rsquo;s curators processed over 12,200 user-submitted curation requests</strong>, adding over 8,500 new relationships and expanding domain metadata.</li>
<li>ROR curators focused on streamlining workflows and increasing global coverage in 2025; <strong>notable regional growth in ROR includes East Asian research organizations</strong>.</li>
<li>ROR&rsquo;s technical team <strong>deprecated v1</strong> of the ROR schema and API in 2025, improved the performance and stability of the ROR API, and launched a new &ldquo;single search&rdquo; name-to-ROR-ID matching strategy.</li>
<li>Technical goals for 2026 include improving curation workflows, enhancing the web search experience, upgrading core technologies, and <strong>making single search the default affiliation matching strategy</strong>.</li>
<li>The timeline for requiring a <strong>client ID</strong> in order to receive the current ROR API rate limits has been extended to Q3 of 2026.</li>
</ul>
<div class="container">

<iframe src="https://docs.google.com/presentation/d/e/2PACX-1vQQCjH1UvvaGwnol0T9AL4zLW8SKTBcVCM0gAf9zOMKeXWXAeKF3lcUMjHJ6FgeRT5N2Z14cipIZ2SE/pubembed?start=false&amp;loop=false&amp;delayms=3000" width="100%" height="700" frameborder="0" allowfullscreen="allowfullscreen" allow="geolocation *; microphone *; camera *; midi *; encrypted-media *" class="responsive-iframe"></iframe>

</div>
<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-brands fa-youtube'></i></div><div class="col-md-10"><span><a href="https://youtu.be/A2niUz4kSBc?si=3KgXLeMErKiQ41Pd">Watch the recording of the Community Update</a></span></div>
	</div>
</div>


<h2 id="ror-funders-and-open-funding-metadata">ROR, funders, and open funding metadata 
</h2>
<p>One of those aforementioned changes in the last seven years is that ROR is now often used as an identifier for organizations that <em>fund</em> research as well as for organizations that <em>perform</em> research. The annual meeting session <a href="https://ror.org/events/2026-02-04-how-ror-can-help-research-funders/">How ROR Can Help Research Funders</a> offered several perspectives on ROR&rsquo;s usefulness in helping funders track the research that has resulted from their support.</p>
<p>Highlights from the funder session:</p>
<ul>
<li>
<p><a href="https://orcid.org/0000-0002-1189-9133">Hans de Jonge</a>, Head of Open Science Policies, <a href="https://www.nwo.nl/en">NWO</a>, made a compelling case for the general need for open funding metadata in scholarly publishing. Shortly after the annual meeting, Hans also wrote a blog post for us explaining <a href="https://doi.org/10.71938/s296-5k62">how and why NWO has integrated ROR into its open API</a>.</p>
</li>
<li>
<p><a href="https://orcid.org/0000-0002-8133-6395">Ana Van Gulick</a>, Head of Customer Engagement for Government, Funders &amp; Nonprofits, <a href="https://www.digital-science.com/">Digital Science</a> and (in absentia) <a href="https://orcid.org/0000-0001-9697-9599">Ishwar Chandramouliswaran</a>, Lead Program Director, <a href="https://datascience.nih.gov/">Office of Data Science Strategy, NIH</a>, both of the <a href="https://datascience.nih.gov/data-ecosystem/generalist-repository-ecosystem-initiative">GREI initiative</a>, explained how and why the seven participating generalist repositories implemented ROR and announced some exciting forthcoming ROR-powered metrics and metadata dashboards.</p>
</li>
<li>
<p>Ami Gala, Sales Director, Americas, <a href="https://altum.com">Altum</a>, gave a detailed overview of how and why the popular Proposal Central grants management system uses ROR.</p>
</li>
</ul>
<div class="container">

<iframe src="https://docs.google.com/presentation/d/e/2PACX-1vT36jeBeF7zZqltSQSeMxIxNLMNF2ZH67mvAZDFAnnav03pgG3ps4OGh-Qq9ALYSvNoLXlrM1k8DORh/pubembed?start=false&amp;loop=false&amp;delayms=3000" width="100%" height="700" frameborder="0" allowfullscreen="allowfullscreen" allow="geolocation *; microphone *; camera *; midi *; encrypted-media *" class="responsive-iframe"></iframe>

</div>
<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-brands fa-youtube'></i></div><div class="col-md-10"><span><a href="https://youtu.be/Iifg7qEDCAQ?si=6fxF6-Qb6-lOHZ3A">Watch the recording of How ROR Can Help Research Funders</a></span></div>
	</div>
</div>


<h2 id="ror-and-national-research-strategies-in-the-asia-pacific-region">ROR and National Research Strategies in the Asia-Pacific Region 
</h2>
<p>As we mentioned in the Community Update, ROR worked hard in 2025 to make ROR a truly global registry: of the 8,000+ organization records we added to ROR last year, nearly 20% represent organizations located in East Asia. The annual meeting session <a href="https://ror.org/events/2026-02-05-ror-in-national-research-strategies/">The Role of ROR in National Research Strategies</a> therefore featured speakers from Japan and Korea as well as from Australia and was held at an APAC-friendly time.</p>
<p>Highlights from the session on national research strategies:</p>
<ul>
<li>
<p><a href="https://orcid.org/0000-0003-1996-4259">Kazuhiro Hayashi</a>, Principal Senior Fellow, <a href="https://www.nistep.go.jp/en/">NISTEP</a>, gave an overview of local PIDs in Japan, explained recent collaborative work to link the NISTEP Dictionary and ROR, and pointed to ROR as an example of an important PID in the proposal for the next STI Basic Plan of the Science Council of Japan.</p>
</li>
<li>
<p><a href="https://orcid.org/0009-0005-6722-7669">Clare Nulley-Valdes</a>, Assistant Director Evaluation and Impact Policy, <a href="https://www.arc.gov.au">Australian Research Council</a>, spoke on Australia&rsquo;s National PID Strategy and ARC&rsquo;s PID Action Plan and reported that ARC has incorporated ROR IDs into its internal systems and will begin requiring researchers to use ROR in funding acknowledgements.</p>
</li>
<li>
<p><a href="https://orcid.org/0000-0003-2700-0601">Jungwoo Lee</a>, Senior Researcher, <a href="https://www.kisti.re.kr/eng/">KISTI</a>, discussed the challenges of building and maintaining an in-house organization registry, reported on a recent ROR / KISTI collaboration to improve coverage of Korean organizations in ROR, and explained how KISTI uses ROR and Web of Science to track Korean research.</p>
</li>
</ul>
<div class="container">

<iframe src="https://docs.google.com/presentation/d/e/2PACX-1vTs999L6viVXnH41GayiTK9Iz4gXHg8J0Y0H9t0wtpqOvN2lWhdMBA2gbjOBSrxVUKlAvBwCYIpDlTn/pubembed?start=false&amp;loop=false&amp;delayms=3000" width="100%" height="700" frameborder="0" allowfullscreen="allowfullscreen" allow="geolocation *; microphone *; camera *; midi *; encrypted-media *" class="responsive-iframe"></iframe>

</div>
<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-brands fa-youtube'></i></div><div class="col-md-10"><span><a href="https://youtu.be/htRFtLwUG8I?si=U13MQF2SEzmlWmbu">Watch the recording of The Role of ROR in National Research Strategies</a></span></div>
	</div>
</div>


<h2 id="looking-ahead">Looking ahead 
</h2>
<p>Change is one of the few constants in life, but what won&rsquo;t change is that ROR will always be community-led and will always be committed to improving our content and services based on your feedback. We&rsquo;ve got plenty to do in the months and years ahead, as you can see from our <a href="https://github.com/ror-community/ror-roadmap">roadmap</a>, but we&rsquo;re still always happy to hear from you about what else you&rsquo;d like to see us do and how we can better support your needs for open, high-quality, interoperable organization information. Join us in 2026 at our bi-monthly Community Calls and other <a href="https://ror.org/events">events</a> and write us at <a href="mailto:support@ror.org">support@ror.org</a> with questions and suggestions.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In the ROR Annual Community Meeting in February, we celebrated seven years of ROR and the ROR community that makes it all possible in three terrific sessions. If you missed any of the sessions, here are the highlights.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Dutch Research Council (NWO) and ROR</title><id>https://doi.org/10.71938/s296-5k62</id><link rel="alternate" href="https://ror.org/blog/2026-02-11-dutch-research-council-and-ror/"/><published>2026-02-11T16:32:15-05:00</published><updated>2026-02-27T10:10:27-05:00</updated><author><name>Hans de Jonge</name><uri>https://orcid.org/0000-0002-1189-9133</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<!-- Commonly used content 

<figure class="featured-figure"><img src="/img/blog/"/>
</figure>


<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span></span></div>
	</div>
</div>

<figure class="blog-figure"><img src="/img/blog/"/>
</figure>



<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/XXX" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video"></iframe>
</div>


<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div> 
-->
<figure class="featured-figure-small"><img src="/img/blog/nwo/nwo-logo.jpg"
         alt="NWO logo"/>
</figure>

<p>The Dutch Research Council (NWO) <a href="https://www.nwo.nl/nieuws/research-organisation-registry-identifiers-geintegreerd-in-open-api">announced recently that it has integrated ROR in its open API</a>. Hans de Jonge, director of Open Science, tells us why and how NWO uses ROR in this latest installment of our <a href="/categories/case-studies">case study series</a>.</p>
<p>See also Hans de Jonge&rsquo;s recent presentation at the ROR Annual Meeting session <a href="/events/2026-02-04-how-ror-can-help-research-funders/">How ROR Can Help Research Funders.</a></p>
<hr>


<h2 id="what-made-you-decide-to-use-ror"><strong>What made you decide to use ROR?</strong> 
</h2>
<p><a href="www.nwo.nl">NWO</a> is the major research funding council in the Netherlands with a longstanding commitment to Open Science. About five years ago, we decided we needed to apply the principles of Open Science not only to our grantees but also—as much as possible—to our own data. We decided to move away from using proprietary data for our annual <a href="https://doi.org/10.5281/zenodo.13885012">open access monitoring</a> and use open metadata only.</p>
<p>We developed an <a href="https://www.nwo.nl/en/how-to-use-the-nwopen-api">open API</a> which makes structured data for all our funded projects and their outputs openly available as open data for anyone to use. Universities use this data for instance in developing institutional dashboards about the external funding they acquire, but the data is also available for researchers.</p>
<p>We also developed a <a href="https://doi.org/10.5281/zenodo.4674512">PID strategy</a> for our organization, outlining that over time we would implement DOIs for outputs, ORCIDs for researchers, <a href="https://doi.org/10.64000/x7d4h-x3r11">Crossref Grant IDs</a> for our awards, and ROR for organizations. We have now integrated ROR into our API, meaning that the affiliations of all researchers in that database (PIs as well as their collaborators) are now identified with ROR.</p>


<h2 id="how-do-you-use-ror"><strong>How do you use ROR?</strong> 
</h2>
<p>For the moment we only use ROR in our NWOpen API. In the back end a database is running which converts organizational information to ROR IDs. The challenge here was that our own organisation master list included both top-level organisation names and child organisations. Quite a bit of manual validation came into this.</p>
<p>The ambition is to integrate ROR more fully in our grant management system. Like many national funding councils, we have a custom-built grant management system called ISAAC,  and we are currently in the process of renewing that system. PID integrations (ROR, ORCID and Grant ID) are high on the list of requirements.</p>
<p>For now we hope that exposing ROR through our NWOpen API will already help users to better identify organizations. Our old field consisted of a mix of top level organizational names and faculties and departments (“University of Utrecht | Faculty of Veterinary sciences | department Clinical Sciences”). Introducing ROR allows people to just search for Utrecht University’s ROR and retrieve <a href="https://nwopen-api.nwo.nl/NWOpen-API/api/Projects?ror_id=04pp8hn57">all funded projects</a>.</p>
<p>We are also considering exposing ROR on our <a href="https://www.nwo.nl/en/projects">project website</a>.</p>


<h2 id="what-were-the-steps-you-took-to-integrate-ror-into-your-systems-and-workflows"><strong>What were the steps you took to integrate ROR into your systems and workflows?</strong> 
</h2>
<p>The honest answer is that it took us quite some time. The difficulty was that it collided with other data-related projects within our organization. But we are very proud to have pulled it off, because it immediately shows people within and outside of our organization the enormous potential of persistent identifiers. We experienced the same thing when we introduced <a href="https://www.nwo.nl/en/news/nwo-funded-research-projects-get-unique-identifier-with-grant-id">Grant IDs</a> two years ago.</p>


<h2 id="anything-more-to-say-about-ror">Anything more to say about ROR? 
</h2>
<p>We are proud to join the growing ROR community. Persistent identifiers are a key enabler in the world of scholarly communication and open science. We are very well aware that we as a funding council have a role to play &ndash; not only in using PIDs to our own benefit but contributing to this landscape by integrating them into our systems and exposing them so all these entities can be better connected to the benefit of an open scholarly ecosystem.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The Dutch Research Council (NWO) announced recently that it has integrated ROR in its open API. Hans de Jonge, director of Open Science, tells us why and how NWO uses ROR in this latest installment of our case study series.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR in 2025: Year in Review</title><id>https://doi.org/10.71938/xwf9-xd80</id><link rel="alternate" href="https://ror.org/blog/2025-12-16-year-in-review/"/><published>2025-12-16T18:12:19-05:00</published><updated>2025-12-19T15:23:03-05:00</updated><author><name>ROR Core Team</name><uri>https://ror.org/about/team/#core-team</uri></author><category term="Year in Review"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/lion-notched-ear.jpg"
         alt="Lion, savannah, wildlife. Image by u_y49ekyffz5 from Pixabay"/>
</figure>

<p>This time last year, when we <a href="https://doi.org/10.71938/h7r4-fd19">reflected on ROR&rsquo;s milestones in 2024</a>, one of the major highlights was the <a href="https://doi.org/10.71938/v406-1m30">launch of version 2 of the ROR schema and API</a>. Now, as we think back on 2025, where we were a year ago makes a nice bookend to one of the major highlights of the current year: last week&rsquo;s <a href="https://doi.org/10.71938/y9pa-8m20">sunset of version 1 of the ROR schema and API</a>. Developing and launching new products and services is both strategically important and satisfying, and we&rsquo;ve certainly done that in 2025, but as any initiative matures, it also becomes essential to put away certain things, clearing the path to sustainable maintenance and renewed innovation.</p>
<p>ROR, in short, is fully grown now: we&rsquo;re well-established as the obvious and trustworthy choice for research organization identification, we&rsquo;re processing more and more (and more and more) requests for changes and additions to the registry, and we&rsquo;ve established systems and processes that let us run like a well-oiled engine.</p>
<p>Fulfilling its <a href="https://doi.org/10.71938/sna1-zc49">original purpose</a>, ROR plays a key part in providing open affiliation metadata for numerous workflows and use cases: to ensure research integrity, manage transformative agreements, track research output, and analyze institutional collaborations. ROR IDs have also proven to be valuable for many more use cases and types of users than anyone originally envisioned. Notably, both publishers and funders have begun using ROR IDs to identify funders, and we expect this use of ROR to increase dramatically in the future, especially as more and more funders register DOIs for awards.</p>
<p>Here&rsquo;s a summary of what ROR did in 2025.</p>


<h2 id="refashioning-rors-schema-and-api">Refashioning ROR&rsquo;s schema and API 
</h2>
<p>Much of the technical work we&rsquo;ve done this year was dedicated to infrastructure and service-level improvements intended to keep our API fast and reliable and our metadata schema comprehensive and flexible in the future.</p>
<p>At the end of July, we <a href="https://ror.readme.io/changelog/2025-07-30-default-response-to-api-requests-is-now-version-2">changed the default response to ROR API queries to return results in version 2</a> of the ROR schema. This significant change went very smoothly, with the vast majority of ROR API users having prepared their ROR integrations for the switch before this date. With this first phase of version 1 deprecation in place, we continued to remind ROR users that version 1 would be sunset in December. By the time the <a href="https://ror.readme.io/changelog/2025-12-04-version-1-of-the-ror-api-and-schema-is-sunsetting">sunset date of December 9th</a> rolled around, fewer than 8% of ROR API users were using version 1. As of this writing we still haven&rsquo;t received any reports of <a href="https://www.seangoedecke.com/good-api-design/">broken userspaces</a>, which makes us very happy! The <a href="https://ror.readme.io/docs/ror-data-structure">current version of the ROR API and schema</a> was designed in collaboration with our community for long-term support. We are glad that its more descriptive and useful form will be the default experience for all users.</p>
<p>Another major technical initiative this year has been the implementation of a new strategy for affiliation string matching, which <a href="https://doi.org/10.71938/zz90-g810">we announced in early December</a>. This new strategy, developed in collaboration with Crossref&rsquo;s Director of Technology, <a href="https://www.crossref.org/people/dominika-tkaczyk">Dominika Tkaczyk</a>, is both faster and more efficient than our existing matching, while also achieving better performance for this task. As such, it is particularly well-suited for large-scale affiliation string matching and was adopted<a href="https://doi.org/10.71938/mtx2-vb16"> by OpenAlex</a> at the start of the year to significantly improve their ROR ID coverage. We look forward to hearing more from ROR API users about this new matching strategy in 2026, which we plan to make the default next year.</p>


<h2 id="ensuring-high-quality-open-metadata-through-community-curation">Ensuring high-quality open metadata through community curation 
</h2>
<p>We continue to see logarithmic growth in <a href="https://github.com/ror-community/ror-updates/issues">requests</a> to create new ROR records and modify existing ROR records, and while we still process many requests from individuals and individual research organizations, we are working more and more with national and international research information initiatives on large-scale curation projects. This year, for instance, we collaborated closely with the Korea Institute of Science &amp; Technology Information (KISTI) on a <a href="https://github.com/ror-community/ror-updates/issues/19804">project to add and modify approximately 450 Korean research organizations</a>, continued our <a href="https://github.com/ror-community/ror-updates/issues/12742">work with the Fundação para a Ciência e Tecnologia</a> to add and modify hundreds of records for research organizations in Portugal, and <a href="https://github.com/ror-community/ror-updates/issues/23636">added and modified many ROR records for African research organizations</a> in collaboration with the Africa PID Alliance.</p>
<p>Just a few of ROR&rsquo;s other collaborative curation projects in 2025 include the following:</p>
<ul>
<li>Adding <a href="https://github.com/ror-community/ror-updates/issues/20835">new Category 2 Centres for UNESCO</a></li>
<li>Working with Japan&rsquo;s National Institute of Science and Technology Policy (NISTEP) to <a href="https://github.com/ror-community/ror-updates/issues/16741">reconcile ROR records with the NISTEP Dictionary of Names of Universities and Public Organizations</a></li>
<li>Adding and updating <a href="https://github.com/ror-community/ror-updates/issues/24294">research infrastructure facilities for Australia&rsquo;s National Collaborative Research Infrastructure Strategy (NCRIS)</a></li>
<li>Working with The National Library of Luxembourg to <a href="https://github.com/ror-community/ror-updates/issues/26521">add and update Luxembourgian organizations</a></li>
<li>Adding <a href="https://github.com/ror-community/ror-updates/issues/18629">Czech Large Research Infrastructures</a> listed in Czechia&rsquo;s national &ldquo;Road Map of Infrastructures&rdquo;</li>
</ul>
<p>If your organization would like to work with ROR on a national-level project, you&rsquo;re always welcome to use our <a href="https://ror.org/registry/#bulk-requests">bulk processing spreadsheets</a> to send us a large number of requests for additions or updates.</p>


<h2 id="supporting-new-users-and-uses-of-ror">Supporting new users and uses of ROR 
</h2>
<p>In 2025, we saw a marked increase in the use of ROR to address key use cases for funding organizations and for funding metadata. Open Funder Registry records have been incorporated into ROR, and ROR is recommended in both the <a href="https://www.crossref.org/documentation/funder-registry/funding-data-overview/">Crossref metadata schema</a> and the <a href="https://support.datacite.org/docs/connecting-to-organizations">DataCite metadata schema</a> as the preferred identifier for funding organizations. Community initiatives such as the <a href="https://barcelona-declaration.org">Barcelona Declaration on Open Research Information</a> are also emphasizing the importance of ROR in bringing broader visibility to open funding metadata to enable downstream reporting and tracking of research.</p>
<p>Soon after Crossref <a href="https://doi.org/10.13003/156081">announced that its members could now use ROR IDs to identify funders</a> in any type of content, for instance, Québec&rsquo;s premier funder Fonds de Recherche du Québec (FRQ) became an enthusiastic adopter of ROR. We <a href="https://github.com/ror-community/ror-updates/issues/20812">worked with FRQ</a> to add and modify ROR records representing the affiliated institutions of the principal investigators on FRQ grants so that FRQ could <a href="https://frq.gouv.qc.ca/en/persistent-unique-identifiers-doi/">register more than 20,000 grants with Crossref</a>, using <a href="https://ror.org/00w3qhf76">its own ROR ID</a> to identify itself.</p>
<p>Throughout 2025, we&rsquo;ve also continued to improve the quality of metadata in <a href="https://github.com/ror-community/ror-updates/issues/20496">Chinese-language records identifying funders</a>. Recently, in advance of the <a href="https://osf.io">Open Science Framework</a>&rsquo;s transition to using ROR IDs for funders, we&rsquo;ve also been working with OSF <a href="https://github.com/ror-community/ror-updates/issues/27414">to review nearly 150 funders</a> and match them to ROR IDs or add them to ROR. We anticipate even more of this work next year as more systems continue to <a href="https://ror.readme.io/docs/funder-registry">use ROR IDs to identify funders</a> for all kinds of DOI content types, including grants.</p>
<p>As for publishing systems, we were thrilled to see the release of PKP&rsquo;s Open Journal Systems (OJS) 3.5 in which <a href="https://docs.pkp.sfu.ca/learning-ojs/about-ojs/en/#other-changes">ROR IDs for author affiliations are now part of the core codebase</a> instead of being managed by a plugin. OJS is used by thousands of small publishers all across the globe, and this update means that these small publishers can more easily identify author affiliations in DOI metadata. Another notable publishing integration is in the <a href="https://www.openlibhums.org/site/janeway/">Janeway</a> system created by the Open Library for the Humanities, where authors who use an ORCID ID to log in during the manuscript submission workflow will see their institutional affiliation automatically added thanks to Janeway&rsquo;s ability to pull that information from the ORCID API using the ROR ID. Both the ORCID ID and the ROR ID are then included in DOI metadata.</p>
<p>Publishing systems certainly aren&rsquo;t the only ones to use ROR. CRIS systems are increasingly standardizing on ROR, including Symplectic Elements, which <a href="https://doi.org/10.71938/1fch-hb40">began to enrich and normalize organization data with ROR</a> this year. We&rsquo;ve also seen the <a href="https://www.raid.org/">Research Activity Identifier (RAID)</a> initiative to identify and register research projects using ROR for <a href="https://metadata.raid.org/en/v1.6/core/organisations.html">organization identifiers in its schema</a> and <a href="https://www.youtube.com/watch?v=3aBjcM3ou1M&amp;list=PL4n_Cvd0PpoHfsM3_6VfhAovGIfL3Z79x">the Research Data Alliance incorporating ROR into its member registration form</a> in a thoughtfully-designed user interface.</p>


<h2 id="contributing-to-key-conversations-and-initiatives">Contributing to key conversations and initiatives 
</h2>
<p>As a community-driven initiative ourselves, we recognize how important it is to do professional service for others and participate in collaborative activities and discussions. In 2025, ROR&rsquo;s participation in such work extended well beyond our own <a href="https://ror.org/events/">Community Calls and events</a>. ROR team members serve on working groups and advisory bodies for the <a href="https://barcelona-declaration.org/">Barcelona Declaration</a>, the <a href="https://niso.org">National Information Standards Organization (NISO)</a>, <a href="https://openalex.org">OpenAlex</a>, the <a href="https://pkp.sfu.ca">Public Knowledge Project (PKP)</a>, and <a href="https://www.cometadata.org/">Collaborative Metadata (COMET)</a>, just to name a few. As a selected infrastructure in the <a href="https://scoss.org/">Global Sustainability Coalition for Open Science Services (SCOSS)</a>, we are also involved with SCOSS events and activities, helping to support the global open infrastructure that makes so many research information services possible.</p>
<p>We also participated this year in the final NSF-funded workshop on <a href="https://ncar.github.io/FAIR-Facilities-Instruments/">FAIR Facilities and Instruments</a>, offering our perspective in particular on <a href="https://doi.org/10.71938/r7ar-eq11">how research facilities can use persistent identifiers</a> to help their important work gain better recognition. Finally, we presented at global conferences such as <a href="https://ror.org/events/2025-05-15-ror-eurocris/">euroCRIS</a>, <a href="https://ror.org/events/2025-06-16-open-repositories">Open Repositories</a>, the <a href="https://ror.org/events/2025-05-28-ssp">Society for Scholarly Publishing (SSP)</a>, and the <a href="https://ror.org/events/2025-06-24-joss">Japan Open Science Summit</a>, reaching a range of communities through these events.</p>
<p>As 2025 comes to a close, we&rsquo;d like to thank all those who have helped bring ROR to this point. We remain grateful for your questions, your requests, your comments, and your contributions, and we remain committed to learning from and supporting the ROR community in 2026, starting with the <a href="https://ror.org/events">ROR Annual Community Meeting sessions</a> on February 3rd and 4th, 2026. See you next year!</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In 2025, ROR deprecated a schema version, launched a new affiliation matching strategy, worked with national and international groups on community metadata curation, supported new users of and use cases for ROR, devoted our time to key research information initiatives, and did much, much more.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">How ROR IDs Help the EarthScope Consortium Track Organizational Partnerships</title><id>https://doi.org/10.71938/t5bn-bm23</id><link rel="alternate" href="https://ror.org/blog/2025-12-12-ror-earthscope-organizational-partnerships/"/><published>2025-12-12T00:00:00+00:00</published><updated>2026-03-17T09:43:56-04:00</updated><author><name>Ted Habermann</name><uri>https://orcid.org/0000-0003-3585-6733</uri></author><author><name>Jim Riley</name><uri>https://orcid.org/0000-0001-8163-5662</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/earthscope/partnerships-chart.png"
         alt="Chart of partnerships between the EarthScope Consortium and other organizations."/>
</figure>

<p>In the past few years, a great deal of attention has been paid to the importance of using ROR IDs to identify organizations that researchers are affiliated with. Less well-known, but equally important, is that organizations can also act as creators or contributors themselves, and these organizational creators and contributors can also be identified with ROR IDs, which is particularly useful for tracking organizational partnerships.</p>


<h2 id="organizational-partners">Organizational Partners 
</h2>
<p><a href="https://datacite.org">DataCite</a> now has over 1,700 members and, as the membership expands, different types of organizations join the DataCite community. Some of these new members are organizations with important partnerships with other organizations. The DataCite <a href="https://schema.datacite.org/">metadata schema</a> for DOIs includes the capability to define <a href="https://datacite-metadata-schema.readthedocs.io/en/4.5/properties/creator/#a-nametype">a type for creators and contributors</a> which can be Personal (default) or Organizational. Personal creators and contributors are typically identified with <a href="https://orcid.org">ORCID</a> iDs, of course, whereas Organizational creators and contributors are identified with ROR IDs.</p>
<p>An organization might be listed as a creator of or contributor to a dataset for a number of reasons, many of which reflect organizational partnerships. An organization that hosts datasets for another organization, for instance, might be listed as an organizational contributor in the metadata for that dataset. How are current DataCite members using ROR IDs to identify and track partnerships with other organizations?</p>
<p>The <a href="https://www.earthscope.org/">EarthScope Consortium</a> operates major scientific facilities focused on Geophysical Sciences, primarily Geodesy and Seismology. These organizations existed independently as UNAVCO, which operated the NSF GAGE facility,  and Incorporated Research Institutes for Seismology (IRIS), which operated the NSF SAGE facility. As of October 2025, these entities merged into the <a href="https://www.earthscope.org/news/earthscope-consortium-awarded-national-geophysical-facility/">National Geophysical Facility</a> with funding from the U.S. National Science Foundation. Since 2016, both facilities have used DataCite DOIs to identify datasets, and they manage significant repositories. The legacy names of these repositories in DataCite metadata are <code>iris.iris</code> (for the NSF SAGE facility operated by IRIS) and <code>unavco.unavco</code> (for the NSF GAGE facility operated by UNAVCO).</p>
<p>EarthScope worked with Metadata Game Changers to increase utilization of identifiers in both repositories during 2025 (<a href="https://doi.org/10.5281/ZENODO.17238701">Habermann and Riley, 2025</a>). Over 450,000 identifiers were added to several fields in the EarthScope repository during this project. It turned out that 97% of the creators in the NSF SAGE repository were organizations, so it provided an opportunity to explore how ROR IDs can be used to track organizational partnerships.</p>
<p>Table 1 shows the organizations that are included as authors in the IRIS repository and the number of times they occur. Together, these organizations account for over 95% of the datasets in this repository. &ldquo;IRIS DMC&rdquo; and &ldquo;Incorporated Research Institutions for Seismology&rdquo; are two legacy names for the same host facility used in the metadata, as indicated by their association with the same ROR ID. The other organizational creators include universities and government agencies that produce specialized geophysical datasets.</p>
<table>
<thead>
<tr>
<th style="text-align:center"><strong>Organization</strong></th>
<th style="text-align:center"><strong>ROR</strong></th>
<th style="text-align:center"><strong>Occurrences</strong></th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align:center">Global CMT Project</td>
<td style="text-align:center"><a href="https://ror.org/04knth696">https://ror.org/04knth696</a>*</td>
<td style="text-align:center">72,644</td>
</tr>
<tr>
<td style="text-align:center">IRIS DMC</td>
<td style="text-align:center"><a href="https://ror.org/05xkn9s74">https://ror.org/05xkn9s74</a> </td>
<td style="text-align:center">13,518</td>
</tr>
<tr>
<td style="text-align:center">Princeton [University]</td>
<td style="text-align:center"><a href="https://ror.org/00hx57361">https://ror.org/00hx57361</a></td>
<td style="text-align:center">9,019</td>
</tr>
<tr>
<td style="text-align:center">Northwestern University</td>
<td style="text-align:center"><a href="https://ror.org/000e0be47">https://ror.org/000e0be47</a> </td>
<td style="text-align:center">6,306</td>
</tr>
<tr>
<td style="text-align:center">Oregon State University</td>
<td style="text-align:center"><a href="https://ror.org/00ysfqy60">https://ror.org/00ysfqy60</a></td>
<td style="text-align:center">2,442</td>
</tr>
<tr>
<td style="text-align:center">Geological Survey of Canada</td>
<td style="text-align:center"><a href="https://ror.org/03wm7z656">https://ror.org/03wm7z656</a> </td>
<td style="text-align:center">2,121</td>
</tr>
<tr>
<td style="text-align:center">U.S. Geological Survey</td>
<td style="text-align:center"><a href="https://ror.org/035a68863">https://ror.org/035a68863</a></td>
<td style="text-align:center">2,061</td>
</tr>
<tr>
<td style="text-align:center">Complete MT Solutions</td>
<td style="text-align:center"></td>
<td style="text-align:center">1,929</td>
</tr>
<tr>
<td style="text-align:center">Incorporated Research Institutions For Seismology</td>
<td style="text-align:center"><a href="https://ror.org/05xkn9s74">https://ror.org/05xkn9s74</a> </td>
<td style="text-align:center">1,890</td>
</tr>
<tr>
<td style="text-align:center"><strong>Total</strong></td>
<td style="text-align:center"></td>
<td style="text-align:center"><strong>111,930</strong></td>
</tr>
</tbody>
</table>
<div class="caption">Table 1. Number of occurrences of organizational authors that occurred more than 1,000 times in the iris.iris repository. *New ROR assigned as part of this project.
</div>
<p>For example, the <a href="https://www.globalcmt.org/">Global CMT Project</a> calculates Centroid Moment Tensors that give key source parameters like magnitudes, fault orientations, and locations for earthquakes all over the world while <a href="https://global.shakemovie.princeton.edu/">Global ShakeMovies</a> generated by Princeton University and given DataCite DOIs are used to visualize ground motion as seismic waves travel around the world.</p>
<figure class="blog-figure"><img src="/img/blog/earthscope/myanmar-shakemovie.gif"
         alt="Earthquake GIF."/><figcaption>
            <p>Part of a Global ShakeMovie for an earthquake in Myanmar, <a href="https://doi.org/10.17611/dp/23119472">https://doi.org/10.17611/dp/23119472</a></p>
        </figcaption>
</figure>

<p>All but one of the organizations listed in Table 1 had ROR IDs when we started this project, and these were added to the DataCite DOI metadata as creator identifiers. The Global CMT Project became a community facility during 2020 and a ROR ID was created for the facility during this project, making it possible to add a creator identifier to over 72,000 datasets created by The CMT Project.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-12"><span>
<h3 id="defensive-metadata-tip-for-organizations">Defensive metadata tip for organizations 
  <a href="#defensive-metadata-tip-for-organizations" aria-label="Defensive metadata tip for organizations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Defensive metadata tip for organizations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h3>
<p>Most ROR IDs in DataCite are currently used to identify creator or contributor affiliations, so many searches for ROR IDs query either the <code>creators.affiliation.affiliationIdentifier</code> field or the <code>contributors.affiliation.affiliationIdentifier</code> field. In the organizational creator case, the ROR ID fits in the <code>creators.nameIdentifiers.nameIdentifier</code> field because it is the identifier for the creator. However, also including the ROR ID as an affiliation identifier ensures that the resource will be discovered regardless of how the search is done.</p>
</span></div>
	</div>
</div>
<p><a href="https://commons.datacite.org">DataCite Commons</a> is a great tool for browsing resources associated with organizations using their ROR IDs. The <a href="https://commons.datacite.org/ror.org/04danrt76">DataCite Commons Earthscope page</a> shows over 124,000 works for this organization (Figure 1, left).</p>
<p>The creators and contributors list shows only creators with ORCID iDs, i.e., people, but the works associated with the organizations in Table 1 can be found using the names of the organizations in the Filter Works box (green box). For example, filtering works for “Princeton” yields 9,020 works, the shake movies in the EarthScope repository authored by Princeton. Clicking the title for a work shows the creator tab which includes Princeton as a creator of the dataset.</p>
<figure><img src="/img/blog/earthscope/datacite-commons.png"
         alt="Side by side screenshots."/><figcaption>
            <p>Figure 1. DataCite Commons pages from EarthScope (left) and Princeton (right) that show ShakeMovies created by Princeton in the EarthScope repository.</p>
        </figcaption>
</figure>

<p>The fact that ROR IDs of creators appear in the Commons output for the repositories that hold the resources and mint the DOIs is great, but it is expected. An unexpected benefit of these ROR IDs is that they create a two-way link between the repository and the partner organization. This link is visible through the DataCite Commons page for Princeton which shows over 13,000 works (Figure 1, right). Filtering these for “ShakeMovie” returns the Global ShakeMovies in the EarthScope repository. The link added to the EarthScope metadata makes the connection to Princeton that is discovered by DataCite Commons.</p>
<p>At the time that these data were collected, the Princeton ROR ID was associated with about 13,000 items discovered by DataCite Commons. Over 9,000 of these, i.e., 68%, are the shake movies in the EarthScope repository. The amazing thing is that these items showed up on the Princeton DataCite Commons “profile” automagically – without Princeton doing anything. This behavior is like items added to DataCite showing up on a researcher’s ORCID profile, but for organizations instead of individuals. The DataCite Commons page for an organization ROR acts like an ORCID profile for an individual.</p>
<figure class="blog-figure"><img src="/img/blog/earthscope/partnerships-chart.png"
         alt="Bubble chart."/><figcaption>
            <p>Figure 2. Resources created by partners in the EarthScope repository and on the partner DataCite Commons pages. The percentage of items on each DataCite Commons page represented by EarthScope resources is shown for each partner.</p>
        </figcaption>
</figure>

<p>Figure 2 shows partner resources in the EarthScope repository and in the partner Commons pages. EarthScope resources represent over 68% of DataCite resources for many partners. In the scenario described here, EarthScope has provided data management services to partners for many years. Adding ROR IDs to the EarthScope metadata for these datasets adds an increasingly important connection service to these partnerships.</p>
<p>We can explore organizational creators in other repositories using this DataCite API query:</p>
<p><code>https://api.datacite.org/dois?&amp;affiliation=true&amp;publisher=true&amp;page%5Bsize%5D=1&amp;query=creators.nameType:Organizational</code></p>
<p>Results from this query indicate that almost 1.4 million records in DataCite have organizational creators. Table 2 shows the ten repositories with the most organizational creators and the number of those creators in each. EarthScope just makes it into the list at number ten!</p>
<p>In a sample of 10,000 records with organizational creators from each of these repositories, the Distributed System of Scientific Collections (<a href="https://www.dissco.eu/">DiSSCo</a>) is the only one besides the SAGE repository that includes identifiers for their organizational creators. The most common creator ROR ID in DiSSCO (<a href="https://ror.org/0566bfb96">https://ror.org/0566bfb96</a>) identifies the Naturalis Biodiversity Center, a biodiversity museum in The Netherlands that coordinates closely with DiSSCo. The <a href="https://commons.datacite.org/ror.org/0566bfb96">DataCite Commons page for The Naturalis Biodiversity Center</a> currently shows 7,036,619 works associated with this ROR ID, with 99% of them created in a major expansion of the DiSSCo repository during 2025. This is a clear bright spot that uses ROR IDs effectively to recognize their partners!</p>
<table>
<thead>
<tr>
<th><strong>Repository Name</strong></th>
<th><strong>DataCite Repository ID</strong></th>
<th><strong>Number of Records in Repository</strong></th>
</tr>
</thead>
<tbody>
<tr>
<td><a href="https://commons.datacite.org/repositories/zr3cyxn">Distributed System of Scientific Collections (DiSSCo)</a></td>
<td>ylqb.ybhfwy</td>
<td>7,036,619</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/4ygcg72">Global Biodiversity Information Facility</a></td>
<td>gbif.gbif</td>
<td>4,108,688</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/3znc71">International Treaty on Plant Genetic Resources for Food and Agriculture</a></td>
<td>fao.itpgrfa</td>
<td>1,911,979</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/or2c5dv">Genebank Information System of the IPK Gatersleben</a></td>
<td>ipk.gbis</td>
<td>209,708</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/y79cxl">Plutof. Data Management and Publishing Platform</a></td>
<td>estdoi.bio</td>
<td>3,201,243</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/k6qcx2">Leibniz-Institut für Astrophysik Potsdam (AIP)</a></td>
<td>tib.aip</td>
<td>279,264</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/6eycrn">HEPData</a></td>
<td>cern.hepdata</td>
<td>177,346</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/y79c166">TERN IGSN ID Catalogue</a></td>
<td>tern.igsn</td>
<td>130,741</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/wqjcn8j">Hochschularchiv SRP</a></td>
<td>stdp.jwqddo</td>
<td>124,452</td>
</tr>
<tr>
<td><a href="https://commons.datacite.org/repositories/rv5cenq">NSF Seismological Facility for the Advancement of Geoscience (SAGE)</a></td>
<td>iris.iris</td>
<td>117,674</td>
</tr>
</tbody>
</table>
<div class="caption">Table 2. Top ten DataCite repositories with Organizational creator nameTypes and the number of records in each with that type.
</div>


<h2 id="conclusion">Conclusion 
</h2>
<p>ROR IDs can serve as identifiers for research organizations in several roles. Historically attention has focused on organizations that researchers are affiliated with, but organizations can also act as creators or contributors themselves. This information can be reflected in research metadata such as in the DataCite metadata schema for DOIs. Over 1.4 million DataCite resources have organizations as creators and over 44 million have organizations as contributors, providing rich opportunities for DataCite members to take advantage of ROR IDs in their metadata.</p>
<p>Eight organizations account for over 110,000 resources in the EarthScope repository and seven of these had ROR IDs at the beginning of this project. We worked with ROR to create a new ROR for the remaining organization (the Global CMT Project) to complete the identifier coverage.</p>
<p>Adding identifiers for these organizations to the EarthScope repository represents an important “connection service&quot; provided to these partners by EarthScope and inherent in the two-way nature of ROR IDs and other identifiers. This kind of connection occurs for all ROR IDs included in DataCite DOI metadata, and DataCite Commons provides an auto-update of resources associated with organizations like the auto-update capabilities for ORCID iDs for researchers. If you are an organization that hosts, creates or contributes to datasets that have DOIs registered in DataCite, make sure your ROR ID gets into the metadata so that your resources are visible through DataCite Commons and discoverable in DataCite’s APIs.</p>
<figure class="blog-figure"><img src="/img/blog/earthscope/metadata-game-changers.png"
         alt="Metadata Game Changers logo"/>
</figure>

<figure class="blog-figure"><img src="/img/blog/earthscope/earthscope-logo.png"
         alt="EarthScope Consortium logo"/>
</figure>

<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Comments or questions? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Learn how ROR IDs can help track organizational partnerships in this deep dive into work that Metadata Game Changers did with the EarthScope Consortium to add identifiers to DataCite DOI metadata for geophysical datasets.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Announcing a New Affiliation Matching Strategy in the ROR API</title><id>https://doi.org/10.71938/zz90-g810</id><link rel="alternate" href="https://ror.org/blog/2025-12-02-announcing-a-new-affiliation-matching-strategy/"/><published>2025-12-02T09:27:10-05:00</published><updated>2025-12-02T18:23:33-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<!-- Commonly used content 

<figure class="featured-figure"><img src="/img/blog/"/>
</figure>


<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span></span></div>
	</div>
</div>

<figure class="blog-figure"><img src="/img/blog/"/>
</figure>



<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/XXX" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video"></iframe>
</div>


<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div> 
-->
<figure class="featured-figure"><img src="/img/bibake-uppal-83uG2S0aT_I-unsplash.jpg"
         alt="Leopard standing on tree branch. Photo by Bibake Uppal on Unsplash."/>
</figure>

<p>We are delighted to announce the launch of a new matching strategy in the ROR API to help you match complex affiliation text strings to ROR IDs at scale.</p>
<p>Publishing and repository systems have traditionally stored author and affiliation information as unstructured data, capturing intermingled details about an organization as a single text string in a single field. This unstructured information makes it difficult to reliably connect research outputs with associated institutions.</p>
<p>Many repositories and publishers are <a href="https://bit.ly/ror-integrations">beginning to use ROR instead,</a> recognizing the importance of producing and sharing open, structured affiliation metadata, and here at ROR we continue to encourage Crossref and DataCite members in particular to include ROR IDs in DOI metadata. Often, a crucial step for Crossref and DataCite members in ROR adoption is to convert text strings extracted from submitted manuscripts or stored in legacy data to ROR IDs, and this is one reason the affiliation matching endpoint is the <a href="https://p.datadoghq.eu/sb/db1aec04-0c1a-11ec-860a-da7ad0900005-7d7c572812608235cca3359ee5ec591a">most-used endpoint of the ROR API</a>.</p>
<p>We know that accurately matching text strings to ROR IDs is a priority for many users, and therefore we have been continuing to develop ways of meeting this need with our API and with other services.</p>


<h2 id="how-and-why-we-developed-the-new-affiliation-matching-strategy">How and why we developed the new affiliation matching strategy 
</h2>
<p>ROR&rsquo;s first affiliation matching strategy, developed by Crossref’s Director of Technology, <a href="https://www.crossref.org/people/dominika-tkaczyk/">Dominika Tkaczyk</a>, was <a href="https://doi.org/10.71938/36jw-rs79">announced in 2019</a>. It has been available in the ROR API ever since. This strategy works by breaking long affiliation strings into substrings and making multiple queries against the ROR registry. If a match is found, the ROR API returns the ROR record identified with a <code>chosen:true</code> indicator.</p>
<p>Beginning in 2023, the ROR team began collaborating with <a href="https://crossref.org">Crossref</a> again to develop a new matching strategy that would be both faster and more precise at scale. Throughout 2024, the principles behind this collaborative research were explained in a popular <a href="https://ror.org/tags/matching/">series of blog posts about matching on the ROR blog</a> co-written by ROR Product Manager Adam Buttrick and Dominika Tkaczyk.</p>
<p>Early in 2025, we <a href="https://doi.org/10.71938/mtx2-vb16">reported</a> that this collaboration between ROR and Crossref had produced a new, faster, more precise strategy for matching affiliation strings to ROR IDs. Unlike the original &ldquo;multisearch&rdquo; matching strategy, the new strategy performs a single search of the ROR registry to find the most likely match, and therefore we&rsquo;ve dubbed it &ldquo;single search.&rdquo;</p>
<p>OpenAlex quickly incorporated this new strategy into <a href="https://ror.org/blog/2023-09-13-openalex-case-study/">its own affiliation matching</a>, and Crossref used it to produce a <a href="https://doi.org/10.5281/zenodo.15254992">dataset</a> of 95 million affiliation strings from its metadata matched to ROR IDs. At ROR, we then began the work to include this strategy in our own API, which was recently completed and released last month. Like all of ROR’s code and data, <a href="https://gitlab.com/crossref/marple/-/tree/main/strategies_available/affiliation_single_search">the single search strategy</a> is freely and openly available.</p>


<h2 id="benefits-of-the-new-affiliation-matching-strategy">Benefits of the new affiliation matching strategy 
</h2>
<p>In a demonstration of the single search strategy given at the <a href="https://ror.org/events/2025-11-19-ror-community-call/">November 2025 ROR Community Call</a>, Adam Buttrick explains that the new strategy has been designed to reduce the rate of false positive matches, because we know that accuracy is of primary importance for unsupervised automated matching. The single search strategy is more computationally efficient, better able to find relevant results, and more accurate than the already high-performing multisearch strategy.</p>
<p>In just one of several examples drawn from real Crossref metadata, Adam shows in the below video that a string such as &ldquo;Department of Cognitive Sciences, School of Social Sciences, University of California, Irvine, California;&rdquo; is not matched by the current multisearch affiliation strategy on the ROR API but is correctly matched by the new single search strategy to the ROR ID for the University of California, Irvine: <a href="https://ror.org/04gyf1771">https://ror.org/04gyf1771</a>.</p>

<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/DC7mZSnECsQ" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title=" ROR single search affiliation matching strategy demo Nov 2025"></iframe>
</div>



<h2 id="using-the-new-affiliation-matching-strategy">Using the new affiliation matching strategy 
</h2>
<p>To try out the new single search matching strategy, append the term <code>&amp;single_search</code> to a ROR API <code>?affiliation</code> query as described in our <a href="https://ror.readme.io/docs/api-affiliation#/single-search-strategy">updated documentation</a>.</p>
<p>Note that the single search strategy is currently in a preview period, so the existing <a href="https://ror.readme.io/docs/api-affiliation#/multisearch-strategy">multisearch matching strategy</a> is still the default for <code>?affiliation</code> queries. Both matching strategies will remain available in the ROR API for the foreseeable future, but <strong>we plan to make the single search strategy the default for <code>?affiliation</code> queries in the first quarter of 2026.</strong></p>
<p>To give us feedback, you can open an <a href="https://github.com/ror-community/ror-roadmap/issues">issue on our roadmap</a> or contact us at <a href="mailto:support@ror.org">support@ror.org</a>. Please note that any individual string might be less well matched using the new strategy than using the existing strategy, so we are most interested in tests that use sizable open datasets.</p>
<p>ROR&rsquo;s essential purpose is to improve metadata throughout the scholarly information ecosystem so that reliable connections can be made between organizations and the research they support. Clean, normalized, structured, open organization information helps ensure research integrity and enables better tracking and analysis of research. Our hope is that by constantly improving ROR&rsquo;s systems and services we can improve scholarly metadata for everyone.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We are delighted to announce the launch of a new matching strategy in the ROR API to help you match complex affiliation text strings to ROR IDs at scale.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Cambridge University Press and ROR</title><id>https://doi.org/10.71938/xysb-vp37</id><link rel="alternate" href="https://ror.org/blog/2025-10-22-cambridge-university-press-ror/"/><published>2025-10-22T00:00:00+00:00</published><updated>2025-10-22T13:59:33-04:00</updated><author><name>John Lewis</name><uri>https://ror.org/03jsdjx34</uri></author><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>John Lewis, Data Operations Manager at <a href="https://cambridge.org">Cambridge University Press</a>, tells us why and how Cambridge uses ROR in this latest installment of our <a href="/categories/case-studies">case study series</a>.</p>


<h2 id="what-made-you-decide-to-use-ror">What made you decide to use ROR? 
</h2>
<p>Cambridge University Press originally adopted GRID. After ROR became the primary community-driven, open registry of persistent identifiers (PIDs) for organisations producing research, Cambridge adopted ROR IDs into publishing workflows that require a unique identifier for institutions. A PID for institutions associated with the research we publish makes publishing services more efficient, streamlines business analysis and is essential for accurate business insights.</p>
<p>As the academic publishing industry pivots towards an Open Access publishing model, PIDs are becoming even more crucial in the nuanced delineation of research funding bodies, institutions funding the publishing of academic research and the relationships they have with the authors conducting the research.</p>


<h2 id="how-do-you-use-ror">How do you use ROR? 
</h2>
<p>ROR is a critical facet in our analytical toolkit; understanding the totality of our relationship with a research body regardless of how we interact with them is impossible without a persistent identifier drawing our immense and very different data holdings together. As developments in AI continue to influence present strategy and future thinking on the efficacy of publishing models, bad data becomes a business risk. Too often, data attributes associated with research bodies are uncertain, inadequately classified and not easily resolvable. Poorly maintained data holdings prevent the dynamic opportunities AI offers to the industry.</p>
<p>Using ROR as a PID in our systems redefines our data; it becomes a new, exciting asset rather than an artefact of the publishing process. PIDs are a simple, yet very powerful catalyst to creating insights which enables publishers like Cambridge to make objective data-driven decisions; to define, assess, refine and deliver strategic goals and to make publishing programs as successful, rigorous, efficient and cost-effective as possible.</p>


<h2 id="what-were-the-steps-you-took-to-integrate-ror-into-your-systems-and-workflows">What were the steps you took to integrate ROR into your systems and workflows? 
</h2>
<p>The use of ROR in our publishing processes and the application of it to our data holdings is an ongoing process. It affects multiple workflows, databases, repositories, dashboards, publishing tools, and extensive examination of our data holdings associated with research institutions across systems used for different aspects of publishing and providing access to research.</p>
<p>Cambridge, like many academic publishers, has existed for quite some time and our data holdings are vast, so our data strategy adapts all the time as we assess each system, repository and data asset. Cambridge is modernising and upgrading many of its publishing tools; making sure the metadata that defines our relationships with publishing partners is unambiguous is a priority.</p>
<p>Over the years, institutions have been referred to in myriad ways, for lots of reasons, not least system variances, interactivity with other technologies, human intervention and the evolving cultural, historical and political environments academic research organisations exist within. The relationships are not always obvious, but this is precisely what the use of a PID like ROR is attempting to overcome.</p>


<h2 id="what-else-would-you-like-to-say-about-ror">What else would you like to say about ROR? 
</h2>
<p>Cambridge is proud to be a member of the <a href="https://groups.google.com/a/ror.org/g/ror-communit">ROR community</a> and to contribute to the adoption of ROR by associating it with the metadata for our published research and incorporating it into our workflows. A global, open, community-led registry that enables precise associations between researchers, their research and the institutions associated with it naturally allies with our continuing commitment to being a diligent contributor to the academic publishing community and helps us to realise our goals and ambitions as an Open Access publisher.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions or if you&rsquo;d like to contribute to the ROR blog.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">John Lewis, Data Operations Manager at Cambridge University Press, tells us why and how Cambridge uses ROR in this latest installment of our short case study series.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Journey of a Curation Request: What Happens When You Ask for an Update to ROR</title><id>https://doi.org/10.71938/T128-EA02</id><link rel="alternate" href="https://ror.org/blog/2025-10-08-journey-of-a-curation-request/"/><published>2025-10-08T00:00:00+00:00</published><updated>2025-11-05T16:17:42-08:00</updated><author><name>Riley Marsh</name><uri>https://orcid.org/0009-0005-2271-4033</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/journey-curation-request/ror-curation-workflow.png"
         alt="Schematic of a workflow."/>
</figure>

<p>Have you ever wondered exactly what happens once you request a new ROR record or suggest a change to an existing ROR record? In this blog post, we take you through all the steps involved in ROR&rsquo;s open, community-driven process for making sure that the information in the Research Organization Registry is complete and accurate.</p>
<hr>
<p>ROR&rsquo;s <a href="https://ror.org/registry/#submitting-registry-updates">community-driven process</a> for maintaining high-quality information about research organizations in our registry means that anyone can request a new ROR record or suggest a change to an existing one. Our curatorial process for assessing these requests involves both human and automatic review, and all this review happens in the open so that anyone can follow along. Here&rsquo;s a deeper dive into how that works.</p>


<h2 id="step-1-you-submit-a-request">Step 1: You submit a request 
</h2>
<p>Requests come in two main forms: individual requests and bulk requests. In addition to processing requests, ROR curators also carry out independent quality assurance projects, which fall outside the scope of this post.</p>
<p>Individual requests are typically submitted through our <a href="https://curation-request.ror.org/">public curation request form</a>. The two most common actions are to add a new organization or change an existing record; however, you can also request to deprecate an existing ROR record, merge two or more ROR records, or split an existing ROR record into two or more records. Whatever the request type, the best requests have the most complete information possible, which speeds up processing time. There is a human behind every review, and so the more information is sent, the more context ROR curators have to process the request, and the less time they have to spend seeking further information.</p>
<p>For new record requests, it is <a href="https://ror.org/about/faqs/#is-my-organization-in-scope-for-ror">particularly important to provide multiple DOIs or URLs for research publications</a> that use the organization as an affiliation or that acknowledge the organization in some way, such as for funding, in order to meet ROR&rsquo;s definition of what constitutes a &ldquo;research organization.&rdquo; For all other types of requests, providing a link for evidence that a change should be made, such as a name change announcement on the organization’s website, is also very helpful for faster processing.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span>Read more about ROR&rsquo;s criteria for inclusion in the <a href="#step-4-curators-review-the-request">section on curator review</a> below or read the <a href="https://ror.org/about/faqs/#is-my-organization-in-scope-for-ror">detailed public criteria we provide to our curators</a> to help them evaluate requests to add new organizations to ROR.</span></div>
	</div>
</div>
<p>If you are having difficulty submitting through our form, you can always create your own request directly on GitHub if you have an account. We have <a href="https://github.com/ror-community/ror-updates/issues/new/choose">templates</a> available for adding a new organization or modifying an existing record.</p>
<p>Bulk requests for larger sets of organizations can be submitted via email through a <a href="https://ror.org/registry/#bulk-requests">spreadsheet template</a>. There are two spreadsheets available—one for adding new organizations and one for updating existing records—so please choose the correct spreadsheet for your requests or send us both spreadsheets if you have both new and update requests. Instructions for how to complete the spreadsheets are in the headers of each column.</p>


<h2 id="step-2-a-github-issue-is-created-for-the-request">Step 2: A GitHub issue is created for the request 
</h2>
<p>Behind the scenes, your request is automatically converted into a GitHub issue. This means it’s immediately visible in our <a href="https://github.com/ror-community/ror-updates/issues">public curation updates tracker</a>, where anyone can see requests and anyone with a GitHub account can comment. Because transparency is central to the ROR model, all requests and discussions are open by default. Personal information from the requester never gets published or shared. ROR currently receives approximately 2000 individual submissions per month in addition to bulk requests of various sizes. Processing for individual requests takes about four to six weeks, depending on complexity.</p>
<figure><img src="/img/blog/journey-curation-request/ror-updates-tracker.png"
         alt="ROR&#39;s public curation updates tracker on GitHub allows anyone to see requests for changes and additions to ROR."/><figcaption>
            <p>ROR&rsquo;s <a href="https://github.com/ror-community/ror-updates/issues/">public curation updates tracker on GitHub</a> allows anyone to see requests for changes and additions to ROR.</p>
        </figcaption>
</figure>

<p>For bulk requests, a single project issue is created, and when it is processed, each row in the spreadsheet is converted into individual sub-issues linked to the project. Because bulk requests range in size from tens to thousands of requests, they can take a longer time to process compared to individual requests.</p>
<p>You will receive an email confirmation with a link to your new issue on GitHub where you can follow along for next steps. If you have any additional context later, such as additional links to research or a correction to the form, please comment on your GitHub issue. Again, this requires a GitHub account, but they are free and easy to make. As always, you can also email <a href="mailto:support@ror.org">support@ror.org</a> if you have questions or additional information at any point in the process.</p>


<h2 id="step-3-automatic-tools-do-initial-triage-of-the-request">Step 3: Automatic tools do initial triage of the request 
</h2>
<p>Next, ROR’s automated tools perform initial checks on the request. These are scripts and workflows we’ve developed that take the form of &ldquo;bots&rdquo; that you might see interacting with your GitHub issue. A ROR curation bot helps us format the human-submitted data in the request by deleting extra fields, fixing formatting so it’s ready to process, and adding language tags. You can see the changes this bot makes in a comment below the issue, where the red highlight with the minus (-) symbol indicates what’s been deleted and the green highlight with the plus (+) symbol indicates what’s been added.</p>
<figure><img src="/img/blog/journey-curation-request/ror-triage-bot.png"
         alt="A ROR curation bot formats the data in a request to add Manila Doctors Hospital to ROR."/><figcaption>
            <p>A ROR curation bot formats the data in a <a href="https://github.com/ror-community/ror-updates/issues/23765">request to add Manila Doctors Hospital to ROR</a>.</p>
        </figcaption>
</figure>

<p>ROR curation bots also find information that helps with curator review and metadata quality. For example, you might see a bot find a <a href="https://geonames.org">GeoNames</a> ID for the city and country provided, flag potential duplicates in ROR, check for additional identifiers like an ISNI or Wikidata ID, or look for research where the organization is used as an affiliation.</p>
<figure><img src="/img/blog/journey-curation-request/ror-curator-bot.png"
         alt="A ROR curation bot helps find additional information to help curators review a request to add University of the City of Muntinlupa to ROR."/><figcaption>
            <p>A ROR curation bot helps find additional information to help curators review a <a href="https://github.com/ror-community/ror-updates/issues/17885">request to add University of the City of Muntinlupa to ROR</a>.</p>
        </figcaption>
</figure>

<p>These automated steps help ROR curators spend their time on meaningful review rather than on data clean-up. Our curators are always looking for ways to improve these processes by refining and building new curation bots and scripts, all of which can be found in our <a href="https://github.com/ror-community/curation_ops">Curation Ops GitHub repository</a>.</p>


<h2 id="step-4-curators-review-the-request">Step 4: Curators review the request 
</h2>
<p>After automated triage, human curators step in to review the request in detail to ensure it is appropriate for ROR&rsquo;s scope and consistent with other records. Metadata in each issue is reviewed and corrected to conform with ROR’s <a href="https://github.com/ror-community/ror-updates/wiki/ROR-Metadata-Policies#policies-for-specific-metadata-elements">metadata policies</a> and issue formatting.</p>
<p>ROR&rsquo;s definition of &ldquo;research organization&rdquo; is broad by design: ROR strives to be a truly global registry, one that includes research organizations of many kinds from all over the world, so that these organizations can reliably be connected to the research outputs and activities they support. At the same time, ROR also upholds a high degree of consistency and integrity of the registry as a trusted and widely adopted source of institutional data. Therefore, for new organizations to be included in ROR, <strong>one of the most important criteria is whether the organization is cited in published research.</strong></p>
<p>An organization should have clear reference to its activities in sources like contributor affiliations and funding acknowledgements in public research outputs to be considered a &ldquo;research organization&rdquo; that is in scope for ROR. If no links to research are provided in the request for a new ROR ID, if only links to self-published research are provided, if the organization is neither used as an affiliation nor acknowledged as a source of funding in open research repositories and knowledge graphs, and if no research is listed on the organization&rsquo;s website, the request to add that organization to ROR will be declined. Of course, organizations may also reapply once ROR requirements are met.</p>
<p>In addition, since ROR IDs are persistent identifiers intended for long-term stability, <strong>ROR IDs identify organizations with long-term stability.</strong> Thus, ROR also requires that an organization is used as an affiliation in published research outputs by multiple people, indicating that it is not a one-person organization or consultancy that will not exist past the lifetime of a single person. Similarly, ROR expects instances of mention in multiple research outputs. Requests that have been declined can always be reconsidered if the requestor can provide evidence of multiple instances of research usage with multiple authors.</p>
<p>For changes to existing organizations, we verify the information against reliable public sources like the organization’s website, authority files, and national-level registries; we sometimes also reach out to the requester for clarification. The updates are then coded in the issue itself. You will see a string following “Update:” for changes to the record or “Related organizations:” for changes to the record relationships.</p>
<figure><img src="/img/blog/journey-curation-request/ror-update-related.png"
         alt="A request to update the existing ROR record for Laboratoire Navier gets a new acronym and a new parent organization."/><figcaption>
            <p>A <a href="https://github.com/ror-community/ror-updates/issues/25907">request to update the existing ROR record for Laboratoire Navier</a> gets a new acronym and a new parent relationship.</p>
        </figcaption>
</figure>

<p>Any approved request will be moved to “Ready for sign-off / metadata QA” in our ROR Updates project board in GitHub. When a request is ambiguous or complex, it may be marked as “Second Review” or “Needs Discussion” and referred to the <a href="https://ror.org/registry/#curation-advisory-board">Curation Advisory Board</a> for asynchronous review. The board also meets monthly to discuss tricky cases and broader policy questions.</p>
<p>Read more about our <a href="https://github.com/ror-community/ror-updates#ror-scope">scope, criteria for inclusion, and curator workflows</a>.</p>


<h2 id="step-5-curators-prepare-the-release">Step 5: Curators prepare the release 
</h2>
<p>Every two to four weeks, all reviewed and approved requests are grouped into the next ROR data release. All metadata is double-checked using both automated scripts and manual review. This ensures that names, identifiers, and other fields are consistent across the registry and that everything aligns with ROR’s metadata policies. Additional checks are run to ensure there are no duplicates and that the metadata passes various quality and integrity validations.</p>
<p>Metadata from issues is then extracted programmatically to create new and updated records, turning the approved changes into JSON files that follow the ROR format. These files are then further refined in our release data pipeline, where we do things like update the date last modified and build new relationships between organizations that are again reviewed. Once the draft release passes those checks, we upload it to the staging environment to test it to make sure the new and updated records show up correctly in the API and search before moving everything into production.</p>


<h2 id="step-6-a-new-release-of-ror-is-published">Step 6: A new release of ROR is published 
</h2>
<p>And then, hooray! A <a href="https://github.com/ror-community/ror-updates/releases/">new release</a> of ROR is published and becomes part of the open, global registry.</p>
<p>The release is then announced across our community channels, including on social media – <a href="https://www.linkedin.com/company/ror-research-organization-registry/">LinkedIn</a>, <a href="https://bsky.app/profile/researchorgs.bsky.social">Bluesky</a>, and <a href="https://mastodon.social/@ResearchOrgs">Mastodon</a> – and the <a href="https://groups.google.com/a/ror.org/g/ror-tech">ROR Technical Forum</a>. After the release goes live, you’ll see a comment on your GitHub issue letting you know it’s been published.</p>
<figure><img src="/img/blog/journey-curation-request/ror-new-id.png"
         alt="A post-release comment is added to a request for a ROR ID for the Altamash Institute of Dental Medicine."/><figcaption>
            <p>A post-release comment is added to a <a href="https://github.com/ror-community/ror-updates/issues/25783">request for a ROR ID for the Altamash Institute of Dental Medicine</a>.</p>
        </figcaption>
</figure>



<h2 id="the-best-requests">The best requests 
</h2>
<p>Don’t forget that the best way to ensure faster processing of your request is to submit accurate, clean, and complete metadata! Please follow along on the journey of your own requests directly on GitHub to see updates, comments, and the decision-making process described here. We are always available to help with questions about requests at <a href="mailto:support@ror.org">support@ror.org</a>. In our next post in this series, we&rsquo;ll go into more detail about how to submit an accurate, clean, and complete ROR request!</p>


<h2 id="your-requests-help-make-ror-great">Your requests help make ROR great 
</h2>
<p>Now, the next time you submit a new request or think about suggesting a change, you’ll know exactly what happens behind the scenes. By keeping curation open and collaborative while applying careful checks and rigorous processes, ROR ensures that the registry remains complete, accurate, and trusted by the entire research community.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions or comments about the ROR curation process? Write us at <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Have you ever wondered exactly what happens once you request a new ROR record or suggest a change to an existing ROR record? In this blog post, we take you through all the steps involved in ROR's open, community-driven process for making sure that the information in the ROR registry is complete and accurate.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Advancing Open PID Infrastructure at Scale: A New Face and Exciting Projects for the ROR Technical Team</title><id>https://doi.org/10.71938/J5DS-GC46</id><link rel="alternate" href="https://ror.org/blog/2025-10-02-welcome-joseph-rhoads/"/><published>2025-10-02T17:49:25-04:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/lion-9574689_1280.jpg"
         alt="Image by Armin Forster from Pixabay"/>
</figure>

<p>We are thrilled to introduce the newest member of the ROR pride: <a href="/authors/joseph-rhoads/">Joseph Rhoads</a> joined the ROR team in September as our new Technical Lead.</p>
<p>Joseph follows in the footsteps (pawprints?) of long-time Technical Lead <a href="/authors/liz-krznarich">Liz Krznarich</a>, who supported ROR’s foundational technical development over the past several years to strengthen and scale the registry’s infrastructure and implement key initiatives such as our <a href="/blog/2022-03-17-first-independent-release/">independent curation workflow</a> and <a href="/blog/2024-04-15-announcing-ror-v2/">version 2 of the ROR schema and API</a>. Earlier this year, Liz embarked on a new endeavor in the world of geology and natural history to be a software developer for the <a href="https://home.wgnhs.wisc.edu/">Wisconsin Geological and Natural History Survey</a> (ROR ID <a href="https://ror.org/02kj3rm24">https://ror.org/02kj3rm24</a>) in her home state. We are immensely grateful for Liz’s many contributions to ROR over the years, and for setting ROR up for sustained technical stability and success into the future.</p>
<p>Originally from the US but currently based in Valencia, Spain, Joseph joins ROR from <a href="https://datacite.org">DataCite</a>, where since 2021 he has been supporting software engineering various DataCite technical projects and services, including DataCite Commons. Prior to joining DataCite, Joseph worked at Brown University Library, where he gained extensive experience in repository management and institutional research monitoring, both of which will be valuable perspectives to bring to ROR. Joseph will continue to be based at DataCite as he assumes this new role, according to the <a href="/about/#governance">shared resourcing and governance model</a> that ROR’s three operating organizations have adopted.</p>
<p>Joseph says, &ldquo;I’m excited to dig in to the work of supporting this important open infrastructure. This means focusing my energy on scaling our core systems, ensuring data integrity for the registry, and maximizing interoperability to keep ROR at the heart of global scholarly communication.&rdquo;</p>
<p>As we invite our community to join us in welcoming Joseph to the team, we also want to take this opportunity to highlight a few key technical projects that are underway this year and remind everyone what to look out for and how to get more involved.</p>


<h2 id="schema-version-changes">Schema version changes 
</h2>
<p>As we’ve shared in <a href="/tags/schema/">previous posts</a> and community calls, <a href="/blog/2025-06-11-v1-sunset/">ROR schema version 2 is now the default as of July 31 of this year, and we will officially deprecate schema version 1 in December 2025</a>. If you have questions about what this means for you and/or how to update your integrations as a result of these changes, check out our <a href="https://ror.readme.io">documentation</a> and get in touch with any questions via our technical forum or <a href="mailto:support@ror.org">support@ror.org</a>.</p>


<h2 id="implementation-of-api-client-ids">Implementation of API client IDs 
</h2>
<p>In order to support and scale growing usage of the ROR API while also ensuring it can continue to remain open and free for everyone and anyone to use, we will be implementing new rate limits in early 2026 and enabling a client identification workflow for users to obtain higher rates. <a href="https://ror.readme.io/docs/client-id#/">Learn more about what this means for you, and how to get started by obtaining a client ID</a>.</p>


<h2 id="updates-to-affiliation-matching">Updates to affiliation matching 
</h2>
<p>Affiliation matching is the number-one use of the ROR API, and we’ve been making ongoing tweaks over the years to refine our matching API to support various use cases as more and more systems integrate with ROR. Following <a href="/blog/2025-01-27-faster-affiliation-matching/">development of a new matching strategy at Crossref</a>, we are now working to adapt this for ROR to make our affiliation matching service even more useful and performant.</p>


<h2 id="join-us">Join us 
</h2>
<p>These projects and other technical activities have all been - and continue to be - shaped by active community input and collaboration. We take our position as community-driven open infrastructure very seriously, and that means working directly with users and our wider network of stakeholders to inform our development work, align our services to address concrete user needs, and deliver value at scale for our global community.</p>
<p>If you haven’t already gotten involved in our work, there are many ways to do so, including by joining our <a href="/events">bimonthly community calls</a>, by signing up for the <a href="https://groups.google.com/a/ror.org/g/ror-tech">ROR Technical Forum</a> and/or <a href="https://groups.google.com/a/ror.org/g/ror-community">ROR Community Forum</a>, by <a href="https://curation-request.ror.org">suggesting registry additions and updates</a>, and by <a href="https://github.com/ror-community/ror-roadmap">submitting bug reports and feature requests</a>.</p>
<p>Welcome, Joseph, and thanks to everyone for participating in this journey together!</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We are thrilled to introduce the newest member of the ROR pride: Joseph Rhoads joined the ROR team in September as our new Technical Lead.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">HighWire Press's DigiCorePro and ROR</title><id>https://doi.org/10.71938/97e0-5q39</id><link rel="alternate" href="https://ror.org/blog/2025-09-09-digicorepro/"/><published>2025-09-09T00:00:00+00:00</published><updated>2025-10-22T06:05:03-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/digicorepro/DigiCorePro.png"
         alt="DigiCorePro logo"/>
</figure>

<p>In this interview with HighWire Press&rsquo;s <a href="https://www.linkedin.com/in/tonyhopedale/">Tony Alves</a>, we learn that thanks to customer requests and a PID-aware development process, the publishing platform <a href="https://www.highwirepress.com/solutions/digicorepro/">DigiCore Pro</a> uses ROR in form lookups and automatic extraction processes for author affiliations, funder identification, peer reviewer affiliations, user disambiguation, and research integrity.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;We want to accommodate as many industry-accepted processes and tools as we can, and ROR is of course extremely important now.&rdquo;</p>
<p>&ldquo;Our customers are asking for it. We&rsquo;re seeing ROR on Requests for Proposals (RFPs) as a requirement, especially now that it is being used at Crossref to identify funders.&rdquo;</p>
<p>&ldquo;The person uploads their manuscript, and we then look through the manuscript for header information, pull that out, match it to the ROR ID, and present it as fields that are already filled in that they can change if need be to something that&rsquo;s correct.&rdquo;</p>
<p>&ldquo;And what&rsquo;s great is that because it&rsquo;s open source, an open standard, we don&rsquo;t have to constrain ourselves.&rdquo;</p>
<p>&ldquo;I really appreciate the community around ROR, the vibrancy of that.&rdquo;</p>
<p>– Tony Alves, Senior Vice President, Product Management at HighWire Press, Inc.</p>
</span></div>
	</div>
</div>
<hr>


<h3 id="hahahugoshortcode-s2-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about what made you decide to use ROR at HighWire, specifically in the <a href="https://www.highwirepress.com/solutions/digicorepro/">DigiCorePro</a> publishing platform.</p>


<h3 id="hahahugoshortcode-s3-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>Sure. There were really a couple of different things. <strong>One is that we want to accommodate as many industry-accepted processes and tools as we can, and ROR is of course extremely important now.</strong> ROR is really helpful, because it provides us with a standardized list of institutions that people can pick from rather than typing them in. And because ROR is free and open, we could implement it without having to pay anybody, which is really, really helpful.</p>
<p><strong>The second thing is that our customers are asking for it. We&rsquo;re seeing ROR on Requests for Proposals (RFPs) as a requirement, especially now that <a href="https://doi.org/10.13003/156081">it is being used at Crossref to identify funders</a>.</strong> I like the fact that now there are fewer PIDs that we need to worry about. In those RFPs, people want ROR as not only an institution identifier for author affiliation, but they also want it as the funding PID. Also, of course, we can offer ROR to people who can&rsquo;t afford a more expensive persistent identifier.</p>


<h3 id="hahahugoshortcode-s4-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great! You&rsquo;ve touched on this a little bit, but how is ROR used in DigiCorePro?</p>


<h3 id="hahahugoshortcode-s5-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>It is used as the basis of a pick list when people are selecting their institution so at that institution level it&rsquo;s a typeahead, so you start typing, and you&rsquo;ll get the ROR.</p>


<h3 id="hahahugoshortcode-s6-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Do they see the ROR?</p>


<h3 id="hahahugoshortcode-s7-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>They just see the institution name. They don&rsquo;t see an identifier.</p>


<h3 id="hahahugoshortcode-s8-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s actually what we recommend, so that&rsquo;s great.</p>


<h3 id="hahahugoshortcode-s9-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>We capture the identifier when the institution name is selected, and so when the metadata is transmitted, if the API wants the ROR, then that will be there for the affiliation institution field. Authors can have multiple affiliations, too, so you would then have multiple ROR IDs for the different affiliations. We have also implemented it as an identifier for funders.</p>


<h3 id="hahahugoshortcode-s10-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s great.</p>


<h3 id="hahahugoshortcode-s11-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>In both cases, we do try to anticipate the institution when we extract the metadata and match it with the ROR ID. We do that when we can for both funding and the person&rsquo;s institutional affiliation. <strong>The person uploads their manuscript, and we then look through the manuscript for header information, pull that out, match it to the ROR ID, and present it as fields that are already filled in that they can change if need be to something that&rsquo;s correct.</strong></p>


<h3 id="hahahugoshortcode-s12-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, that&rsquo;s interesting. It&rsquo;s great that you do that. I think that&rsquo;s a popular approach these days, and it&rsquo;s so much easier for the author.</p>


<h3 id="hahahugoshortcode-s13-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>We want to make it as simple as possible. The funder information is hard because there&rsquo;s not a standard way of providing that information. People are still providing it in paragraph format somewhere, you know, even if you&rsquo;ve asked for it in a certain place, right? So that&rsquo;s always a challenge.</p>


<h3 id="hahahugoshortcode-s14-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Those funding acknowledgements are very non-standardized. You mentioned that DigiCorePro transmits this metadata, including the ROR IDs, via an API. Where are some of the places that metadata might be transmitted to?</p>


<h3 id="hahahugoshortcode-s15-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>Oh, there are all kinds of uses for our API. For example, it might go to a research integrity check tool, and depending on what data they want, the ROR and the affiliation information would be part of that. If there&rsquo;s a place in their schema for ROR, it would fit in there. If their schema has just a general institution field, the institution name would go in there. There are also systems like <a href="https://www.convey.org/">Convey</a> that collect conflict of interest information. Convey was created by the <a href="https://www.aamc.org/">Association for American Medical Colleges</a>. We query Convey&rsquo;s database for conflict of interest information, so if ROR was part of that, it would be in that exchange.</p>


<h3 id="hahahugoshortcode-s16-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I&rsquo;ll contact Convey to see if they use ROR or are planning to use ROR!</p>


<h3 id="hahahugoshortcode-s17-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>It also might be <a href="https://www.prophy.com/">Prophy</a> or <a href="https://globalcampus.ai/">Global Campus</a> or <a href="https://www.reviewercredits.com/">Reviewer Credits</a>, any of these reviewer finding tools that might get the ROR IDs from the API.</p>


<h3 id="hahahugoshortcode-s18-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I know Prophy uses ROR quite heavily.</p>


<h3 id="hahahugoshortcode-s19-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>There&rsquo;s also <a href="https://orcid.org">ORCID</a>. We can pull in the ROR if it&rsquo;s in the ORCID record, or we can push the ROR out if somebody is establishing an ORCID record.</p>


<h3 id="hahahugoshortcode-s20-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes! I actually can&rsquo;t wait for the day when all of the organization and funder identifiers in ORCID are ROR IDs. And I think that day is not too far away. Increasingly, <a href="https://info.orcid.org/event/better-together-orcid-ror/">ORCID is standardizing on ROR</a>.</p>


<h3 id="hahahugoshortcode-s21-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>Anytime we need to push affiliation information and ROR is in the API, in the schema, that&rsquo;s what we&rsquo;ll do with the information.</p>


<h3 id="hahahugoshortcode-s22-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What was the development process like for building ROR into DigiCorePro?</p>


<h3 id="hahahugoshortcode-s23-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>We use an Agile process to develop the product itself. Part of what we do is to look at how something is implemented in <a href="https://www.highwirepress.com/solutions/highwire-benchpress/">BenchPress</a>, then try to figure out what might be the best way to do that in DigiCorePro, which is more modularized. Specifically with ROR, we understand that there are multiple ways to use ROR, as I&rsquo;ve been discussing, either as a person&rsquo;s affiliation or as a funding ID. Also, there are multiple journeys for ROR: some systems are going to want the ROR IDs from us and some systems are going to send the ROR IDs to us. We create stories to describe how ROR will be used in DigiCorePro. How is the ROR ID going to get to us or be sent out? Where are we going to be using the ROR ID in the workflow process? Those get written into stories, and then those go into the development process as we develop the various features.</p>
<p>Persistent identifiers in general are always being considered for all the processes. Would persistent identifiers be needed in this process, in this interchange? What will those persistent identifiers be, or might there be a persistent identifier in the future that we might want to be sure is included in the design of this feature or function? We keep that in mind because we can see, just from our understanding of where the industry is going, that different aspects of the manuscript could potentially have different persistent identifiers.</p>
<p>For instance, for reviewers, it&rsquo;s better to have the ROR for that person than it is to have the name of the institution, because the reviewer doesn&rsquo;t get it right when they type it in. They&rsquo;re just going to type in &ldquo;UMass&rdquo; or &ldquo;Harvard.&rdquo; We want to be able to be sure that if we are getting that information about that reviewer, that we have that consistent information.</p>
<p>It also helps us to disambiguate people. The ORCID is important for that, but sometimes when we give a journal staff person an opportunity to look at multiple people&rsquo;s records, if somebody has somehow gotten registered two or three times, ROR is one of those things that will give them enough information to disambiguate and to identify which records need to be merged.</p>
<p>Similarly, with research integrity, if you have somebody who doesn&rsquo;t have an institution that is in ROR, then potentially that may not be a real institution. We know that you don&rsquo;t yet have all the institutions, necessarily, and that sometimes new institutions come up, but it can still be a useful piece of information.</p>
<p>So those are some of the ways that we think about ROR and other persistent identifiers in the development process. Will it be something that will be a useful element in this workflow? <strong>And what&rsquo;s great is that because it&rsquo;s open source, an open standard, we don&rsquo;t have to constrain ourselves.</strong></p>


<h3 id="hahahugoshortcode-s24-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Is there anything else you want to say about ROR?</p>


<h3 id="hahahugoshortcode-s25-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p><strong>I really appreciate the community around ROR, the vibrancy of that.</strong> I enjoyed <a href="https://www.pidapalooza.org/">PIDapalooza</a>, and that was really a great <a href="https://doi.org/10.71938/0eh7-xg96">launching pad for ROR</a> because ROR is community-based and has an active advocacy group around it.</p>


<h3 id="hahahugoshortcode-s26-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Lastly, any feature requests or suggestions for future directions?</p>


<h3 id="hahahugoshortcode-s27-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>I think that from a systems perspective, being sure that your API is super-responsive in real time is really useful, because we can&rsquo;t stop the submission process when an author is there to submit. I know it&rsquo;s possible to download ROR and use it locally, but that means having a process to get the most recent version. That&rsquo;s probably the safer implementation, because then if the API goes down, you&rsquo;re not dependent on that, but then you don&rsquo;t always have the most up-to-date information. That&rsquo;s a struggle everybody has with these sorts of things when you&rsquo;re dependent on an API for an essential process like identifying affiliation, so I think that&rsquo;s an important part of your infrastructure to be sure is really stable.</p>


<h3 id="hahahugoshortcode-s28-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, ROR is quite a small dataset, so downloading the data and storing it locally is one of the <a href="https://ror.readme.io/docs/forms#/implementation-approaches">implementation approaches</a> we recommend, for precisely the reasons you mention. That being said, the ROR API has been very, very stable. It&rsquo;s had <a href="http://status.ror.org">great uptime</a>.</p>
<p>However, because the ROR API is totally free and open, and because ROR is getting more and more popular, we&rsquo;re seeing some excessive use that can cause issues. That&rsquo;s one of the reasons why we recently started an initiative to implement <a href="https://ror.readme.io/docs/client-id#/">client identification</a> on the ROR API so that API users need to tell us who they are before receiving the current rate limit. We&rsquo;re phasing that in gradually. Once it&rsquo;s implemented, if somebody is using the ROR API incorrectly and therefore degrading the performance of the API for everybody else who&rsquo;s using it, we can contact that API user and help troubleshoot the issue instead of just blocking their IP address.</p>


<h3 id="hahahugoshortcode-s29-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>Good to hear.</p>


<h3 id="hahahugoshortcode-s30-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for talking with me, Tony.</p>


<h3 id="hahahugoshortcode-s31-hbhb-tony-alves"><figure class="round-figure"><img src="/img/blog/digicorepro/tony-alves.jpg"
         alt="Avatar of Tony Alves"/>
</figure>
 Tony Alves 
</h3>
<p>Thanks so much.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write to us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this interview with HighWire Press's Tony Alves, we learn that thanks to customer requests and a PID-aware development process, the publishing platform DigiCore Pro uses ROR in form lookups and automatic extraction processes for author affiliations, funder identification, peer reviewer affiliations, user disambiguation, and research integrity.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Symplectic Adds ROR Integration</title><id>https://doi.org/10.71938/1fch-hb40</id><link rel="alternate" href="https://ror.org/blog/2025-07-18-symplectic-adds-ror-integration/"/><link rel="related" href="https://www.symplectic.co.uk/symplectic-elements-introduces-ror-integration-for-enhanced-organization-metadata/"/><published>2025-07-18T00:00:00+00:00</published><updated>2025-10-22T06:05:03-04:00</updated><author><name>Symplectic</name><uri>https://ror.org/00vzax568</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span><p>Exciting news: <a href="http://symplectic.co.uk/">Symplectic,</a> a global leader in research information management systems, has integrated ROR into its flagship platform, Symplectic Elements. Symplectic Elements now synchronizes ROR data approximately every 30 days and harvests ROR-enriched data for funders and author affiliations from Crossref and Dimensions, meaning that research offices, librarians, and administrators can now better analyze author collaborations, detect funding trends, and conduct open access reporting.</p>
<p>This integration represents a meaningful step forward in supporting open persistent identifiers and improving metadata quality across the global research ecosystem. Read the whole announcement below or <a href="https://www.symplectic.co.uk/symplectic-elements-introduces-ror-integration-for-enhanced-organization-metadata/">on the Symplectic blog.</a></p>
<p>&ndash; <a href="/authors/ror-core-team">ROR Core Team</a></p>
</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/symplectic/Symplectic_idn_Z91u_n_0.png"
         alt="Symplectic logo, part of Digital Science"/>
</figure>

<p>We’re pleased to announce that Symplectic Elements now includes integration with the Research Organization Registry (ROR) - a key enhancement to our new External Organisations functionality and a significant step forward in enriching and disambiguating affiliation data across the research lifecycle.</p>


<h2 id="what-are-external-organisations-in-elements">What Are External Organisations in Elements? 
</h2>
<p>In <a href="https://www.symplectic.co.uk/symplectic-elements-6-22-release-notes/">Symplectic Elements 6.22</a>, released in early 2025, we introduced a new External Organisations data category. Symplectic Elements now maintains an authoritative list of External Organisations - institutions outside your own - which is used to normalize and enrich metadata relating to co-author affiliations, funding bodies, and other collaborative relationships. This dedicated data category is populated through a combination of trusted sources, including <a href="https://www.grid.ac/">GRID</a>, <a href="https://www.dimensions.ai/">Dimensions</a>, and now <a href="https://ror.org">ROR</a>.</p>
<p>This new approach enables more structured and consistent treatment of external entities across the system, supporting clearer collaboration insights, stronger reporting foundations, and more reliable metadata for open research initiatives.</p>
<p>This means:</p>
<ul>
<li>Fewer manual entries and less ambiguity around institutional names</li>
<li>More accurate tracking of external affiliations across researchers, publications, and grants</li>
<li>Improved support for reporting on collaborations and research partnerships</li>
<li>A stronger foundation for funder acknowledgment metadata and impact analysis</li>
</ul>
<figure><img src="/img/blog/symplectic/symplectic-ror-screenshot.png"
         alt="ROR in Symplectic Elements for an author affiliation."/>
</figure>



<h2 id="how-ror-integration-supports-your-data">How ROR Integration Supports Your Data 
</h2>
<p>ROR is the leading open registry of persistent identifiers for research organizations. With ROR now integrated into Symplectic Elements, the platform can automatically enrich External Organisation records with up-to-date, authoritative metadata, sourced directly from ROR and from partners like Dimensions.</p>
<p>In addition to enhancing administrative workflows, these identifiers will increasingly become visible to researchers as affiliation data is harvested from publication sources such as <a href="https://www.crossref.org/">Crossref</a> - ensuring your users benefit from clearer, more consistent records.</p>
<p>This integration not only enhances the integrity of organizational data, but also supports institutions in tracking external partnerships, mapping research networks, and advancing open research infrastructure.</p>
<p>“Integrating ROR into our External Organisations model is aligned with our mission to connect institutions with high-quality data sources,” said Ben Heartland, Product Manager at Symplectic. “This enhancement ensures accurate and transparent organizational affiliation reporting for our community of researchers worldwide.”</p>
<p>By unifying metadata from across your institutional network with persistent, open identifiers, this development lays the groundwork for even richer research intelligence - all enabled through Symplectic Elements.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span>To learn more about how Symplectic Elements can support your research infrastructure, visit <a href="https://symplectic.co.uk">symplectic.co.uk</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Exciting news: Symplectic, a global leader in research information management systems, has integrated ROR into its flagship platform, Symplectic Elements.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Timeline for the Sunset of Version 1 of the ROR API and Schema</title><id>https://doi.org/10.71938/y9pa-8m20</id><link rel="alternate" href="https://ror.org/blog/2025-06-11-v1-sunset/"/><published>2025-06-11T00:00:00+00:00</published><updated>2025-10-31T16:18:02-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/keyur-nandaniya-8oLnCRz7hwM-unsplash.jpg"
         alt="Pensive lion at golden hour. Photo by Keyur Nandaniya on Unsplash."/>
</figure>

<p>One of the highlights of 2024 for the ROR team and the ROR community was the <a href="/blog/2024-04-15-announcing-ror-v2/">release of version 2 of the ROR API and schema</a> in April 2024. Before that milestone, we spent many months <a href="https://ror.readme.io/docs/feedback-docs#handling-schema-and-api-versioning-in-ror">working together to develop an API and schema versioning policy</a>: that policy states that &ldquo;Plans to sunset a previous version will be announced at least 1 year prior to the planned sunset date.&rdquo; We&rsquo;ve been reminding the ROR community ever since the launch of v2 of the ROR API and schema that version 2 is the recommended, stable version and that v1 of the ROR API would be deprecated in 2025. Today, we&rsquo;re sharing further details and more precise dates for the deprecation of version 1 of ROR so that ROR users have all the lead time they need to make any necessary changes.</p>


<h2 id="timeline-for-ror-v1-sunset">Timeline for ROR v1 sunset 
</h2>
<p>Here&rsquo;s our basic timeline for deprecating version 1 of ROR.</p>
<div class='callout mustard'>
	<div class="row">
		<div class="col-md-12"><span><div class='shortcode-row'>
<div class="col-md-5"><strong>Week of July 28, 2025</strong></div>
<div class="col-md-7"><ul>
<li>ROR API requests with no version in the path will default to v2 instead of v1</li>
</ul>
</div>
</div>
<div class='shortcode-row'>
<div class="col-md-5"><strong>Early or mid-December 2025</strong></div>
<div class="col-md-7"><ul>
<li>ROR API requests with v1 in the path will no longer return a response</li>
<li>V1 files will no longer be included in the ROR data dump</li>
<li>V1 documentation will no longer be available</li>
</ul>
</div>
</div>
</span></div>
	</div>
</div>


<h2 id="what-you-can-expect-in-july">What you can expect in July 
</h2>
<p>Starting the week of <strong>Monday, July 28, 2025</strong>, ROR API requests with no version in the path will default to v2 instead of v1. What does that mean, exactly?</p>
<ul>
<li>If you use ROR API requests with v2 in the path, you do not need to make any changes.</li>
<li>If you use ROR API requests with v1 in the path, you do not need to make any changes before July 28, but you should plan to make changes before December.</li>
<li>If you use ROR API requests with no version in the path, you will need to change your API requests to include v1 in the path if you wish to continue retrieving v1 data. If instead you wish to start retrieving v2 data, you should make sure you understand the differences between <a href="https://ror.readme.io/v1/docs/ror-data-structure">ROR schema version 1.0</a> and <a href="https://ror.readme.io/v2/docs/ror-data-structure">ROR schema version 2.1</a> and update your script or application accordingly before you begin including v2 in the path of your ROR API requests.</li>
</ul>


<h3 id="examples-of-ror-api-requests">Examples of ROR API requests 
</h3>
<p>Here are three examples of the same request to the ROR API for a list of organizations whose type is &ldquo;funder&rdquo;: one that specifies version 2, one that specifies version 1, and one that does not specify a version.</p>


<h4 id="ror-api-request-with-v2-in-the-path">ROR API request with v2 in the path 
</h4>
<p><code>https://api.ror.org/v2/organizations?filter=types:funder</code></p>
<figure class="blog-figure"><img src="/img/blog/v1-sunset/ror-api-v2-request.png"
         alt="ROR API v2.1 response."/>
</figure>



<h4 id="ror-api-request-with-v1-in-the-path">ROR API request with v1 in the path 
</h4>
<p><code>https://api.ror.org/v1/organizations?filter=types:funder</code></p>
<figure class="blog-figure"><img src="/img/blog/v1-sunset/ror-api-v1-request.png"
         alt="ROR API v1 response."/>
</figure>



<h4 id="ror-api-request-with-no-version-in-the-path">ROR API request with no version in the path 
</h4>
<p><code>https://api.ror.org/organizations?filter=types:funder</code></p>
<figure class="blog-figure"><img src="/img/blog/v1-sunset/ror-api-no-version-request.png"
         alt="ROR API v1 response."/>
</figure>

<p>All three of these requests return the same number and set of records, but you&rsquo;ll notice that the request without a version in the path currently returns records that use <a href="https://ror.readme.io/v1/docs/ror-data-structure">ROR schema version 1.0</a>. As of the end of July, that will change, and an API request that does not specify a version in the path will return records that use <a href="https://ror.readme.io/v2/docs/ror-data-structure">ROR schema version 2.1</a>.</p>
<p>Because ROR schema version 1.0 and ROR schema version 2.1 are significantly different, we recommend carefully evaluating the information you retrieve from version 1 of the ROR API before changing your API requests to version 2.</p>


<h2 id="what-you-can-expect-in-december">What you can expect in December 
</h2>
<p>In early or mid-December 2025, we will sunset v1 of the ROR API and schema permanently. This means that ROR API requests with v1 in the path will cease to return responses, and we will no longer provide v1 files in the ROR <a href="https://ror.readme.io/docs/data-dump">data dump</a> on Zenodo at <a href="https://doi.org/10.5281/zenodo.6347574">https://doi.org/10.5281/zenodo.6347574</a>. In addition, <a href="https://ror.readme.io/v1/">ROR documentation pertaining to v1 of the ROR schema and API</a> will no longer be available.</p>


<h2 id="important-additional-points-to-note">Important additional points to note 
</h2>
<p>Version 2 of the ROR API and schema was <a href="https://ror.readme.io/docs/feedback-docs#ror-schema-v20">deliberately designed for both flexibility and stability</a>, so <strong>we don&rsquo;t anticipate any further breaking changes for quite some time.</strong> We&rsquo;re always collecting feedback about future enhancements on our <a href="https://github.com/ror-community/ror-roadmap/">roadmap</a>, so if there&rsquo;s something you&rsquo;d like to see ROR do, please let us know.</p>
<p>Lastly, and perhaps most importantly, we&rsquo;re here to help! If you&rsquo;ve got questions about how to update your script or application from v1 of ROR to v2 of ROR, please <a href="https://ror.readme.io/docs/schema-v2#/">consult ROR&rsquo;s documentation</a>, join the <a href="https://groups.google.com/a/ror.org/g/ror-tech">ROR Technical Forum</a> and post your question there, or email <a href="mailto:support@ror.org">support@ror.org</a>. The ROR Technical Forum is also a great place for developers who&rsquo;ve successfully switched from v1 to v2 to share lessons learned with others, and we&rsquo;d love to hear from you there.</p>
<p>After the sunset of the original version of ROR comes a night of restful sleep, and then a new day! See you bright and early on the other side.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Version 2 of the ROR API will become the query default in July 2025, and version 1 of the ROR API and dataset will be deprecated entirely in December 2025.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR in the US Department of Transportation's Research Repository</title><id>https://doi.org/10.71938/pfdw-jr49</id><link rel="alternate" href="https://ror.org/blog/2025-05-08-usdot-repository/"/><published>2025-05-08T00:00:00+00:00</published><updated>2025-10-22T06:05:03-04:00</updated><author><name>Peyton Tvrdy</name><uri>https://orcid.org/0000-0002-9720-4725</uri></author><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/usdot/rosap-logo.png"
         alt="US Department of Transportation, Rosa P Repository and Open Science Access Portal"/>
</figure>

<p>The <a href="https://rosap.ntl.bts.gov/">ROSA P digital repository</a>, operated by the <a href="https://ntl.bts.gov/ntl">US National Transportation Library</a> &ndash; part of the <a href="https://www.transportation.gov/">US Department of Transportation</a> &ndash; is one of the top ten providers of <a href="https://api.datacite.org/dois?query=publisher.publisherIdentifierScheme:ROR+OR+publisher.schemeUri:%22https://ror.org%22+OR+publisher.publisherIdentifier:*ror*&amp;publisher=true&amp;page%5Bsize%5D=0">ROR IDs for publisher identification</a> and <a href="https://api.datacite.org/dois?query=fundingReferences.funderIdentifierType:ROR&amp;page%5Bsize%5D=0">ROR IDs for funder identification</a> in DataCite DOI metadata. In this short case study, Peyton Tvrdy tells us why and how her team did such exemplary work in producing metadata for open transportation research funded or produced by USDOT.</p>
<p>This article represents the first in an ongoing series of short <a href="https://ror.org/categories/case-studies">case studies</a> on ROR users that we&rsquo;re calling &ldquo;Four for ROR&rdquo; in which we ask ROR users four questions about why and how they&rsquo;re using ROR.</p>
<hr>


<h2 id="what-made-you-decide-to-use-ror">What made you decide to use ROR? 
</h2>
<p>I was first introduced to ROR IDs Fall 2023 when I started at the National Transportation Library. A coworker of mine, <a href="https://orcid.org/0000-0002-0543-4268">Leighton Christiansen</a>, was introduced to ROR and thought it would be a great idea to create <a href="https://ror.org/02xfw2e90">ROR IDs for the Department of Transportation</a> and our sub-organizations. That fall, I used the <a href="https://curation-request.ror.org">ROR submission form</a> to create seven ROR IDs for the library. At this time we were unsure of how we would use them and integrate ROR into our repository.</p>
<p>While our repository does not have full ROR integration currently, we use ROR to identify our organizations in our <a href="https://datacite.org">DataCite</a> DOI metadata. I decided to <a href="https://ror.readme.io/docs/matching#match-organization-names-to-ror-ids-using-the-ror-api">use the ROR API to match our organizations</a> because our cataloging system has many different controlled values of the same organizations. ROR IDs help us to standardize our organization names while still giving proper attribution to organizations in our DataCite DOI metadata. Additionally, because I have a <a href="https://github.com/ptvrdy/doi-parser">program</a> that uses the DataCite API for DOI updates and maintenance, it was easy to integrate the ROR API for organization matching. Using ROR allows us to better standardize our organizations, fully utilize persistent identifiers, and create better metadata.</p>
<figure><img src="/img/blog/usdot/doi-parser.png"
         alt="Confirming a match to a ROR ID in the DOI parser program"/><figcaption>
            <p>Confirming a match to a ROR ID in the <a href="https://github.com/ptvrdy/doi-parser">DOI parser program</a></p>
        </figcaption>
</figure>



<h2 id="how-do-you-use-ror">How do you use ROR? 
</h2>
<p>Currently, the National Transportation Library only uses ROR IDs programmatically for our DOI metadata in my <a href="https://github.com/ptvrdy/doi-parser">DOI parser program</a>. ROR IDs are used for creator organizations, contributing organizations, publishing organizations, and funding organizations. However, ROR IDs are also used in the README files for our datasets in ROSA P. These README files contain important information about datasets, such as funding information, file information, licensing, and the preferred citation.</p>
<figure><img src="/img/blog/usdot/rosap-dataset-readme.png"
         alt="ROR IDs being used for organizations in a dataset&#39;s README file"/><figcaption>
            <p>ROR IDs being used for organizations in a <a href="https://doi.org/10.21949/rb92-6j61">dataset’s README file</a></p>
        </figcaption>
</figure>



<h2 id="what-were-the-steps-you-took-to-integrate-ror-into-your-systems-and-workflows">What were the steps you took to integrate ROR into your systems and workflows? 
</h2>
<p>Cross-walking our organization names to ROR names and ROR IDs was a lengthy process, but it has resulted in better quality metadata. My program maintains a CSV of our organization names and their mapping to ROR. This allows my program to seamlessly match our organizations to organizations already matched to ROR. For organizations not in this spreadsheet, it queries the ROR API and has the user confirm or manually add a ROR match if there is one. That user response is then saved to the CSV for future use. This lessens the possibility for mistakes as each ROR ID and organization name are only matched once. This process is done for all organization fields in the <a href="https://datacite-metadata-schema.readthedocs.io/en/4.5/">DataCite schema</a>. Mapping to ROR is especially easy for <a href="https://datacite-metadata-schema.readthedocs.io/en/4.5/properties/fundingreference/#funderidentifier">funder organizations</a> because all of our repository’s research is funded exclusively by the Department of Transportation, so all of our contract and grant numbers are attributed properly to the Department of Transportation.</p>
<figure class="blog-figure"><img src="/img/blog/usdot/datacite-json-usdot.png"
         alt="DataCite JSON Metadata of a contract attributed to USDOT with the ROR ID"/><figcaption>
            <p>DataCite JSON Metadata of a contract attributed to USDOT with the ROR ID</p>
        </figcaption>
</figure>



<h2 id="what-else-would-you-like-to-say-about-ror">What else would you like to say about ROR? 
</h2>
<p>ROR as an organization is an amazing resource for organizations looking to standardize and manage their organization names. The staff at ROR and the <a href="/events">community calls</a> are excellent resources to learn more about integrating organization persistent identifiers into your repository, documentation, or other processes. While matching controlled values to ROR IDs can be time-consuming, it is worth the effort in the pursuit of linked data.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions or if you&rsquo;d like to contribute to the ROR blog.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Peyton Tvrdy of the US National Transportation Library tells us why and how her team did such exemplary work in producing metadata for Rosa P, a national digital repository for open transportation research.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: How ROR in OA.Report Helps HHMI Monitor Open Access Policy Compliance</title><id>https://doi.org/10.71938/dxm0-qw91</id><link rel="alternate" href="https://ror.org/blog/2025-05-01-oareport/"/><published>2025-05-01T10:04:35-04:00</published><updated>2026-03-17T09:07:53-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<!-- Commonly used content 

<figure class="featured-figure"><img src="/img/blog/"/>
</figure>


<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span></span></div>
	</div>
</div>

<figure class="blog-figure"><img src="/img/blog/"/>
</figure>



<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/XXX" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video"></iframe>
</div>


<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any comments or questions.</span></div>
	</div>
</div> 
-->
<figure class="featured-figure"><img src="/img/blog/oaworks/oareport-hhmi.png"
         alt="OA.Report and HHMI logos."/>
</figure>

<p>In this dual case study, we learn why the Howard Hughes Medical Institute (HHMI) relies on OA.Report and why OA.Report relies on ROR to help HHMI track compliance with its open access policy.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>“Even back then [in 2019], the best option was to lean on a big, community-owned solution. And it’s been great to see ROR effectively become the standard, the clear way forward for identifying organizations.”</p>
<p>“We think ROR is terrific. We think it’s terrific today, and when we check in and see where ROR is headed, we’re excited. We’re excited to see ROR supersede Funder Registry IDs. We’re excited to see the API and the metadata curation models getting refined over time. And we’re excited to see the ROR team making lots of efforts to get global coverage of research organizations.”</p>
<p>– Joe McArthur, Executive Director, OA.Works</p>
<p>“Another aspect of this that’s definitely relevant here is that what OA.Report is picking up on is not grants or Grant IDs or anything like that, and not even the word ‘HHMI’ just appearing somewhere in the manuscript. It’s actually affiliations. That’s the flag that says to us, ‘This is a paper that you might want to pay attention to.’ &quot;</p>
<p>– Michele Avissar-Whiting, Director of Open Science, Howard Hughes Medical Institute (HHMI)</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Let’s start with introductions from you both. Joe, can you tell us your name, title, and organization?</p>


<h3 id="hahahugoshortcode-s8-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>I’m Joe McArthur. I’m the Executive Director of <a href="https://oa.works">OA.Works</a>.</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Michele?</p>


<h3 id="hahahugoshortcode-s10-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>I’m Michele Avissar-Whiting. I am Director of Open Science Strategy at <a href="https://www.hhmi.org/">HHMI, the Howard Hughes Medical Institute</a>.</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. So, Joe, can you tell us about OA.Works and specifically about <a href="https://oa.report">OA.Report</a> as well?</p>


<h3 id="hahahugoshortcode-s12-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>OA.Works is a nonprofit project with the mission to build powerful, simple, open access tools with the vision of a just and kind information age. A lot of what we focus on day to day is building tools that make open access easy and equitable, and OA.Report is one of those tools. OA.Report is designed to help organizations unlock the research that they support, and we have a pretty strong focus on helping organizations implement open access policies. So we’ll do that by helping organizations find the articles that they’ve supported, whether that’s via funding or when their staff have authored it. We help them measure how open that research is, and then take concrete action to increase how much of it is open and increase things like policy compliance.</p>
<p>HHMI is a pretty typical example of our work. <a href="https://blog.oa.works/hhmi-using-oareport-to-measure-and-improve-oa-policy-compliance/">We help them track their research output</a> using ROR and figure out how much of it is open access. It will help them figure out the policy compliance, but also other things, like how many articles have data availability statements. I know they’re using a lot of that data to then email authors to ask them to share their work publicly and raise open access policy compliance that way. I think that’s pretty cool. How we do this, I think, is quite neat. We use open data from big scholarly metadata sources like <a href="https://openalex.org">OpenAlex</a>, and also lots of persistent identifiers like ROR, but we also layer in data the institution has that they may be gathering internally, data that only they have.</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Do you mean the funder institution?</p>


<h3 id="hahahugoshortcode-s14-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>Funders, universities, they all have data that they’ve collected that can help with this task, and it can really help make your way through the open data effectively. And then we layer in on top of that data that we collect and clean in-house just to provide the broadest possible picture that we can for folks of what’s out there and give people the confidence to use that data to take action.</p>
<figure><img src="/img/blog/oaworks/oareport-sample.webp"
         alt="OA.Report Sample"/><figcaption>
            <p>OA.Report sample</p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s16-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Fascinating. I don’t think I knew that it’s not only open data that you’re using.</p>


<h3 id="hahahugoshortcode-s17-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>It’s one of the most important parts of the work that we do. We’re massive fans of open data, and there’s so much you can do with it and persistent identifiers, but we do find, perhaps unsurprisingly, that if you enrich that open data with the data people are collecting internally, you can go further.</p>


<h3 id="hahahugoshortcode-s18-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>How do you collect that internal data from universities and funders?</p>


<h3 id="hahahugoshortcode-s19-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>During the setup process for OA.Report with institutions, we’ll talk them through the types of data that are often useful to us and work out how that’s going to be shared. The answer almost always is that essentially it will be open data by the by the end of the process. Normally, we work with them to identify what sorts of information they may have internally: that can be anything from previous grant reports stored in grant reporting systems or previous systems for tracking compliance or measuring output. It can be APC invoices, it can be lists of ORCIDs. It can be just names. It’s a whole suite of things, right? What I say is, “Just give that to us in whatever form you have it, and we will take it from there, clean it, do what we need to do, then use that to do output tracking on your behalf and help you figure out what isn’t open, what isn’t policy compliant.”</p>
<p>A classic example of this, and something that we work with a lot, is Grant IDs. We’ll have an initial set of Grant IDs from Crossref, which can, as we all know, unfortunately be incomplete or a bit messy. We will clean those up in-house. We will go out and collect more Grant IDs, and then from an organization’s internal data, we will match those Grant IDs to programs or initiatives inside the organization that may match to multiple Grant IDs or things like that. We’re not actually doing that in HHMI’s case, but that’s a good example of how we thread the needle between all these different data sources.</p>


<h3 id="hahahugoshortcode-s20-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Fascinating. Michele, I’m going to turn to you now. First of all, just tell us about HHMI.</p>


<h3 id="hahahugoshortcode-s21-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>HHMI is a research institute. We try to advance science by funding excellent researchers who are doing pioneering scientific research. We do it primarily not through grants: we’re not a traditional funder. We do have some grants, but the vast majority of our investment goes into actually employing researchers in the field at their institutions across the United States. That’s what the <a href="https://www.hhmi.org/programs/investigators">Investigator Program</a> is. When you hear “an HHMI investigator,” that’s an investigator who is typically hosted at some other institution but has a dual affiliation with that institution and HHMI. We also have a research facility in Virginia called Janelia Research Campus.</p>
<p>This is relevant, I’m sure, to the conversation that we’re going to have next about how we use OA.Report, but it’s an important foregrounding. It’s an important aspect to how HHMI functions, because we fund people, not projects. We are finding individuals who are doing excellent work already and giving them the resources that they need to get to new breakthroughs in science.</p>


<h3 id="hahahugoshortcode-s22-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Amazing. Tell us about why HHMI began to use OA.Report.</p>


<h3 id="hahahugoshortcode-s23-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>On January 1, 2022 we launched our <a href="https://www.hhmi.org/about/policies/publishing-sharing">current open access policy</a>, which is a requirement that all major research contributions that are produced by HHMI labs must be shared open access with the <a href="https://creativecommons.org/licenses/by/4.0/">CC BY license</a> immediately upon publication. It’s almost like the <a href="https://www.whitehouse.gov/ostp/news-updates/2022/08/25/ostp-issues-guidance-to-make-federally-funded-research-freely-available-without-delay/">Nelson memo</a> with the addition of the CC BY license requirement, but we did it earlier in 2022.</p>
<p>I came on about mid-2022 and when I came on, we still didn’t have a mechanism in place to enforce that policy beyond really manual work, just trying to pull in whatever publications we could find through things like <a href="https://pubmed.ncbi.nlm.nih.gov/">PubMed</a> and <a href="https://lens.org">Lens.org</a> and other databases to track compliance with the policy. So I started looking around. It was actually Ashley Farley at <a href="https://www.gatesfoundation.org/">Gates</a> who told me about Joe and OA.Report, saying that we should at least talk. I said, “Our situation is a little bit complex, because it’s not all outputs from HHMI, and not all outputs that are related to HHMI or have a grant from HHMI are subject to the policy. There are very specific criteria around it, so we would need to work with someone who could accommodate those finicky criteria.” And she said, “Just talk to Joe.” So I did, and that ended up being really the key to us being able to keep track of compliance with this policy.</p>
<p>We’ve been with OA.Works ever since. It was a process of probably about a year with a lot of back and forth to refine the specifics around what publications we need to be able to have visibility into and what information we need about them. We define “major contributions” as papers that have an HHMI lab head as the first, last or corresponding author, or a co-author in any of those positions. The first author is often a postdoc or graduate student who wrote the paper, and the last author is usually the principal investigator. The corresponding author is usually also the last author, but not always. So that’s very specific. If they’re a middle author, then that paper isn’t subject to the policy, and we don’t need to care about it for compliance purposes. That’s why OA.Report needs to be able to pick up on the order of authors, which before I spoke to Joe I didn’t know was possible to do. They have to be able to pull that out of the metadata and also to pick up on the metadata that ascribes corresponding authorship, which can be across multiple authors on a paper. So it’s complicated.</p>
<p>It’s also not every type of article type that’s subject to the policy. It’s only original research, so things like reviews and editorials and commentaries are excluded. There’s also a temporal aspect of it, which is becoming less important over time, but which was really annoying in the first year and a half or so that we were working together, which is that we only wanted to find papers that were submitted after the time that the policy went into effect. <em>Submitted</em>, not published, after January 1 2022. That was another piece of metadata that we needed, because if it was submitted on December 22 of 2021, then it’s not subject to the policy. We don’t want to be chasing people down for that.</p>
<p>Open access is a core tenet for us. It’s a massive part of our values. We fund this research so that it can help society, and we don’t think it can do that or be maximally impactful unless it’s free to read, obviously, but also maximally free to use. That’s where the CC BY license with no restrictions comes in. So Joe also needs to be able to pick up on CC BY versus <a href="https://creativecommons.org/licenses/by-nc/4.0/">CC BY-NC</a> [<em>non-commercial</em>] versus <a href="https://creativecommons.org/licenses/by-nc-sa/4.0/">CC BY-NC-SA</a> [<em>non-commercial, share-alike under the same license</em>] – all of those derivations that are not actually compliant with our policy and that we need people to take action on to fix if they select the wrong license. And this has become the big challenge for us, because there’s a lot of confusion around licenses, and it’s really hard to communicate with authors why this license matters and how it’s being restrictive if you add these other letters to the end of the open license.</p>


<h3 id="hahahugoshortcode-s24-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I remember the whole learning experience I went through about the difference between CC BY-NC and CC BY. I was opting into non-commercial uses for a long time, and then I realized through <a href="https://nowviskie.org/2011/why-oh-why-cc-by/">a great blog post by Bethany Nowviskie</a> and some other discussion that actually that does cause problems.</p>


<h3 id="hahahugoshortcode-s25-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>Right - on its face, it seems like maybe the better option, and you think <a href="https://creativecommons.org/licenses/by-nc-nd/4.0/">CC BY-NC-ND</a> [<em>non-commercial, no derivative works</em>] might be somewhat more protectionist, but still open. Then you learn that, for example, if you have an NC license, your work can’t be used in a textbook.</p>


<h3 id="hahahugoshortcode-s26-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>The other thing I wanted to mention here is that Crossref is planning a <a href="https://www.crossref.org/blog/metadata-schema-development-plans/">schema update</a>, and one of the things that will be added is <a href="https://credit.niso.org/">CRediT roles for contributors</a>, which might help with that author order problem. It’s not going to put you out of business, Joe, but it’s something that I think a lot of people will appreciate.</p>


<h3 id="hahahugoshortcode-s27-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>I mean, I will say that one thing we’re a big fan of is being put out of business! Especially by big open metadata sources. Michele laid out all the challenges for, you know, figuring out whether or not the HHMI policy covers a particular article that may be associated with HHMI. This a problem that we face <em>all</em> the time, and it’s one of the most important pieces for us. What it means to measure policy compliance is to figure out, “Does the policy apply or not?” And it’s surprisingly complicated in a large number of cases, because 90% of the data that Michele mentioned is not available in open metadata in places like <a href="https://crossref.org">Crossref</a>. Or it’s partially there, or it’s there in some sources but not others. So you’ve got to find other ways to get that data, and solving that is a big challenge that we take on at OA.Report.</p>
<p>But we’re really big fans of supporting the long term work that means we don’t have to do this. We don’t want to have to do this. We’d like to be supporting persistent identifiers and, you know, just data structures in the space that get us to not needing to do all of that leg work. We’ve been big proponents of <a href="https://www.crossref.org/services/grant-linking-system/">Crossref’s Grant Linking System</a>, and we’ve been working really hard with Crossref to try and speed up adoption of grant DOIs so that grants metadata is better and we don’t need to do so much work. It’s also great to hear that there are schema updates happening that will make it easier to spot the corresponding author, because it’s increasingly important for policies, and it’s really hard to get your hands on unless you are using multiple data sources or else you’re able to just go and read the paper yourself and pull out the corresponding author. That’s what we do now, but we’d prefer not to do it forever.</p>


<h3 id="hahahugoshortcode-s28-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>And every journal has their own way of demarcating funding and authors. <strong>Another aspect of this that’s definitely relevant here is that what OA.Report is picking up on is not grants or Grant IDs or anything like that, and not even the word “HHMI” just appearing somewhere in the manuscript. It’s actually affiliations. That’s the flag that says to us, “This a paper that you might want to pay attention to.”</strong> It’s if somebody who’s the first, last, or corresponding author has the HHMI affiliation. And of course, there are name variants. There’s “HHMI” [<a href="https://ror.org/006w34k90">https://ror.org/006w34k90</a>], there’s “Howard Hughes Medical Institute” [<a href="https://ror.org/006w34k90">https://ror.org/006w34k90</a>], there’s “Janelia Research Campus” [<a href="https://ror.org/013sk6x84">https://ror.org/013sk6x84</a>], which is part of HHMI. Sometimes people include HHMI with the Janelia Research Campus affiliation, but sometimes they don’t, and that gets into why ROR is necessary.</p>
<p>And sometimes people just don’t put that affiliation, and that’s a problem. That’s probably the next on the list of things for us to tackle with OA.Report, because right now that’s a blind spot for us. If someone just doesn’t put their HHMI affiliation, we don’t see it. I don’t know how often that happens, but it definitely happens with some frequency. So what we would like to do instead is pick up on ORCIDs, and of course those are not used universally, and so that’s another metadata problem. If we could always ensure that we are finding the right author on a paper, then we might not actually need the affiliation anymore.</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I look at affiliation metadata in Crossref frequently, to see of course who’s sending ROR IDs to Crossref, but also to see who’s just sending those affiliations. And I’ll tell you, even for new content, I find it surprisingly low. It hovers around 60% of new records that have affiliations. [<em><a href="https://bit.ly/ror-doi-orcid-stats">See ROR IDs in DOI and ORCID records</a> for Crossref API queries that retrieve the number of records with affiliations since 2022 and the total number of records since 2022.</em>]</p>


<h3 id="hahahugoshortcode-s30-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>Joe, I think you use lots of other data sources, though, including OpenAlex.</p>


<h3 id="hahahugoshortcode-s31-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>Yes. The affiliations challenge speaks directly to why ROR is necessary. And yes, in the Crossref metadata, it would be great to see <a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/affiliations/">more publishers putting ROR IDs in directly</a>, and I know the ROR team is hard at work nudging people to do that. So that’s awesome. And yes, one source that we work with a lot now is OpenAlex, and they will take that raw affiliation string from Crossref, match that to a ROR ID, and then everyone using OpenAlex gets the lovely benefits of ROR, even if the publisher is not putting them in. That’s one of the incredible things that open metadata and open sources gets us. Now, it would still be better if the publisher was putting the ROR in, but that’s one of the huge advantages of ROR as an open system. And, you know, increasingly, with community standards, people will go out and do that, and that’s really awesome.</p>
<p>There’s places where that’s still not far enough, though. One of the challenges that we see with HHMI data is that multiple affiliations are sometimes not handled fantastically by different systems. I know when we were doing the major author identification, I believe we were using OpenAlex data, Crossref data, PubMed Central data, <em>and</em> Europe PubMed Central data, because each of them was better at a particular slice of getting some of the corresponding, last, and first authors. Some of them were better at having all of the affiliations, but they’d have the authors in the wrong order, whereas some of them were better at having the right order of authors, but not all of the affiliations. It was a whirlwind! But towards the end, we got to a good place.</p>


<h3 id="hahahugoshortcode-s32-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>I said that maybe if we had reliable ORCIDs for researchers we wouldn’t need the researcher affiliations, but actually it does need to be a sort of triangulation. We also get false positives when people wrongly put HHMI as their affiliation. They have a grant from HHMI, and they or their postdoc wrongly put HHMI as their affiliation. What we really need is some kind of source of truth that could pick up on the ORCID and pick up on the ROR and then see if it’s a part of the list that we have of people who have active appointments at HHMI. That would be the panacea.</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I’m on a <a href="https://publicationethics.org/topic-discussions/claiming-institutional-affiliations">Committee on Publication Ethics (COPE) affiliations working group</a> that’s addressing that to an extent. Sometimes people are claiming affiliations they don’t have, sometimes in order to get an APC deal, sometimes for prestige, but sometimes also just by mistake. The question came up of whether some kinds of fellowships should count as affiliations. There will be a paper coming out of that that will help at least to outline some definitions about what affiliation means and to make some recommendations about best practices.</p>
<p>Anyway, coming back to you, Joe. You’ve already talked a little bit about this, but why and how do you use ROR?</p>


<h3 id="hahahugoshortcode-s34-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>OA.Works has been using ROR as an organization identifier from the very start, since I think 2019. It was in another product just a little bit before OA.Report’s time. Our reason for that was because we wanted an open, community-owned ID that had good coverage globally and was supported by a good team. And you know, that still defines ROR today, except all of those things are more true. Within OA.Reports, ROR IDs are our primary IDs for organizations. We will use that to take a first pass at populating alternative names, relationships, things like that. And then we will use those ROR IDs to go and search other open data sources. If they support ROR, we will just directly go and look at those results, and that will be a lot of how we navigate the open data portion of our work. We’re often using multiple ROR IDs to do that, which as far as we’re concerned is a feature, not a bug. That’s a great thing about ROR. We’re also using the cross-links between between ROR IDs and other IDs, which are a useful starting point for us when we’re starting to get into the weeds with folks.</p>


<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>How did you first come to hear about ROR back in 2019?</p>


<h3 id="hahahugoshortcode-s36-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>I wish my memory was good enough to remember! We’ve been at this for over a decade now, and from day one we’ve been putting together open code sharing, open data, and generally being a community-run nonprofit, and so we are always on the lookout for friends. Somewhere around that time I remember that people were excited and talking about it.</p>
<p>And we would have been having conversations internally back then about, like, “How do we identify this organization? Do we come up with some ID internally that we are then shepherding? Do we write just the string, with all the obvious issues that come with that?” <strong>Even back then, the best option was to lean on a big, community-owned solution. And it’s been great to see ROR effectively become the standard, the clear way forward for identifying organizations.</strong></p>


<h3 id="hahahugoshortcode-s37-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I have seen that as well. And even among publishers, who have been slower to adopt ROR, I think they plan to do it: it’s just a question of doing the work. They’ve built all their workflows around other things, and it takes awhile to overhaul those. Just recently, I’ve been focusing a lot on scholarly association publishers, and they’re very open to it. I think over the next one to five years, we’re going to see a lot more ROR IDs coming from publishers in particular.</p>


<h3 id="hahahugoshortcode-s38-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>That’s great.</p>


<h3 id="hahahugoshortcode-s39-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wrapping up! Joe, do you have any feature requests for ROR, or things you hope we do in the future?</p>


<h3 id="hahahugoshortcode-s40-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>I don’t know if I’ve already given it away, but I mean, we are fans. <strong>We think ROR is terrific. We think it’s terrific today, and when we check in and see where ROR is headed, we’re excited. We’re excited to see ROR supersede Funder Registry IDs. We’re excited to see the API and the metadata curation models getting refined over time. And we’re excited to see the ROR team making lots of efforts to get global coverage of research organizations.</strong> No notes, in many ways.</p>
<p>I would say that an area we are especially supportive of is that as ROR becomes more and more of a standard, it’s important to make sure that it’s more and more representative of organizations, to make sure that we don’t miss any organizations out there that need a ROR. That global coverage is a thing that we think ROR is uniquely positioned to do, and we want to make sure that as we’re trying to support bibliodiversity and make sure that wherever research has been published, it can be found, discovered, used, that ROR is a big part of that. That’s our only “plus one,” really.</p>


<h3 id="hahahugoshortcode-s41-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That’s a great point. The number of curation requests we’re getting for ROR is just a hockey stick curve upward, you know. We <a href="https://dev.ror.org/blog/2024-09-03-welcome-riley-marsh/">added a new curator in August</a> who has been wonderful. We’re often focusing on metadata with non-Latin character languages, as well. We&rsquo;re <a href="https://youtu.be/jRic-l8Nivs?si=3IbFsriwzHcPqKFe&amp;t=1110">working on a long-term project to improve metadata for organizations with names in Chinese</a>, for instance.</p>


<h3 id="hahahugoshortcode-s42-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>That’s really terrific. And that’s exactly the sort of things that we want ROR to be doing and that we’re really grateful for, because we at OA.Works are not going to become experts in Chinese-language organizations, but it’s important that the systems that get built around ROR can take advantage of that. Doing that work at ROR and doing that high level of curation across the dozens of languages that are important in the scholarly literature is just great. I’m really glad to hear that that’s being done.</p>


<h3 id="hahahugoshortcode-s43-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, that’s ongoing work here at ROR. Michele, what else would you like to say about your work, or about ROR, or about anything at all?</p>


<h3 id="hahahugoshortcode-s44-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>I don’t know if it’s a question or a comment, but I’m looking at the item types for the ROR record for <a href="https://ror.org/006w34k90">Howard Hughes Medical Institute</a>, and the organization type is given as “Funder” and “Nonprofit,” which is true. But then, if I go to the record for <a href="https://ror.org/013sk6x84">Janelia Research Campus</a>, which I see has HHMI as its parent, it’s listed as a “Facility.”</p>
<p>Now, the little nuance here is that HHMI, writ large, is a funder, but it’s primarily an institute, so we identify more as an institute than as a funder. What’s not captured in these item types is that the people who are employed by HHMI who are not at Janelia. We’re a little bit of a weird case, but I wonder whether it would make sense to also have HHMI as a “Facility.” As ROR starts to become better integrated into all these publishing systems, I would want Howard Hughes Medical Institute to be in that drop down as an affiliation. It wouldn’t be, necessarily, if it’s only categorized as a funder.</p>


<h3 id="hahahugoshortcode-s45-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That’s a great note. The first thing I’ll say is that you can <a href="https://curation-request.ror.org">request that change if you like</a>, or I can do it for you, and we can take a look at adding that type to the HHMI record. [<em>Note: this change was made in January 2025, a few weeks after the interview took place.</em>] The second thing I’ll say is that, as Joe mentioned, we’ve been doing a lot of work on <a href="https://ror.org/blog/2024-04-15-announcing-ror-v2/">evolving our schema and API</a>, and one of the things that we’re probably going to look at fairly soon is revising that <a href="https://ror.readme.io/v2/docs/ror-data-structure#types">list of item types</a>, because it isn’t very nuanced. We inherited the list of organization types from GRID, and there’s not very many of them.</p>
<p>The way ROR does this, is we put together a proposal, and then we ask for a lot of <a href="https://ror.readme.io/v2/docs/community-feedback">community feedback</a>. Once we get to that point, I will be sure to send that to you, because actually edge cases like yours are super useful in making sure that everybody has what they need.</p>


<h3 id="hahahugoshortcode-s46-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>I am happy to help or contribute to that conversation, however it’s helpful. I just wanted to flag that, because I wouldn’t want HHMI not to show up in a list of affiliations.</p>


<h3 id="hahahugoshortcode-s47-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I think it would appear in a list of affiliations by virtue of the fact that it’s also listed as a “Nonprofit,” but that’s a good thing to flag. Actually, in version 1 of our schema, all the organizations only had one type, and it was only after we released version 2 that we introduced the “Funder” type and began adding it to records. That’s why most records that have that “Funder” type also have another type, such as “Government.” I’m sure HHMI was originally listed as a “Nonprofit” only, and then we added that “Funder” type to every record with a Funder ID mapping.</p>


<h3 id="hahahugoshortcode-s48-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>It’s just one of those funny things. We’re always lumped into groups of funders, and, fair enough, we’re part of <a href="https://www.coalition-s.org/">cOAlition S</a>, and we are part of other funder groups. But even though we’re always considered by other people to be a funder, you’ll never hear somebody within HHMI refer to HHMI as a funder. It’s just interesting.</p>


<h3 id="hahahugoshortcode-s49-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Anything else from you, Joe, about your work or about ROR?</p>


<h3 id="hahahugoshortcode-s50-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>I could say infinite things about both. No, just that we’re really grateful to the ROR team for five years of hard work now. We’re so grateful to the ROR team for providing an infrastructure that so many people can build on. And Michele, we’re grateful to HHMI for getting to work with you for over two years now.</p>


<h3 id="hahahugoshortcode-s51-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you both so much for sitting down with me.</p>


<h3 id="hahahugoshortcode-s52-hbhb-michele-avissar-whiting"><figure class="round-figure"><img src="/img/blog/oaworks/michele-avissar-whiting.jpg"
         alt="Avatar of Michele Avissar-Whiting"/>
</figure>
 Michele Avissar-Whiting 
</h3>
<p>Happy to. Thanks.</p>


<h3 id="hahahugoshortcode-s53-hbhb-joe-mcarthur"><figure class="round-figure"><img src="/img/blog/oaworks/joe-mcarthur.jpg"
         alt="Avatar of Joe McArthur"/>
</figure>
 Joe McArthur 
</h3>
<p>Thank you.</p>
<hr>
<p><em>This interview was conducted in November 2024.</em></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this dual case study, we learn why the Howard Hughes Medical Institute relies on OA.Report and why OA.Report relies on ROR to help HHMI track compliance with its open access policy.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Using ROR IDs in Place of Funder IDs</title><id>https://doi.org/10.71938/20kn-4m06</id><link rel="alternate" href="https://ror.org/blog/2025-03-05-using-ror-ids-in-place-of-funder-ids/"/><link rel="related" href="https://doi.org/10.13003/156081"/><published>2025-03-05T13:46:09-05:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Patricia Feeney</name><uri>https://orcid.org/0000-0002-4011-3590</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Crossref has announced today that its members can now use ROR IDs to identify funders in any place where they currently use Funder IDs in their metadata. To learn more about using ROR IDs to identify funders, read ROR&rsquo;s <a href="https://ror.readme.io/docs/funder-registry">guide to the Open Funder Registry to ROR transition</a> and ROR&rsquo;s <a href="/blog/2024-08-06-using-ror-for-funder-identification/">blog post on systems that are already using ROR for funder identification</a>. Read the full text of the Crossref announcement below.</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/crossref-social.png"
         alt="Crossref logo."/>
</figure>

<p>Today, we&rsquo;re delighted to let you know that <strong>Crossref members can now use ROR IDs to identify funders in any place where you currently use Funder IDs in your metadata.</strong> Funder IDs remain available, but this change allows publishers, service providers, and funders to streamline workflows and introduce efficiencies by using a single open identifier for both researcher affiliations and funding organizations.</p>
<p>As you probably know, the <a href="https://ror.org">Research Organization Registry (ROR)</a> is a global, community-led, carefully curated registry of open persistent identifiers for research organisations, including funding organisations. It’s a joint initiative led by the California Digital Library, Datacite and Crossref launched in 2019 that fulfills the long-standing need for an open organisation identifier.</p>
<p>In 2023, we shared our plan to <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror">transition the Open Funder Registry into ROR</a>. More recently, we announced that we were planning to <a href="https://www.crossref.org/blog/roring-ahead-using-ror-in-place-of-the-open-funder-registry">update our schema so that it is possible to collect ROR IDs where we currently collect Funder IDs</a> such as in the funding metadata section for works and funder section for grants. Now that we have completed this work, Crossref members can start depositing ROR IDs where they would normally deposit Funder IDs. This update also means that the community, including funders, service providers, researchers, and data scientists can retrieve this metadata <a href="https://api.crossref.org/works?filter=has-ror-id:true">via our API</a>.</p>
<p>So come and ROR with us and start depositing ROR IDs for both researcher affiliations and funding organisations.</p>


<h2 id="open-funder-registry-ror-transition">Open Funder Registry-ROR transition 
</h2>
<p>This is of course a significant first step in the Open Funder Registry to ROR transition.</p>
<p>We’ve <a href="https://ror.readme.io/docs/funder-registry">always said</a> that we would continue supporting Funder IDs in our schema and in our tools and services until the community is ready to transition - and we will. In the last year, Crossref and ROR conducted a series of Open Funder Registry user interviews to help us understand how it was being used and identify practical challenges to this transition in our members’ workflow (thank you to those who took part, it was incredibly useful!).</p>
<p>One major takeaway from this consultation was around the pivotal role that peer review management systems played in the Open Funder Registry-ROR transition. We look forward to seeing more service providers integrating with ROR in the future. If you are a service provider and are ready to integrate with ROR, drop <a href="mailto:support@ror.org">support@ror.org</a> an email.</p>


<h2 id="including-ror-ids-in-crossref-metadata">Including ROR IDs in Crossref metadata 
</h2>
<p>If you are ready to begin including ROR IDs in your funding metadata, you only need to include the ROR itself to identify a funder.</p>
<p>For example:</p>
<pre tabindex="0"><code>&lt;fr:program name=&#34;fundref&#34;&gt;
     &lt;fr:assertion name=&#34;ror&#34;&gt;https://ror.org/00fq5cm18&lt;/fr:assertion&gt;
     &lt;fr:assertion name=&#34;award_number&#34;&gt;10.3030/725840&lt;/fr:assertion&gt;
&lt;/fr:program&gt;
</code></pre><p>Examples of more complex combinations of funding information are available in our <a href="https://www.crossref.org/documentation/funder-registry/funding-data-overview/">documentation</a>. This update has been made across all schema that support funding metadata.</p>
<p>Our grants schema has recently been updated to <a href="https://www.crossref.org/documentation/schema-library/grants-schema/">version 0.2.0</a> to support ROR IDs in place of funder identifiers as well. As with funding metadata, only the ROR ID needs to be supplied within the record:</p>
<pre tabindex="0"><code>&lt;funding amount=&#34;750&#34; currency=&#34;USD&#34; funding-percentage=&#34;75&#34; funding-type=&#34;APC&#34;&gt;
  &lt;ROR&gt;https://ror.org/02twcfp32https://ror.org/02twcfp32&lt;/ROR&gt;
  &lt;funding-scheme&gt;Sofa Lending Programme&lt;/funding-scheme&gt;
&lt;/funding&gt;
</code></pre><p>Although previously a funder name was collected with the funder identifier, for both grants records and funding data in an attempt to avoid redundant, incorrect or conflicting metadata, now we’re accepting an identifier only as the ROR ID has an existing metadata record. The organisation name exists within the record in the <a href="https://ror.org/search">ROR registry</a> and the ROR record is the authoritative source of the name.</p>


<h2 id="ror-ids-in-json-outputs">ROR IDs in JSON outputs 
</h2>
<p>We have an existing legacy practice of representing Open Funder Registry IDs as just a DOI, but ROR IDs are represented in the JSON outputs as a full URL with id-type “ROR”, for example:</p>


<h3 id="funding-metadata">Funding metadata 
</h3>
<pre tabindex="0"><code>
 &#34;funder&#34;: [
      {
        &#34;award&#34;: [
          &#34;10.3030/725840&#34;
        ],
        &#34;id&#34;: [
          {
            &#34;id&#34;: &#34;https://ror.org/02twcfp32&#34;,
            &#34;id-type&#34;: &#34;ROR&#34;,
            &#34;asserted-by&#34;: &#34;publisher&#34;
          }
        ]
      }
    ],
</code></pre>

<h3 id="grant-funder-information">Grant funder information 
</h3>
<pre tabindex="0"><code>
&#34;funding&#34;: [
              {
                &#34;type&#34;: &#34;infrastructure&#34;,
        &#34;award-amount&#34;: {
                &#34;amount&#34;: 750.0,
                 &#34;currency&#34;: &#34;USD&#34;,
                 &#34;percentage&#34;: 75
            },
                &#34;funder&#34;: {
                  &#34;id&#34;: [
                    {
                      &#34;id&#34;: &#34;https://ror.org/02twcfp32&#34;,
                      &#34;id-type&#34;: &#34;ROR&#34;,
                      &#34;asserted-by&#34;: &#34;publisher&#34;
                    }
              ]
            }
          }
        ]
      }
    ],
</code></pre><p>If you have any questions or feedback, get in touch with us at <a href="mailto:support@crossref.org">support@crossref.org</a>!</p>
<div class='callout grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions about integrating ROR IDs for funders into your system? Write <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Crossref members can now use ROR IDs to identify funders in any place where they currently use Funder IDs in their metadata.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Highlights from the 2025 ROR Annual Community Meeting</title><id>https://doi.org/10.71938/9f0g-9791</id><link rel="alternate" href="https://ror.org/blog/2025-02-11-ror-annual-meeting-highlights/"/><published>2025-02-19T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p><a href="https://flic.kr/p/9DTkax"><figure class="featured-figure"><img src="/img/kinderen-met-whelp-nationaal-archieef.png"
         alt="Nationaal Archief, Kinderen Met Welp / Children Holding Whelp, 2011, photo from Flickr."/>
</figure>
</a></p>
<p>It&rsquo;s been six years since the <a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/">initial launch of ROR</a>, and our <a href="https://ror.org/tags/annual-meeting/">Annual Community Meeting</a> has become a welcome tradition for the ROR community in every new year. It&rsquo;s a chance for us to reflect on the past year, look ahead to the coming year, and express our gratitude for the diverse and growing community of research and information professionals who use and support ROR. Last year, more than 300 people registered for one of the ROR Annual Community Meeting sessions, and this year, that number grew to <strong>more than 500 registrants from 58 different countries</strong>, making this our most well-attended and most international Annual Community Meeting to date! If you weren&rsquo;t one of those who registered or attended, here are some highlights.</p>


<h2 id="reflecting-on-2024-and-planning-for-2025">Reflecting on 2024 and planning for 2025 
</h2>
<p>ROR Director Maria Gould began this year&rsquo;s &ldquo;state of ROR&rdquo; <strong>Community Update</strong> session by reflecting on the core values of openness and community that led to the creation of ROR and that continue to guide ROR today. She then gave a general overview of how these values are realized in ROR&rsquo;s operations and governance.</p>
<p>ROR Product Lead Adam Buttrick and ROR Metadata Manager Riley Marsh reported on ROR data curation activities. In 2024, we added more than 4,000 new records to ROR and enhanced existing records with language tags and organization domains. Areas of focus for ROR&rsquo;s data in 2025 include improving regional coverage for countries such as Japan and Portugal, adding new external identifiers to ROR records, and improving coverage of funders, scholarly societies, and publishers.</p>
<p>ROR Technical Lead Liz Krznarich reminded us that ROR reached a major milestone in 2024 with the launch of version 2 of the ROR schema and API. Four major technical projects are planned for 2025: improving the ROR API&rsquo;s ability to match affiliation strings to ROR IDs, speeding up the overall response times of the API, implementing client identification, and sunsetting v1 of the ROR schema and API at the end of the year.</p>
<p>ROR Technical Community Manager Amanda French showcased some key ROR adoptions in 2024, including the <a href="https://ror.org/blog/2024-07-23-aps-adopts-ror/">American Physical Society</a> and <a href="https://ror.org/blog/2024-12-18-clarivate-integrates-ror/">Web of Science</a>, and mentioned that we&rsquo;re looking forward to seeing ROR incorporated into Atypon and OJS 3.5 in 2025 as well as seeing increased <a href="https://ror.org/blog/2024-08-06-using-ror-for-funder-identification/">use of ROR IDs to identify funders</a> as well as author affiliations.</p>
<p>During the session, participants told us about their recent and forthcoming work with ROR, as well, and we discussed interesting questions such as &ldquo;Which kinds of curation requests are the hardest to resolve?&rdquo;  We followed the Community Update with an informal, unrecorded drop-in session where we continued these and other discussions.</p>
<a class='btn btn-sm btn-round green' href='/events/2025-02-04-annual-meeting-community-update'>See Community Update slides and recording</a>


<h2 id="learning-how-different-countries-use-and-encourage-persistent-identifiers">Learning how different countries use and encourage persistent identifiers 
</h2>
<p>The <strong>National PID Policies and Practices</strong> panel included perspectives from the UK, the US, Czechia, Germany, Brazil, and Ireland. Rachael Kotarski, formerly with the British Library and now with the University of Chicago, mentioned that one key difference between the UK&rsquo;s work to form a national persistent identifier strategy and the US&rsquo;s nascent efforts to do the same is that the US is less focused on research assessment than the UK, so the key driving force in the US tends to be the minimum requirements by national funders for persistent identifier support in repositories.</p>
<p>Taras Hrendaš of the Office of the Government of the Czech Republic explained the methodology behind their annual evaluation of Czech research statistics, which they derive using a combination of data from their national CRIS system (IS VaVaI) and data from Web of Science. Institutional affiliation incoherence is a major problem for their data analysis, so the incorporation of ROR into the Web of Science will be a tremendous help for next year&rsquo;s analysis.</p>
<p>Among the insights uncovered by Germany&rsquo;s PID Network Deutschland, represented by Andreas Czerniak standing in for Steffi Genderjahn, is that the best-known PIDs are the DOI, ISBN, and ORCID, followed by GND, URN, and ROR. Andreas also announced the recent launch of the first version of Germany&rsquo;s PID Monitor. Exciting!</p>
<p>Washington Segundo of Brazil&rsquo;s IBICT gave an overview of six key components national research infrastructure in Brazil, including a national repository of theses and dissertations (BDTD), a portal to publication and data repositories (Oasisbr), the Brazilian current research information system (BrCris), an open repository and data lake that collects and processes FAIR data (Laguna), a directory of Brazilian electronic scientific journals (Miguilim), and a project to develop an open and non-centralized source of persistent identifiers (dARK). Despite technical, financial, and cultural challenges, IBICT is working to expand Open Science infrastructure in Brazil, disseminate the country&rsquo;s scientific production, and reduce dependence on commercial platforms.</p>
<p>Finally, Michelle Doran of Ireland&rsquo;s National Open Research Forum gave an overview of Ireland&rsquo;s recommendations and roadmap for an Irish National PID Strategy, published in the fall of 2024. Immediate recommended actions include engaging with additional senior leaders across all sectors and maintaining engagement with international PID initiatives, while lessons learned include recruiting lots of willing volunteers to help promote the work and getting buy-in from funding bodies.</p>
<a class='btn btn-sm btn-round green' href='/events/2025-02-05-annual-meeting-national-pids'>See National PID Policies and Practices slides and recording</a>


<h2 id="focusing-on-the-asia-pacific-region">Focusing on the Asia-Pacific region 
</h2>
<p>The session <strong>Successes and Opportunities for ROR in the Asia-Pacific Region</strong> was itself a definite success, and we plan to hold more APAC-focused sessions in APAC-friendly time slots in the future. Even though some clever folks in the Asia-Pacific region have already realized the value of ROR, there&rsquo;s clearly lots of room to increase awareness and adoption of ROR in the Asia-Pacific region..</p>
<p>First, ROR Technical Community Manager Amanda French gave a brief introduction to ROR. Next, Aaron Ballagh of The Lens (based in Australia) explained how ROR is used to identify organizations in The Lens so that users can find scholarly works from researchers affiliated with a particular organization, then demonstrated an &ldquo;institutional lineage&rdquo; search lets a user find all the scholarly works associated with an organization and the organization’s &ldquo;child&rdquo; organizations.</p>
<p>Brietta Pike of the publishing arm of Australia&rsquo;s National Science Agency, CSIRO Publishing, then outlined their &ldquo;journey to capture and deposit better metadata&rdquo; by taking steps toward ROR implementation as a &ldquo;source of truth&rdquo; for author affiliations. Scott Edmunds of GigaScience Press, based in Hong Kong, then recounted how the GigaByte Journal is &ldquo;focused on the Interoperable in FAIR&rdquo; to make their publications &ldquo;AI-ready,&rdquo; and completing the ROR integration for GigaScience&rsquo;s journals and database is a key part of that.</p>
<p>Finally, Masatsura Igami of NISTEP described why and how they incorporated ROR into the NISTEP Dictionary of Organization names in order to better link this Japanese resource with overseas users. The NISTEP Dictionary &ldquo;plays a central role in NISTEP&rsquo;s efforts to compile and publish basic data on the research activities of Japanese universities and public institutions.&rdquo; The presentation called for improved coverage of Japanese organizations in ROR (which is a priority for 2025) and concluded with a call for ROR to &ldquo;connect the efforts being made in each country as a network.&rdquo; A useful idea!</p>
<a class='btn btn-sm btn-round green' href='/events/2025-02-05-annual-meeting-ror-apac/'>See Successes and Opportunities for ROR in the Asia-Pacific Region slides and recording</a>


<h2 id="join-us-in-the-coming-months">Join us in the coming months 
</h2>
<p>ROR has plenty to keep us busy this year, and you surely do as well. Keep in touch with us by registering for ROR&rsquo;s upcoming <a href="/events">events</a>, by signing up for the <a href="http://eepurl.com/gjkT9H">quarterly ROR newsletter</a>, by joining the <a href="https://groups.google.com/a/ror.org/g/ror-community/">ROR Community Forum</a> to receive more frequent announcements and participate in discussions, or by following ROR on <a href="https://bsky.app/profile/researchorgs.bsky.social">Bluesky</a>, <a href="https://mastodon.social/@ResearchOrgs">Mastodon</a>, or <a href="https://www.linkedin.com/company/ror-research-organization-registry/">LinkedIn</a>. Here&rsquo;s hoping we all thrive in 2025.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:support@ror.org">support@ror.org</a> with any questions or comments.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Highlights of the 2025 ROR annual meeting sessions, including the Community Update, the panel on National PID Policies and Practices, and the session on Successes and Opportunities for ROR in the Asia-Pacific Region.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Faster Affiliation Matching at Scale</title><id>https://doi.org/10.71938/mtx2-vb16</id><link rel="alternate" href="https://ror.org/blog/2025-01-27-faster-affiliation-matching/"/><published>2025-01-27T09:14:38-05:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/lion-closeup-1200.jpg"
         alt="Closeup of male lion with mane"/>
</figure>

<p>OpenAlex has added a new metadata matching strategy co-developed by ROR and Crossref to its affiliation matching processes: ROR is also investigating the prospect of incorporating this new matching strategy into the ROR API in 2025.</p>
<hr>
<p>If you’ve been reading our <a href="/tags/matching/">recent series of blog posts about metadata matching</a>, you know that automatic metadata matching at scale is a topic dear to our hearts. Research metadata is full of unstructured and wildly variant text strings describing organizations, which makes connecting research outputs to research organizations unnecessarily difficult. Universities find it difficult to track what their affiliated researchers have produced; publishers find it difficult to associate author affiliations with subscribing institutions; and funders struggle to accurately track and assess the impact of their investments. Metadata matching can help alleviate these problems.</p>
<p>ROR and Crossref have been collaborating on extensive research into developing and evaluating new metadata matching strategies that have the potential to improve organization metadata throughout the scholarly communication ecosystem. One of the most promising of these new metadata matching strategies is one we refer to internally as “single search,” now being used by OpenAlex.</p>


<h2 id="what-is-single-search">What is “single search”? 
</h2>
<p><a href="https://gitlab.com/crossref/labs/marple/-/tree/main/strategies_available/affiliation_single_search">Single search</a> is a new affiliation matching strategy developed in collaboration between ROR and the Data Science team at <a href="https://crossref.org">Crossref</a> (which is currently <a href="https://www.crossref.org/jobs/2025-01-16-data-scientist">hiring</a>!). Whereas ROR’s <a href="https://ror.readme.io/v2/docs/api-affiliation">existing affiliation matching</a> makes use of multiple queries to a search index to return its results, the single search strategy is able to achieve approximately the same performance with just one query to said index. This allows the strategy to return matching results in a fraction of the time as the existing ROR affiliation matching while also using far fewer computing resources. The single search strategy is therefore particularly well suited for matching ROR IDs to affiliation strings in large datasets.</p>


<h2 id="partnering-with-openalex">Partnering with OpenAlex 
</h2>
<p>This past fall, we spoke with OpenAlex about our work on new strategies for affiliation matching, which resulted in a project to supplement their existing <a href="https://github.com/ourresearch/openalex-institution-parsing">classification-based affiliation matching</a> with the single search strategy. After testing against OpenAlex&rsquo;s benchmark datasets resulted in the same high performance as our internal evaluations, OpenAlex was able to move forward with a full-scale implementation of this strategy across their entire dataset.</p>
<p>This implementation yielded impressive results: matching against the 140 million affiliation strings in OpenAlex created affiliations for 7,700 unique institutions, generating new affiliations for 1.2 million strings. This translated to new affiliations for 2.2 million works and added 4.5 million new author affiliations. The number of ROR IDs represented in OpenAlex also increased significantly, from 93,000 to 100,600, or approximately 90% of the active records in ROR.</p>


<h2 id="next-steps">Next steps 
</h2>
<p>ROR is currently looking into how to bring the single search strategy to the ROR API in 2025, and Crossref will publish a dataset of all unique affiliation strings in the Crossref database matched to ROR IDs using the single search method, as well as a set of relationships between works, contributors, and organisations discovered through affiliation matching.</p>
<p>Our special thanks to OpenAlex’s Jason Priem and Justin Barrett for this collaboration. Stay tuned to all of our <a href="/community/#get-involved">various communication channels</a> and <a href="/events">events</a> for future updates on our work on metadata matching!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">OpenAlex has added a new metadata matching strategy co-developed by ROR and Crossref to its affiliation matching processes: ROR is also investigating the prospect of incorporating this new matching strategy into the ROR API in 2025.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Metadata Matching: Beyond Correctness</title><id>https://doi.org/10.71938/jwav-tb32</id><link rel="alternate" href="https://ror.org/blog/2025-01-09-metadata-matching-beyond-correctness/"/><link rel="related" href="https://doi.org/10.13003/axeer1ee"/><published>2025-01-09T11:37:34-05:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Dominika Tkaczyk</name><uri>https://orcid.org/0000-0001-5055-7876</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>The fifth and final blog post about metadata matching by ROR’s Adam Buttrick and Crossref’s Dominika Tkaczyk outlines a set of pragmatic criteria for making decisions about metadata matching. Read all posts in the <a href="/tags/matching">series on metadata matching</a>.</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/matching-correctness/matching-strategy-criteria.png"
         alt="Six criteria for making decisions about metadata matching: Openness, Explainability, Complexity, Flexibility, Resources, and Speed."/>
</figure>

<p>In our <a href="/blog/2024-11-06-how-good-is-your-matching/">previous entry</a> in this <a href="/tags/matching">series</a>, we explained that thorough evaluation is key to understanding a matching strategy&rsquo;s performance. While evaluation is what allows us to assess the correctness of matching, choosing the best matching strategy is, unfortunately, not as simple as selecting the one that yields the best matches. Instead, these decisions usually depend on weighing multiple factors based on your particular circumstances. This is true not only for metadata matching, but for <a href="https://www.wired.com/2012/04/netflix-prize-costs/">many technical choices</a> that require navigating trade-offs. In this blog post, the last one in the metadata matching series, we outline a subjective set of criteria we would recommend you consider when making decisions about matching.</p>


<h2 id="openness">Openness 
</h2>
<p>Matching tools come in many different shapes and sizes: web applications, APIs, command-line tools, sometimes even <a href="https://adambuttrick.github.io/mysterious-crystal-ball-matching/">enchanted crystal balls showing matched identifiers emerging from a mysterious mist</a>! No matter what form they take, an important consideration is whether the source code and all the related resources for the matching are openly available.</p>
<figure class="blog-figure"><img src="/img/blog/matching-correctness/enchanted-crystal-ball-matching.png"
         alt="Picture of a pixelated crystal ball labeled Enchanted Crystal Ball Matching with a field for entering an affiliation below it."/><figcaption>
            <p>Picture of the <a href='https://adambuttrick.github.io/mysterious-crystal-ball-matching/'>Enchanted Crystal Ball Matching</a> interface to the affiliation endpoint of the ROR API. Source code openly available at <a href="https://github.com/adambuttrick/mysterious-crystal-ball-matching">https://github.com/adambuttrick/mysterious-crystal-ball-matching</a>.</p>
        </figcaption>
</figure>

<p>Matching strategies that are either closed-source, or rely on closed-source services for their matching logic, make it difficult to fully understand and explain matching processes. This lack of transparency also makes it impossible to adjust or improve the matching logic, since we cannot understand or improve code we cannot see.</p>
<p>Users are similarly impeded from identifying flaws or suggesting improvements to processes they are unable to examine. By blocking this community participation, we also lose the proven cycle of real-world testing, refinement, and validation that has strengthened myriad of open source projects. The cumulative impact of both minor and major community-driven refinements over time is incredibly valuable and should not be underestimated.</p>
<p>Using open source matching will also help build trust in the matching workflows and results. This is one reason why open source is one of the tenets of the <a href="https://openscholarlyinfrastructure.org">Principles of Open Scholarly Infrastructure</a>, adopted by Crossref, DataCite, ROR, and other organizations who build and maintain open scholarly infrastructure.</p>
<p>When evaluating matching strategies, we strongly recommend prioritizing those that are fully open source. This not only ensures their transparency and trustworthiness, but also allows for the kind of continuous improvement that results from this visibility and community engagement.</p>


<h2 id="explainability">Explainability 
</h2>
<p>In terms of our ability to understand and improve a matching strategy, using an open source model is only the first step. What typically matters most in the context of building and maintaining matching services is that we are able to understand their underlying code and have a clear model of how matches are derived from their corresponding inputs. Even if the matching code itself and all of the resources used in the matching are open, if they are poorly documented, lack reproducibility or tests, or are otherwise opaque, there is no guarantee that it will be possible to understand or improve the strategy. Striving for a high level of interpretability in our matching plays a determinative role in how well we can understand and modify our strategies in the future.</p>
<p>Being able to explain the behaviour of the matching will also help you to respond to and incorporate user feedback. When users encounter errors, you will be able to do things like advise them on how to modify or clean their inputs so that the results are better. Conversely, examining the behaviour of the strategy relative to user inputs and feedback can provide you with ideas for improving the matching.</p>
<p>Typically, heuristic-based strategies, such as those that use forms of search or string similarity measures, like <a href="https://en.wikipedia.org/wiki/Edit_distance">edit distance</a>, are easier to explain than, say, machine learning models. If a strategy uses machine learning, at least some internal decisions might be made by passing data through a complex network of algebraic equations. Those can be mysterious, non-deterministic, and are famous for being <a href="https://xkcd.com/1838/">hard to interpret</a>.</p>
<figure class="blog-figure"><img src="https://imgs.xkcd.com/comics/machine_learning.png"
         alt="Image of two stick figures, one standing on a messy pile, with dialogue: This is your machine learning system? Yup! You pour the data into this big pile of linear algebra, then collect the answers on the other side. What if the answers are wrong? Just stir the pile until they start looking right."/><figcaption>
            <p>Randall Munroe, Machine Learning, xkcd <a href="https://xkcd.com/1838/">https://xkcd.com/1838/</a></p>
        </figcaption>
</figure>

<p>This doesn&rsquo;t mean they should be avoided entirely &ndash; we have built and use many machine-learning based tools ourselves! Instead, it is a good idea to weigh how their inherent lack of explainability could affect your ability to continue work on the strategy and respond to user needs, relative to all the available options.</p>


<h2 id="complexity">Complexity 
</h2>
<p>Complexity is another aspect that can greatly affect how easy it is to maintain the strategy. Complexity is related to how many different components the strategy has and how difficult they are to use and maintain. When a strategy has multiple interconnected parts, each component becomes a potential failure point that requires discrete assessment and maintenance.</p>
<p>Consider, for example, two different approaches to a matching strategy: one that uses a single machine learning model versus another that uses an ensemble of models. A single model requires maintaining one set of training data, a single training pipeline, and one deployment process. If the model&rsquo;s performance unexpectedly deteriorates, whether because of an issue with the training data, a configuration error, or the need for additional input sanitization, the source of the problem is easier to isolate and fix.</p>
<p>The ensemble, by contrast, combines multiple, specialized models, each requiring its own training data, tests, updates, and deployments. If one model in the ensemble is found to reduce the performance of the strategy, the interdependence between models can cause this degradation to cascade through the entire system and undermine its overall reliability. Correcting for these errors becomes more challenging. If fixing one model&rsquo;s performance requires retraining or adjusting its outputs, this could require recalibrating the entire ensemble to maintain the balance between models, identify regressions, and prevent new errors from emerging.</p>
<p>In general, preferring simpler strategies not only reduces operational overhead, but also makes it easier to diagnose issues, test changes, and iterate on user feedback. When problems arise, having fewer moving parts means less places to look for the root cause and fewer components that could be affected by any fixes.</p>


<h2 id="flexibility">Flexibility 
</h2>
<p>The metadata to which we match grows and changes over time. New records are created, existing ones are updated, with schemas changing and evolving alongside. The resources that underlie our matching are also not static. The libraries we depend on may deprecate features between versions or the taxonomies we used to categorize results might undergo significant revisions. We thus rarely have the luxury of deploying a matching strategy once and using it forever without any changes. A good strategy has to be flexible enough to adapt to such changes, with this adaptation also being both technically feasible and practical to implement.</p>
<p>Much of this flexibility is also determined by a matching strategy&rsquo;s ability to incorporate new data. Strategies that use continuously updated databases or indices can immediately match against new metadata as it appears in the system. By contrast, some machine learning-based approaches require training on target matches and can thus be limited in flexibility and face more constraints. While some models can be incrementally updated to recognize new matches, others require retraining from scratch to incorporate these changes - a process that can be both time-consuming and resource-intensive.</p>
<p>Paying close attention to a strategy&rsquo;s flexibility and favoring this aspect, when possible, can significantly impact its long-term viability. When comparing different matching strategies, flexibility should thus be a primary concern in your decision-making process.</p>


<h2 id="resources">Resources 
</h2>
<p>Matching strategies can vary significantly in their resource requirements, including things like CPU and GPU utilization, memory consumption, storage capacity, and network bandwidth. These requirements are directly related to infrastructure costs and energy consumption, so when evaluating a matching strategy, it is necessary to assess its resource demands across all phases of the matching lifecycle. This includes things like initial model training, re-training, index construction, updates and management for all aspects of the strategy, as well as the real-world processing of matching requests. It is a good idea to measure and monitor resource usage carefully in considering which strategies to use, as the best performing strategy may also be too resource intensive to run as a service or might grow to this state over time with additional utilization.</p>


<h2 id="speed">Speed 
</h2>
<p>Matching strategies can operate at a wide range of speeds, from milliseconds to minutes per match. Since the overall response time of a strategy can affect both system scalability and user experience, we should always assess the strategy&rsquo;s performance for different usage scenarios and scales of data. While some strategies might perform adequately with small datasets, they can also exhibit exponential slowdowns as data volume and complexity increases or as concurrent requests grow in number. We should therefore consider carefully how requirements for matching speed might evolve with increased usage, data complexity, and total anticipated growth. The fastest matching strategy might not always be the best choice if it comes at the cost of reduced accuracy or requires large amounts of resources, but unacceptable latency can make an otherwise excellent strategy unusable in practice for many use cases.</p>


<h2 id="putting-it-all-together">Putting it all together 
</h2>
<p>The typical life cycle of developing a metadata matching strategy is as follows:</p>
<ol>
<li><strong>Scoping</strong>: we define the matching task, along with its inputs and outputs.</li>
<li><strong>Research</strong>: we research what existing strategies are available for our task and/or we develop our own.</li>
<li><strong>Evaluation</strong>: we evaluate all available strategies, internally or externally-developed, exploring all of the aspects described above.</li>
<li><strong>Decision</strong>: we choose which strategy (if any) we want to use in our production system.</li>
<li><strong>Production setup</strong>: we prepare the production models, indexes, and other resources needed for the matching.</li>
<li><strong>Maintenance</strong>: we monitor and adapt the strategy relative to changing data, user feedback, and new resource requirements.</li>
</ol>
<p>In practice, these phases do not happen all at once, nor in this strict order. Often we need to proceed through multiple iterations of them to arrive at the best strategy. For example, if initial evaluation of a strategy yields poor results, we might return to the research phase to investigate other strategies or refine our understanding of the task. Often, during the maintenance phase, we receive feedback from users that indicates potential areas of improvement and then pursue them with a new round of research and evaluation.</p>
<p>As we cycle through these phases, ideally all the aspects described in this entry, along with the results of the evaluation, would be taken into account. Of course, this means that these decisions have to be based on multiple criteria and by making trade-offs between their performance and all other considerations. In making these complex and difficult choices, it is useful to consider two primary questions:</p>
<ol>
<li>Are any of the considered matching strategies good enough for our use case?</li>
<li>Out of all the considered strategies that are sufficient for our use case, which would be the best?</li>
</ol>
<p>The first question requires us to create clear and quantifiable criteria that allow for eliminating some of the potential strategies. As we have indicated, these could include things like the strategy being open source, minimum performance baselines using measures like precision or recall, and operational thresholds, like the strategy being able to return results quickly, relative to user expectations or the volume of data to be processed. It should be fairly easy to test these requirements and eliminate any strategies that fall short of them. If the strategies are difficult to assess, that is likely a mark against them.</p>
<p>If no strategies meet these criteria, we have two options: either to abandon matching entirely or to reassess and relax our criteria to align with the available options. While the former is always an option, adopting a more pragmatic lens, framing in terms of potential value (or harm) to the users, might be beneficial. Sometimes we approach matching tasks with too high expectations and a dose of realism helps us to re-center our perspectives. After more consideration, you might decide that your criteria were too stringent or realize that you need to better define and decompose the tasks to fit the available options.</p>
<p>When multiple strategies appear viable, the selection process becomes more nuanced. When evaluating strategies across these various dimensions, we should try to avoid placing undue weight on minor performance differences. Evaluation metrics are useful estimates of performance, but do not always translate to real-world applications and changing data. In cases where a more complex strategy offers only marginal improvements over a simpler alternative, the maintenance and operational benefits of the simpler solution often outweigh small performance gains.</p>
<p>This concludes our <a href="/tags/matching">series on metadata matching</a>, where we described the conceptual, product, and technical aspects of matching and its applications. We hope this overview was instructive and helps you to make better decisions about the use of matching in your own tools and services!</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:adam@ror.org">adam@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The fifth and final blog post about metadata matching by ROR’s Adam Buttrick and Crossref’s Dominika Tkaczyk outlines a set of pragmatic criteria for making decisions about metadata matching.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Clarivate Integrates ROR</title><id>https://doi.org/10.71938/atnp-me51</id><link rel="alternate" href="https://ror.org/blog/2024-12-18-clarivate-integrates-ror/"/><published>2024-12-18T13:49:54-05:00</published><updated>2025-10-22T06:05:03-04:00</updated><author><name>Clarivate</name><uri>https://ror.org/04fce1c40</uri></author><author><name>ROR Core Team</name><uri>https://ror.org/about/team/#core-team</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/clarivate/Clarivate_Logo.png"
         alt="Clarivate logo"/>
</figure>

<p>We are pleased to announce that <a href="https://clarivate.com/academia-government/scientific-and-academic-research/research-discovery-and-referencing/web-of-science/web-of-science-core-collection/">Web of Science Core Collection™</a> now includes Research Organization Registry (ROR) identifiers. Users of the Web of Science™ can search for publications associated with an institution by using a ROR ID, and ROR IDs are available in organization profiles. In addition, <a href="https://developer.clarivate.com/apis/wos">Web of Science API Expanded</a> now also exposes ROR IDs in query responses, making it easier for automated tools to retrieve standardized organization information and analyze research by institution.</p>
<p>&ldquo;ROR is a trusted source of institutional identifiers and metadata that is being widely adopted across the scholarly communications landscape,&rdquo; said ROR Director Maria Gould. &ldquo;ROR&rsquo;s open and carefully curated dataset and our robust and easy-to-use tools make it ideally suited for a range of implementations aimed at normalizing institutional data, and we&rsquo;re pleased that ROR is being integrated into discovery services like the Web of Science to enrich institutional models and support key user needs.&rdquo;</p>
<p>Tilla Edmunds, Director of Web of Science Content Management at Clarivate, said “We are pleased to integrate ROR IDs into the Web of Science Core Collection. This advancement highlights our commitment to strong partnerships with the academic community, working together to support and progress scholarly communication globally.”</p>
<p><a href="https://clarivate.com">Clarivate™</a> has long supported ROR as a community-led solution to the pressing challenge in scholarly publishing of variant affiliation names for the same organizations. The new ROR features in the Web of Science, developed in response to customer feedback, are only the first phase of ROR adoption for Clarivate. Now that more and more academic publishers are beginning to use ROR, Web of Science plans in the future to collect ROR IDs for researchers&rsquo; affiliations directly from publisher metadata, thus lessening and perhaps one day even eliminating the problem of nonstandard research affiliation information.</p>
<div class='callout orange'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-bullhorn'></i></div><div class="col-md-10"><span>Are you planning to integrate ROR? <a href="https://ror.readme.io/v2/docs/">Learn how.</a></span></div>
	</div>
</div>
<p>Learn more about this ROR integration from the <a href="https://clarivate.com/academia-government/release-notes/web-of-science/web-of-science-november-7-2024-release-notes/">Web of Science release notes</a> and a presentation given at the September 2024 ROR Community Call by Tilla Edmunds, Director of Web of Science Content Management at Clarivate.</p>
<hr>

<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/_z1lcsd_osc" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="ROR in Web of Science presentation at ROR Community Call September 2024"></iframe>
</div>

<hr>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Contact <a href="mailto:support@ror.org">support@ror.org</a> or <a href="mailto:newsroom@clarivate.com">newsroom@clarivate.com</a> with questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Web of Science Core Collection™ from Clarivate™ now includes Research Organization Registry (ROR) identifiers for the benefit of the global research community.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR in 2024: Year in Review</title><id>https://doi.org/10.71938/h7r4-fd19</id><link rel="alternate" href="https://ror.org/blog/2024-12-17-year-in-review/"/><published>2024-12-17T14:44:18-05:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>ROR Core Team</name><uri>https://ror.org/about/team/#core-team</uri></author><category term="Year in Review"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/review2024/Camouflage_Unsplash.jpg"
         alt="Lisa H lh_photography, CC0, via Wikimedia Commons and Unsplash"/>
</figure>

<p>As 2024 comes to a close, we&rsquo;re reflecting on where ROR is now, and the answer is unmistakable: ROR is just about everywhere. We continue to be thrilled by the extent to which ROR is recognized as the obvious choice for identifying research and funding organizations, as evidenced by a wide range of new ROR integrations, by our extremely busy curation queue with over 8,000 requests this year to add new organizations and update existing ROR records, by the bustling traffic of 14 million monthly requests to the ROR API, by the fact that the number of downloads of <a href="https://doi.org/10.5281/zenodo.6347574">our open dataset</a> has nearly tripled in the past year, by the honor of being named a finalist for the <a href="https://blog.alpsp.org/2024/08/spotlight-on-research-organization.html">2024 ALPSP Innovation in Publishing Award</a>, and by continual additions of enthusiastic ROR supporters to our very active and engaged community.</p>
<p>Here are some highlights from ROR&rsquo;s banner year in 2024.</p>


<h2 id="rors-schema-and-api-evolve-with-the-release-of-v2">ROR&rsquo;s schema and API evolve with the release of v2 
</h2>
<p>April of 2024 marked a major milestone with the <a href="/blog/2024-04-15-announcing-ror-v2/">release of ROR API and Schema 2.0</a> &ndash; an official departure from the schema ROR initially inherited from GRID. Version 2 added features such as created and last-modified dates on all records, a place to store language codes for all names, and the addition of a <code>domains</code> field. All of these changes were shaped by extensive community feedback, and we’re pleased that more than 40% of ROR API users have already switched to version 2.</p>
<figure><img src="/img/blog/review2024/ror-api-stats-by-version.png"
         alt="ROR v2 now accounts for nearly 6 million requests to the ROR API in the course of a month."/><figcaption>
            <p>ROR v2 now accounts for nearly 6 million requests to the ROR API in the course of a month.</p>
        </figcaption>
</figure>

<p>After community feedback indicated the need for additional location information, the ROR team implemented a minor, non-breaking release of the ROR schema in December of 2024. <a href="https://ror.readme.io/changelog/2024-12-12-schema-v2-1">Schema 2.1</a> makes location information in ROR more granular and adds continent names and codes to the location information available.</p>


<h2 id="publishers-repositories-and-knowledge-graphs-adopt-ror">Publishers, repositories, and knowledge graphs adopt ROR 
</h2>
<p>ROR adoption continues to increase among <a href="https://bit.ly/ror-integrations">many systems – and many <em>kinds</em> of systems</a>. We began the year with <a href="/blog/2024-02-20-ror-fifth-anniversary/">Springer Nature&rsquo;s January account of their internal use of ROR</a> at the ROR Annual Community Meeting, and then at mid-year came the announcement of the <a href="/blog/2024-07-23-aps-adopts-ror/">American Physical Society&rsquo;s adoption of ROR</a> both internally and as an affiliation identifier in Crossref DOI metadata. Drawing publishers&rsquo; attention to the strategic importance of ROR in open scholarly metadata, <a href="/blog/2024-07-25-re-introducing-participation-reports/">Crossref and CWTS Leiden worked together to add metrics for ROR IDs and author affiliations to Crossref&rsquo;s Participation Reports</a>.</p>
<figure><img src="/img/blog/review2024/crossref-prep-elife.png"
         alt="Crossref&#39;s Participation Reports allow anyone to track the completeness of a Crossref member&#39;s metadata, now including what percentage of that member&#39;s items have ROR IDs."/><figcaption>
            <p>Crossref&rsquo;s Participation Reports allow anyone to track the completeness of a Crossref member&rsquo;s metadata, now including what percentage of that member&rsquo;s items have ROR IDs.</p>
        </figcaption>
</figure>

<p>ROR has already seen tremendous uptake among repository systems, and this year thanks to <a href="https://4science.com">4Science</a>, both DSpace 8.0 and DSpace-CRIS now support ROR IDs for both author affiliations and funders.</p>
<p>Scholarly Knowledge Graphs (SKGs), too, showed their awareness of ROR&rsquo;s strategic value in 2024. <a href="https://openalex.org">OpenAlex</a> continues to be a key ROR adopter, and 2024 saw the launch of a new tool for community-based correction of OpenAlex&rsquo;s automatic matches of text affiliation strings to ROR IDs: the <a href="https://works-magnet.esr.gouv.fr/">Works Magnet</a>, developed in conjunction with France’s Ministère de l’Enseignement Supérieur et de la Recherche (MESR). Finally, <a href="https://clarivate.com">Clarivate™</a> gave the ROR Community a special preview of their planned integration of ROR into the Web of Science at the <a href="/events/2024-09-26-ror-community-call/">ROR Community Call in September</a> and <a href="https://clarivate.com/academia-government/release-notes/web-of-science/web-of-science-november-7-2024-release-notes/">launched the integration</a> only a few weeks later.</p>
<p>Too many other systems incorporated ROR this year to list here, but we want to emphasize our appreciation for every new ROR integration as well as the improvements to existing ROR integrations that we&rsquo;re beginning to see from many early adopters of ROR, <a href="https://info.orcid.org/orcid-renews-support-of-ror-to-maximize-the-quality-of-organization-information-for-researchers/">including ORCID</a>. We&rsquo;re always happy to feature ROR users <a href="https://bit.ly/ror-integrations">on our integrations list</a>, in <a href="/categories/case-studies">case studies on our blog</a>, and as presenters at our bi-monthly <a href="/events">community calls</a>.</p>


<h2 id="use-of-ror-as-a-funder-identifier-continues-to-grow">Use of ROR as a funder identifier continues to grow 
</h2>
<p>ROR has been increasing its coverage of funders by reconciling ROR with the Open Funder Registry since well before the announcement in late 2023 that <a href="/blog/2023-09-07-open-funder-registry-transition-ror-cross-post/">the Crossref Open Funder Registry will eventually be deprecated</a> in favor of ROR. With the release of v2.0 in April, ROR added a &ldquo;funder&rdquo; item type to indicate which ROR records are mapped to the Open Funder Registry to assist with this use case. We&rsquo;ve also published a <a href="https://ror.readme.io/v2/docs/funder-registry">guide to help users transition from Open Funder Registry to ROR</a>.</p>
<p>Beginning in late 2023 and throughout 2024, we&rsquo;ve seen many widely used scholarly systems begin to use ROR IDs to identify and disambiguate funders. <a href="https://chorusaccess.org">CHORUS Dashboard &amp; Reporting Services</a>, <a href="https://oa.report">OA.Report</a>, <a href="https://chronoshub.io">ChronosHub</a>, and <a href="https://openalex.org">OpenAlex</a> are all among the systems that either use ROR exclusively for this purpose or else map other funder identifiers to ROR IDs.</p>
<p>Repository systems have been particularly quick to adopt ROR for funder identification, and <a href="https://api.datacite.org/dois?query=fundingReferences.funderIdentifierType:ROR&amp;page%5Bsize%5D=0">nearly 450,000 DataCite DOI records</a> registered by these systems currently include ROR IDs for funders. See our blog post <a href="/blog/2024-08-06-using-ror-for-funder-identification/">Using ROR for Funder Identification</a> for more details about how repository systems such as Dryad, Zenodo, and Figshare use ROR for the funder identification use case.</p>
<figure><img src="/img/blog/use-case-funders/ror-inveniordm.png"
         alt="Documentation of the ROR-based funder model and award lookup in InvenioRDM, the open source system that runs Zenodo"/><figcaption>
            <p>Documentation of the ROR-based funder model and award lookup in InvenioRDM, the open source system that runs Zenodo.</p>
        </figcaption>
</figure>



<h2 id="ror-expands-global-coverage-of-research-organizations-and-scales-curation-capacity">ROR expands global coverage of research organizations and scales curation capacity 
</h2>
<p>We pride ourselves at ROR on providing a registry with truly global coverage that takes an international perspective on research organizations and information systems. In 2024, we undertook several initiatives aimed at improving regional coverage of organizations, while at the same time experiencing organic growth from Mexico, France, and Indonesia. We also called for community feedback on <a href="https://bit.ly/ror-proposal-external-ids-draft">a proposal to add additional external identifier mappings</a> to ROR records, including regional identifiers such as the European PIC code, which many ROR users have said they would find useful. In 2025, we plan to complete our evaluation of potential new external identifiers and then add them to ROR records.</p>
<p>More generally, we have processed over 8,000 curation requests in 2024, a 44% increase over 2023, adding a total of 4,098 new records to the registry this year and updating 106,944 records. Recognizing the importance of scaling our capacity to respond to curation requests quickly while continuing to produce accurate and complete metadata, this year we not only revamped and improved our curation tools and codebase but have also added a new team member: <a href="/blog/2024-09-03-welcome-riley-marsh/">Riley Marsh, Metadata Manager</a>.</p>
<figure><img src="/img/blog/review2024/ror-curation-tracker.png"
         alt="The very busy ROR Curation Tracker shows just a few of the requests we receive and process every day."/><figcaption>
            <p>The very busy ROR Curation Tracker at <a href="https://github.com/ror-community/ror-updates/issues">https://github.com/ror-community/ror-updates/issues</a> shows just a few of the requests we receive and process every day.</p>
        </figcaption>
</figure>

<p>With the release of v2, ROR records now also include a place to store a language code in all name fields and a new field for an organization&rsquo;s top-level domains (e.g., <code>madeira.gov.pt</code> instead of <code>https://www.madeira.gov.pt</code>), and in 2024 we carefully populated these new, empty elements with accurate values for thousands of records, further enriching ROR&rsquo;s metadata and making it of even greater value to the global research information community.</p>


<h2 id="rors-community-activities-highlight-broad-engagement">ROR&rsquo;s community activities highlight broad engagement 
</h2>
<p>ROR&rsquo;s community activities in 2024 included publishing pieces on various topics of interest. For instance, ROR and Crossref collaborated on a <a href="/tags/matching/">blog post series explaining various aspects of automatic metadata matching</a> that proved very popular and has even been <a href="https://punktokomo.abes.fr/2024/12/04/anatomie-des-alignements-a-labes-ou-metaphore-des-chaussettes-episode-3-3/">adapted into French</a> by Carole Melzac of ABES. Merci, Carole! We also published additional <a href="/categories/case-studies">case studies</a> with ROR users and a <a href="/blog/2024-11-26-rrid-ror-facilities/">guide to RRID and ROR for facilities</a>.</p>
<p>ROR also participated in and co-organized several global <a href="https://ror.org/events">events</a> both in person and online, notably a webinar co-organized with 4Science on <a href="/events/2024-10-03-ror-in-dspace/">New Features for Organization Information in DSpace with ROR</a>, a session at PIDFest on <a href="https://repozitar.techlib.cz/entities/publication/5c93a23a-61c9-403a-9635-cf9a442747fd">Curation and Community: Setting Metadata Policies at ROR</a>, and an ACRL Choice / LibTech Insights webinar in collaboration with OpenAlex on <a href="/events/2024-11-07-open-affiliation-metadata/">Open Affiliation Metadata: How Recent Innovations Help Libraries and Librarians</a>.</p>
<p>And, of course, as always, we held our regular bi-monthly online <a href="https://www.youtube.com/watch?v=z9Onl3dcoxs&amp;list=PL4n_Cvd0PpoEBWyZMiwb2ImVQCy4Zpchk">Community Calls</a> featuring demos and presentations by adopters as well as engaging discussion about future directions. If you&rsquo;re making a New Year&rsquo;s resolution to get more involved with ROR in 2025, we encourage you to <a href="https://groups.google.com/a/ror.org/g/ror-community">join the ROR Community Forum</a> (now 376 members strong!) to receive notifications of upcoming Community Calls as well as other news and event announcements. Save the dates, too, for ROR&rsquo;s Annual Community Meeting on February 4th and 5th, 2025 – <a href="https://ror.org/events">registration is open now</a> for four terrific sessions over the course of two days.</p>
<p>We give our heartfelt thanks to all of you who have supported ROR this year by sharing your thoughts, asking us questions, doing the work at your own organizations, making financial contributions, and in a hundred other ways demonstrating that you understand the value of ROR not just for your own benefit, but for the benefit of the entire research information ecosystem. We look forward to collaborating with you in 2025.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:info@ror.org">info@ror.org</a> with any comments or questions.</span></div>
	</div>
</div>
<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-comment'></i></div><div class="col-md-10"><span>Get involved by joining the <a href="https://groups.google.com/a/ror.org/g/ror-community">ROR Community Forum</a>.</span></div>
	</div>
</div>
<div class='callout mustard'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-calendar-days'></i></div><div class="col-md-10"><span>Register for the <a href="https://ror.org/events">ROR Annual Community Meeting</a> Feb 4-5, 2025.</span></div>
	</div>
</div>
<!-- Commonly used content 

<!-- <figure class="featured-figure"><img src="/img/blog/"/>
</figure>
 -->
<!-- <div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span></span></div>
	</div>
</div>
-->
<!-- <figure class="blog-figure"><img src="/img/blog/"/>
</figure>
 -->
<!-- 
<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/XXX" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="YouTube Video"></iframe>
</div>
 -->
<!-- <div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write us at <a href="mailto:info@ror.org">info@ror.org</a> with any comments or questions.</span></div>
	</div>
</div> -->
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In 2024, ROR processed over 8000 curation requests, handled 14 million monthly requests to the ROR API, saw the number of downloads of the ROR dataset triple, and was a finalist for the ALPSP Innovation in Publishing Award: read on for more highlights from a banner year.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Understanding RRID and ROR for Facilities</title><id>https://doi.org/10.71938/r7ar-eq11</id><link rel="alternate" href="https://ror.org/blog/2024-11-26-rrid-ror-facilities/"/><link rel="related" href="https://www.rrids.org/news/2024/11/26/understanding-rrid-and-ror-for-facilities"/><published>2024-11-26T14:51:10-05:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Anita Bandrowski</name><uri>https://orcid.org/0000-0002-5497-0243</uri></author><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/rrid-ror/iconmonstr-flask-9-240.png"
         alt="Icon Monster flask"/>
</figure>

<p>This blog post explores the difference between &ldquo;core facilities&rdquo; in RRID and &ldquo;facilities&rdquo; in ROR and provides guidance for those who run facilities on how to effectively use these identifiers.</p>


<h2 id="what-is-rrid-and-what-is-its-scope">What is RRID and what is its scope? 
</h2>
<p>RRIDs (Research Resource Identifiers) help identify a wide variety of resources which are <strong>inputs to experiments</strong>, especially biomedical experiments. Examples of resources identified by RRID:</p>
<ul>
<li>Antibodies</li>
<li>BioSamples</li>
<li>Cell lines</li>
<li><strong>Core facilities</strong></li>
<li>Databases</li>
<li>Instruments (capital equipment)</li>
<li>Organisms</li>
<li>Plasmids</li>
<li>Reagents</li>
<li>Software</li>
</ul>
<p>RRIDs started in 2014 as an agreement between 25 journal editors to improve how research resources, especially antibodies, are cited in the scientific literature. The infrastructure has been supported in the FAIR Data Informatics Lab at the University of California at San Diego and SciCrunch Inc, a company devoted to improving scientific literature. RRIDs.org has recently become a stand-alone not-for-profit entity, enabling sustainability. The RRID registry at <a href="https://rrid.site/data/source/nlx_144509-1/search">scicrunch.org</a> currently includes nearly 25,000 records, including around 3,000 facilities.</p>


<h2 id="what-is-ror-and-what-is-its-scope">What is ROR and what is its scope? 
</h2>
<p>ROR (Research Organization Registry) IDs help identify <strong>research organizations</strong>, defined as &ldquo;any organization that is involved in research,&rdquo; including organizations that produce, fund, facilitate, manage, and publish research as well as organizations that educate or employ researchers. Examples of research organizations identified by ROR:</p>
<ul>
<li>Archives</li>
<li>Colleges and universities</li>
<li>Companies that fund or conduct research</li>
<li>Government agencies and units that fund or conduct research</li>
<li>Hospitals and healthcare centers</li>
<li>Laboratories</li>
<li>Nonprofits and non-governmental organizations that fund or conduct research</li>
<li>Private foundations that fund research</li>
<li><strong>Research facilities</strong></li>
<li>Research institutes</li>
<li>Research libraries</li>
<li>Scholarly publishers</li>
<li>Zoos</li>
</ul>
<p><a href="https://ror.org/about">ROR is an initiative</a> jointly supported and managed by Crossref, DataCite, and the California Digital Library that makes it easy to disambiguate institution names and connect research organizations to researchers and research outputs. ROR was launched in 2019 after three years of consultations with working groups and stakeholders, developed to solve the problem of identifying organizations in an open and community-driven way. The <a href="https://ror.org/search">ROR registry</a> currently includes nearly 110,000 active records and is currently <a href="https://bit.ly/ror-integrations">widely adopted by many essential scholarly systems</a>.</p>


<h2 id="definitions-of-facilities-for-rrid-and-ror">Definitions of facilities for RRID and ROR 
</h2>
<p>As you can see above, RRID includes a resource type for &ldquo;core facilities,&rdquo; and ROR <a href="https://ror.readme.io/v2/docs/ror-data-structure#types">includes an organization type &ldquo;facility.&rdquo;</a> Despite this apparent similarity, the definitions for &ldquo;core facilities&rdquo; in RRID and &ldquo;facilities&rdquo; in ROR differ quite a bit, and indeed there is very little overlap between  core facilities in RRID and facilities in ROR. Let’s look now at how RRID and ROR define &ldquo;core facility&rdquo; and “facility” and at some examples of each.</p>


<h3 id="what-are-core-facilities-in-rrid">What are “core facilities” in RRID? 
</h3>
<p>Core facilities are centralized resources within universities that offer access to instruments, technologies, and expert services. These semi-autonomous units are essential for scientific and clinical investigators, enabling them to conduct cutting-edge research without the need for individual investment in expensive equipment or staff.</p>
<p>Core facilities have joined forces and created a society called the <a href="https://abrf.org/">Association of Biomolecular Resource Facilities (ABRF)</a>, which began as a society for US biomedical facilities but now encompasses international facilities from many disciplines. The <a href="https://coremarketplace.org/">ABRF Core Marketplace</a> is the listing of member Core Facilities that is primarily used to advertise Core services. One issue that has come up for Cores is the lack of proper citation of the facilities and the tremendous waste of time that searching the literature takes from each Core.</p>
<p>The ABRF Core Marketplace joined the RRID initiative as a partner and now lists RRIDs on each active Core listing, gathering data about Core usage from the RRID efforts, search of the PubMedCentral database as well as the Core directors. “RRIDs allow our cores to be uniquely identified and many of our members push their users to use RRIDs to cite / acknowledge their facilities” explains Nate Herzog, the Core Marketplace director. As of this writing, the ABRF Core Marketplace maintains 1597 active facility profiles, 576 of which have been cited by RRID.</p>
<p>Some active core facilities with RRIDs include the following:</p>
<ul>
<li>New York University School of Medicine Langone Health Microscopy Laboratory Core Facility (<a href="https://rrid.site/data/record/nlx_144509-1/SCR_017934/resolver/mentions?q=*&amp;i=rrid:scr_017934">RRID:SCR_017934</a>)</li>
<li>Salk Institute Razavi Newman Integrative Genomics and Bioinformatics Core Facility (IGC) (<a href="https://rrid.site/data/record/nlx_144509-1/SCR_014842/resolver/mentions?q=*&amp;i=rrid:scr_014842">RRID:SCR_014842</a>)</li>
<li>Stanford University Vincent Coates Foundation Mass Spectrometry Laboratory Core Facility (<a href="https://rrid.site/data/record/nlx_144509-1/SCR_017801/resolver/mentions?q=*&amp;i=rrid:scr_017801">RRID:SCR_017801</a>)</li>
</ul>
<p>These core facilities with RRIDs do not have corresponding ROR IDs, but the parent organizations do:</p>
<ul>
<li>New York University - <a href="https://ror.org/0190ak572">https://ror.org/0190ak572</a></li>
<li>Salk Institute for Biological Studies - <a href="https://ror.org/03xez1567">https://ror.org/03xez1567</a></li>
<li>Stanford University - <a href="https://ror.org/00f54p054">https://ror.org/00f54p054</a></li>
</ul>


<h3 id="what-are-facilities-in-ror">What are “facilities” in ROR? 
</h3>
<p>ROR’s list of organization types includes a value for “facility,” <a href="https://ror.readme.io/v2/docs/ror-data-structure#types">defined as</a> “A specialized facility where research takes place, such as a laboratory or telescope or dedicated research area.” There are currently <a href="https://api.ror.org/v2/organizations?query.advanced=types:facility">over 11,000 organizations of the type “facility”</a> in ROR.</p>
<p>Facilities in ROR include the following:</p>
<ul>
<li>The Square Kilometre Array Telescope, Australia - <a href="https://ror.org/00ydnew86">https://ror.org/00ydnew86</a></li>
<li>Medical Research Council (MRC) Prion Unit, University College London - <a href="https://ror.org/043j90n04">https://ror.org/043j90n04</a></li>
<li>Stanford Linear Accelerator Center (SLAC) National Accelerator Laboratory - <a href="https://ror.org/05gzmn429">https://ror.org/05gzmn429</a></li>
</ul>
<p>These facilities in ROR do not have corresponding RRIDs, reflecting the broader nature of the definition of “facility” in ROR as well as ROR’s history of comprehensive coverage of non-US and non-biomedical organizations. <strong>Facilities in ROR are usually organizations like national laboratories that may be associated with a university but are not dependent on it.</strong></p>


<h3 id="what-are-the-main-differences-between-core-facilities-in-rrid-and-facilities-in-ror">What are the main differences between &ldquo;core facilities&rdquo; in RRID and &ldquo;facilities&rdquo; in ROR? 
</h3>
<p>RRID identifies core facilities because they can be considered as &ldquo;inputs to experiments,&rdquo; especially when the main contribution of the core facility is to provide researchers with access to particular instruments. University-based core facilities are typically funded by individual grants, and in many cases those are instrument-related grants intended to bring a capability to a certain university. These are typically large instruments that are made available to multiple individual laboratories in the department or university. The success of the core facility depends on the usage of the core resources, the publication of scientific work based on the data produced at the core facility, and on the successful interactions between core facility staff and local investigators. Because these facilities are grant-driven, an accounting of impact typically centers on counting papers that core facilities / instruments have been acknowledged in. RRIDs help core facilities better identify the manuscripts that need to be accounted for as they measure their impact.</p>
<p><strong>In the majority of cases, these core facilities are not in scope for ROR because they are subsections or service offerings at a university instead of being functionally separate organizations in the way that hospitals, national laboratories and facilities, and large research institutes are.</strong> The national or international facilities that have ROR IDs are also acknowledged by researchers in the way described above, and they may well also be core facilities, but they are independent organizations that do not rely on a single parent institution or on individual grants. These independent facilities may also employ researchers who cite that organization as their primary affiliation. One of the primary uses of ROR is to help organizations track research outputs by contributor affiliation.</p>
<p>The difference between facility acknowledgement and contributor affiliation can be seen by looking at a sample article <a href="https://doi.org/10.1038/s41375-021-01491-z">published in 2022 in the journal <em>Leukemia</em> by Baeten et al.</a> The authors are affiliated with research organizations that can be identified by ROR IDs, including The University of Chicago Medicine Comprehensive Cancer Center (<a href="https://ror.org/042wftp98">https://ror.org/042wftp98</a>).</p>
<figure class="blog-figure"><img src="/img/blog/rrid-ror/ror-affiliations.png"
         alt="Author affiliations of researchers for the article https://doi.org/10.1038/s41375-021-01491-z, best tracked with ROR"/><figcaption>
            <p>Author affiliations of researchers for the article <a href="https://doi.org/10.1038/s41375-021-01491-z">https://doi.org/10.1038/s41375-021-01491-z</a>, best tracked with ROR</p>
        </figcaption>
</figure>

<p>In the same paper, the authors also acknowledge support from the DNA Sequencing and Genotyping Facility Core, also known as the <a href="https://rrid.site/data/record/nlx_144509-1/SCR_019196/resolver/mentions?q=*&amp;i=RRID:SCR_019196">University of Chicago Functional Genomics Core Facility, identified by RRID:SCR_019196</a>.</p>
<figure class="blog-figure"><img src="/img/blog/rrid-ror/rrid-acknowledgments.png"
         alt="Acknowledgements of support from core facilities in the article https://doi.org/10.1038/s41375-021-01491-z, best tracked with RRID"/><figcaption>
            <p>Acknowledgements of support from core facilities in the article <a href="https://doi.org/10.1038/s41375-021-01491-z">https://doi.org/10.1038/s41375-021-01491-z</a>, best tracked with RRID</p>
        </figcaption>
</figure>

<p>In this example, then, <strong>RRIDs help track core facility acknowledgements, while ROR IDs help track contributor affiliations.</strong> These are the typical use cases for RRIDs and ROR IDs. That being said, ROR IDs can also be used to help track facility acknowledgements if the facility has a ROR ID.</p>
<p>Let&rsquo;s look now at how to use RRID and ROR.</p>


<h2 id="how-to-use-rrid-and-ror">How to use RRID and ROR 
</h2>


<h3 id="how-to-use-rrids">How to use RRIDs 
</h3>
<p>RRIDs are usually added to the main text of the manuscript; they are present in materials or resource tables (<a href="https://www.sciencedirect.com/science/article/pii/S2211124724012166#sec8">STAR Table example</a>), in the methods section text (<a href="https://www.frontiersin.org/journals/cellular-neuroscience/articles/10.3389/fncel.2021.632354/full#h3">see example</a>), or in the acknowledgement section (<a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC10600361/#s1">many core facilities are cited this way</a>). This enables the core owner, and any robot (or undergraduate student) to easily extract the fact that this manuscript is associated with this resource or core facility. Being part of the <a href="https://jats.nlm.nih.gov/archiving/tag-library/1.2d2/element/resource-id.html">JATS standard</a> means that publishers will pull these identifiers into article metadata for you, at some point, but we are not holding our breath until that happens. Based on the hard work of robots and undergraduate students, RRID pages on both the RRID and Core Marketplace sides list the papers associated with each Core Facility, owners of facilities can also help to fill out this data, for example <a href="https://n2t.net/RRID:SCR_019195">https://n2t.net/RRID:SCR_019195</a>.</p>
<figure class="blog-figure"><img src="/img/blog/rrid-ror/ualberta-core.png"
         alt="Using RRIDs to track papers associated with the University of Alberta Faculty of Medicine and Dentistry Flow Cytometry Core Facility (RRID:SCR_019195)"/><figcaption>
            <p>Using RRIDs to track papers associated with the University of Alberta Faculty of Medicine and Dentistry Flow Cytometry Core Facility (RRID:SCR_019195)</p>
        </figcaption>
</figure>

<p>ORCID records also <a href="https://info.orcid.org/ufaqs/what-information-does-a-research-resource-item-contain/">accept RRIDs</a>, enabling data to be harvested from or sent to an individual researcher’s ORCID profile, which means that analysts can examine ORCID data for RRIDs. <a href="https://datacite.org/blog/metadata-schema-rfc-march_2024/">RRIDs are proposed for inclusion in the next DataCite metadata schema</a>, which means that one day analysts will also be able to look for RRIDs in DataCite data.</p>


<h3 id="how-to-use-ror">How to use ROR 
</h3>
<p>ROR is “infrastructure for infrastructure,” meaning that ROR is meant to be directly incorporated into software applications or added to scholarly metadata by information professionals rather than being used by individual researchers or by facility administrators. ROR offers both a <a href="https://ror.readme.io/v2/docs/rest-api">REST API</a> and a <a href="https://ror.readme.io/v2/docs/data-dump">downloadable dataset</a> for use by developers in their applications, and those who register DOIs with <a href="https://ror.readme.io/v2/docs/crossref">Crossref</a> or <a href="https://ror.readme.io/v2/docs/datacite">DataCite</a> can also include ROR IDs in DOI metadata. Typically, a ROR-enabled system will <a href="https://ror.readme.io/v2/docs/forms">build a ROR-powered lookup into forms</a> so that a user can simply search for the name of their organization and select it without ever seeing or using the ROR ID. Research outputs such as journal articles and datasets can then be tracked by all kinds of different ROR-enabled systems, including OpenAlex, the Web of Science, Crossref, and DataCite Commons.</p>
<figure class="blog-figure"><img src="/img/blog/rrid-ror/ror-datacite-maglab.png"
         alt="DataCite Commons uses the ROR ID to enable tracking of datasets associated with the National High Magnetic Field Laboratory https://ror.org/03s53g630."/><figcaption>
            <p><a href="https://commons.datacite.org/ror.org/03s53g630">DataCite Commons</a> uses the ROR ID to enable tracking of datasets associated with the National High Magnetic Field Laboratory (<a href="https://ror.org/03s53g630">https://ror.org/03s53g630</a>).</p>
        </figcaption>
</figure>

<p>ROR is the preferred organization identifier for contributor affiliations in ORCID, DataCite, and Crossref and is used in DOI metadata for many item types, including journal articles, datasets, projects, instruments, software, data management plans, and grants.</p>


<h2 id="recommendations-for-facilities">Recommendations for Facilities 
</h2>
<p><strong>If you are managing a facility, it’s important to adopt a strategy that incorporates either a ROR ID, an RRID, or both</strong>. Here are our recommendations.</p>
<ol>
<li><strong>University-based core facilities: Obtain an RRID.</strong> RRIDs will enhance the visibility of your research and allow for proper attribution in research outputs. To request an RRID simply go to the <a href="https://rrid.site/about/resource">RRID website</a> and fill out a form (3 fields are required for the initial step). You will direct the curator to look at your core facility webpages and we will in turn ask you to also register with CoreMarketplace (one click option is coming soon!). Curation will provide a set of instructions for how to use your RRID and we will start tracking papers citing your core as soon as the record is approved.</li>
<li><strong>Other facilities: Obtain a ROR ID.</strong> If your facility has multiple parent organizations or is frequently used as an affiliation to identify research outputs (e.g., in contributor affiliations or dataset attributions) without reference to its parent organizations, first <a href="https://ror.org/search">check to see if your facility is already in ROR</a>, and if not, <a href="https://curation-request.ror.org">request a ROR ID</a>. A ROR ID enables disambiguation from your parent organization(s) while preserving the connection, allowing for proper attribution and impact tracking.</li>
<li><strong>Educate researchers about RRID</strong>: Inform researchers about the importance of using RRIDs and encourage them to include RRIDs in their publications for accurate representation of the resources used. Your acceptance letter will include the pointers from a survey from cores that implemented RRIDs successfully.</li>
<li><strong>Choose systems that use ROR</strong>: Check to see if the <a href="https://bit.ly/ror-integrations">tools and platforms</a> used within your facility to manage and track your activities use ROR, and if not, ask the provider to integrate ROR (see more in the <a href="https://doi.org/10.5065/zgsx-2d06">FAIR Facilities Report</a>).</li>
</ol>


<h2 id="conclusion">Conclusion 
</h2>
<p>The use of persistent identifier systems like RRID and ROR is essential for the effective management and recognition of research facilities. Implementing these identifiers will not only streamline operations but also foster collaboration and transparency in research. Using these identifiers can help facilities of various kinds enhance their visibility, track their impact more effectively, and contribute to the larger academic community.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">This blog post explores the difference between 'core facilities' in RRID and 'facilities' in ROR and provides guidance for those who run facilities on how to effectively use these identifiers.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: Next-Generation Publishing Platform Curvenote and ROR</title><id>https://doi.org/10.71938/f314-e718</id><link rel="alternate" href="https://ror.org/blog/2024-11-15-case-study-curvenote/"/><published>2024-11-15T00:00:00+00:00</published><updated>2025-10-22T06:05:03-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/curvenote/curvenote-logo.png"
         alt="Curvenote logo"/>
</figure>

<p>In this interview with Curvenote cofounder Rowan Cockett, we envision a world in which an authoring and publication platform helps scientists collaborate earlier, publish faster, and easily use structured metadata to create fully connected and highly interactive publications and portfolios.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;This represents the ideal of a scholarly ecosystem that is rich with APIs and structured data, and our authoring tools need to be able to show the value of that as you&rsquo;re working. You see a paper, you copy the DOI, and then you don&rsquo;t have to think about the references, you don&rsquo;t have to think about APA formatting. You see a ROR, you copy it into your document, and then it turns into the name of the organization with all of the information directly available in a hover reference.&rdquo;</p>
<p>&ldquo;It took less than 45 minutes [to integrate ROR], and then we were up and going.&rdquo;</p>
<p>&ldquo;We can work and share and do that as fast as possible and that can be supported by the scientific and scholarly infrastructure as well. And that again includes DOIs, and ORCID, and ROR, and things like that. I think it&rsquo;s an exciting vision, and it&rsquo;s happening.&rdquo;</p>
<p>&ndash; Rowan Cockett, Cofounder and CEO, Curvenote</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s2-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thanks for speaking with us. Tell us about <a href="https://curvenote.com">Curvenote</a>.</p>


<h3 id="hahahugoshortcode-s3-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Curvenote’s mission is to free science from static PDF documents to enable researchers to continuously share more interactive, reproducible, and richly-linked scientific content. Curvenote develops authoring and publishing tools for researchers, societies, and institutes, with a focus on computational research. We build authoring tools, including a &ldquo;What You See Is What You Get&rdquo; (<a href="https://www.merriam-webster.com/dictionary/WYSIWYG">WYSIWYG</a>) editor, with the goal of making both the way that science is authored and the way that science is published a lot easier. We&rsquo;re also thinking about how scientific metadata is included, trying to make sure that scientific publications are born with metadata all the way through.</p>
<p>We&rsquo;ve done some really exciting integrations with ROR, as well as with <a href="https://orcid.org">ORCIDs</a> and <a href="https://doi.org">DOIs</a> and <a href="https://rrid.site/">RRIDs</a>. For instance, one thing we&rsquo;re thinking about is that as authors are writing, instead of having a citation manager where you&rsquo;re copying the name and the title and the year into your text as a text-only citation, you just copy in the persistent identifier and then it automatically queries all of the scholarly databases against that citation for you. And then you get a very beautiful site that you can publish out of that. I use the word &ldquo;site&rdquo; rather than &ldquo;journal article,&rdquo; because I think one of the things that is difficult for scientists to do is to have a portfolio of their work. When you&rsquo;re using the publishing ecosystem, your work is scattered all around the place. It&rsquo;s really difficult to show all of the papers and articles and blog posts all in one collection on a landing page on your website.</p>
<p>We&rsquo;re also elevating a lot of scientific computational work by letting people show off interactive <a href="https://jupyter.org/">Jupyter Notebooks</a>. I think by bringing all this a little bit closer together and into the scientist&rsquo;s control, you can experiment a lot more with how you&rsquo;re presenting your work. You can push on things like reproducibility and interactive apps that are educational rather than focusing only on novelty in the scientific paper sense. What we&rsquo;re trying to make possible is the ability to continually update that project, that paper, that article, that blog post, and to make sure there&rsquo;s good metadata throughout that entire process such that it&rsquo;s actually integrated into the scholarly ecosystem.</p>
<div class='centered'><figure><img src="/img/blog/curvenote/curvenote-demo.gif"
         alt="Demonstration of richly-linked and interactive scientific content in Curvenote."/><figcaption>
            <p>Demonstration of richly-linked and interactive scientific content in Curvenote.</p>
        </figcaption>
</figure>
</div>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Would it be fair to say that Curvenote is a digital-first platform?</p>


<h3 id="hahahugoshortcode-s6-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Oh, yes. It creates beautiful PDF documents, but that is not the point.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Does this come out of your own scientific training?</p>


<h3 id="hahahugoshortcode-s8-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Yes. I have a background in Geoscience. I started creating interactive visualization tools for my peers in undergrad and understood that the web is just an exciting platform on which to disseminate these things. I saw my peers tinkering with geologic models and seeing the results right away, and there was this really close interactivity loop where they could discover things just by using those models. That was something that I learned in undergrad. I started and then sold a company around that over the next few years. Then I was working at that acquiring company for about four years, working in large-scale version control system collaboration for environmental science, civil engineering, and mining companies, and creating platforms for those companies. I was in charge of a large team over there.</p>
<p>While I was doing that, I was also doing my PhD in Computational Geophysics, so lots of open source technology, lots of Python, and lots of <a href="https://jupyter.org">Jupyter</a>. We were deploying courses that use those interactively in the classroom, and we took those on lecture tours. So we saw how powerful it was to put research tools directly in the hands of students all over the world, and rather than giving them a PDF with a picture of something, you give them a direct connection to the computation. With a PDF, there are no avenues back to where this came from. The actual scientific work behind that PDF document is all just lost. That was a really eye-opening experience of integrating reproducible science and open source communities into the medium that you&rsquo;re using to communicate.</p>
<p>And while I&rsquo;m doing that, I&rsquo;m also going through scientific publishing and, like, getting my PDF paper rejected from multiple journals. Just honestly the worst.</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And how long did it take to get those rejections?</p>


<h3 id="hahahugoshortcode-s10-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>I was running a 65-person team at the same time as I was trying to finish my PhD, and I was in spurts of trying to get my PhD over the line, so it took three years for me to publish that paper. And the most disappointing thing is that at the end of those three years, someone reached out with a dataset that could use my work. That was the first time that had happened, and we completely missed out on the collaboration possibilities. If I had published right away and got that out there, I could have collaborated with that person at the start and made a much bigger impact.</p>
<p>My research would have been much more applicable with a field data set. I was doing theoretical modeling, and I did the <a href="https://doi.org/10.1016/j.cageo.2018.04.006">first 3d model and inversion of a vadose zone dataset of infiltrating water in an agricultural setting</a>. And that was all theoretical and computational: I demonstrated that this was possible, but not in a field sense. I didn&rsquo;t actually go and apply it anywhere. But there are people who are collecting data that I could work with, and because of the scientific publishing ecosystem, I had zero capability to work with those people, and I think it limited the impact of that research project. By the time the research was published, I was no longer working on that project, and I think that&rsquo;s just a shame, honestly.</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And tell us about how <a href="https://mystmd.org">MyST Markdown</a> fits into this.</p>


<h3 id="hahahugoshortcode-s12-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>I started Curvenote with the idea that you need that direct connection between research and publishing — working and sharing. That experience I described taught me that you need to be able to get your work out there and you need to get feedback right away. It needs to be born on the web with metadata right from the start. I think most scientists do their writing in Word documents, so that was the experience that we were going after for Curvenote: a What You See Is What You Get text editor that integrates with computation and metadata in a much richer way.</p>
<p>With the computation aspects, I think what I underestimated was that the communities who work in Python and Jupyter are much more used to text markup than to working in WYSIWYG editors. There are ways to work across those spectrums, but to enable that reproducibility, we needed to move into that text markup space. So we joined the <a href="https://executablebooks.org/en/latest/">Executable Books project</a>, we <a href="https://github.com/curvenote/curvenote">open sourced all of Curvenote&rsquo;s command line tools</a>, and that was the start of the <a href="https://mystmd.org">MyST-MD</a> project, which is growing and helping the <a href="https://jupyterbook.org/en/stable/intro.html">Jupyter Book ecosystem</a> to create tutorials and educational materials. The MyST-MD project has recently <a href="https://executablebooks.org/en/latest/blog/2024-05-20-jupyter-book-myst/">become part of the Jupyter Books</a> project officially, which is really exciting. Jupyter <a href="https://www.whitehouse.gov/ostp/news-updates/2024/03/21/white-house-office-of-science-technology-policy-announces-year-of-open-science-recognition-challenge-winners/">won the Open Science Award from the White House</a> in the spring, and it&rsquo;s making significant Open Science progress. We are bringing science communication tools to the core of Jupyter and integrating those metadata tools into the fabric of the Jupyter ecosystem.</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s fascinating. I didn&rsquo;t realize all the connections to Jupyter.</p>


<h3 id="hahahugoshortcode-s14-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Yeah, we&rsquo;ve been working very hard.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>How does Curvenote address the concept of peer review?</p>


<h3 id="hahahugoshortcode-s16-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>There are different ways to publish MyST sites. We have a publishing system for Curvenote, so you can publish something yourself and just post it (for example to <a href="https://mystmd.org/guide/deployment-github-pages">GitHub pages</a>), but if you want a DOI on that, if you want to incorporate it into a scientific journal and have some of these better integrations into the scholarly system that are archival and that involve peer review, for example, that needs a system around it. And yes, the Curvenote publishing system and platform enables the creation of peer-reviewed journals. We&rsquo;re working with <a href="https://www.agu.org/">AGU, the American Geophysical Union</a>, which is a large-scale publisher. They&rsquo;re thinking about how to do these web-first papers, these computational articles, and they&rsquo;re experimenting with that and running that on the Curvenote platform. And that&rsquo;s the same thing we&rsquo;re doing with the <a href="https://microscopy.org/">Microscopy Society of America</a>, which was just presented in <a href="https://doi.org/10.1038/d41586-024-02577-1">Nature Magazine</a>.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Got it.</p>


<h3 id="hahahugoshortcode-s18-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>The <a href="https://www.ncssm.edu/">North Carolina School of Science and Mathematics</a> is also using Curvenote &ndash; that&rsquo;s <strong>high school students</strong> learning about ORCIDs, RORs, and DOIs for the first time! There&rsquo;s a teacher there who wanted his students to be able to have a breath of fresh air in the science ecosystem and to learn what it would be like for them to publish in six years. And that means they&rsquo;re publishing immediately. They&rsquo;re getting DOIs, they have ORCIDs, they have a <a href="https://ror.org/03zbydc22">ROR</a>! Their work is born open access, and it&rsquo;s born with reproducibility, and it&rsquo;s born with the datasets integrated. It&rsquo;s just so cool. Their journal is called <a href="https://morgantonscientific.ncssm.edu/">The Morganton Scientific</a>.</p>
<figure class="blog-figure"><img src="/img/blog/curvenote/ncssm-morganton-scientific.png"
         alt="The North Carolina School of Science and Mathematics showcases student research in The Morganton Scientific, powered by Curvenote."/><figcaption>
            <p>NCSSM showcases student research in Curvenote-powered The Morganton Scientific.</p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s20-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Very cool! There&rsquo;s so many people experimenting right now with new publishing models, new peer review models, new models of open preprint dissemination, that kind of thing. And it&rsquo;s so clear that it&rsquo;s a generational type of thing. I don&rsquo;t think younger people coming into science have the patience for the kind of print-first model that persists, which has just been replaced with a PDF in place of print.</p>


<h3 id="hahahugoshortcode-s21-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Yes, exactly. And that&rsquo;s the way that we&rsquo;re framing Curvenote, as a next-generation tool, a platform where scientific computation, scientific reproducibility, and scientific metadata are deeply integrated. And what does &ldquo;next generation&rdquo; mean? It means that it&rsquo;s easy enough for a high school to get up and running and host their own journal! They can do that by themselves, and it&rsquo;s open access, and it&rsquo;s better than most journals out there. I think that it&rsquo;s just exciting. Eighteen-year-olds can do this, so there&rsquo;s no excuse!</p>


<h3 id="hahahugoshortcode-s22-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about how you&rsquo;re using ROR now and how you might intend to use it in the future.</p>


<h3 id="hahahugoshortcode-s23-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>There are a couple of different ways. There are the ROR IDs in affiliation metadata that we&rsquo;re collecting, so inside of Curvenote, you can add your ROR ID in the metadata, and that&rsquo;s collected along with your affiliation. And then we have just started integrating that into the MyST ecosystem, so now, when you paste a link to ROR, you can hover over that anywhere in your document, and you can see the integrated metadata.</p>
<figure><img src="/img/blog/curvenote/curvenote-ror-hover.gif"
         alt="In Curvenote, hovering over the text &#39;University of British Columbia&#39; brings up information about the organization from ROR."/><figcaption>
            <p>In Curvenote, hovering over the text &lsquo;University of British Columbia&rsquo; brings up information about the organization from ROR.</p>
        </figcaption>
</figure>

<p><strong>And again, this represents the ideal of a scholarly ecosystem that is rich with APIs and structured data, and our authoring tools need to be able to show the value of that as you&rsquo;re working. You see a paper, you copy the DOI, and then you don&rsquo;t have to think about the references, you don&rsquo;t have to think about APA formatting. You see a ROR, you copy it into your document, and then it turns into the name of the organization with all of the information directly available in a hover reference.</strong></p>


<h3 id="hahahugoshortcode-s25-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Cool. And do you have a model for funding information in Curvenote?</p>


<h3 id="hahahugoshortcode-s26-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>We rely on the same funding-statement standards that we have developed into the MyST ecosystem. Part of the work we&rsquo;ve been doing with the Sloan Foundation and with AGU was collecting the funding information and making sure that that&rsquo;s easy to author in a <a href="https://mystmd.org/guide/frontmatter#frontmatter-funding">YAML-based format</a>. That&rsquo;s something that is presentable to authors, so that they can write their funding information in with a couple of DOIs. We focused on the <a href="https://www.crossref.org/services/funder-registry/">Crossref Funder ID</a> as the main integration point there.</p>


<h3 id="hahahugoshortcode-s27-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right. That is currently the standard for a funder identifier. I don&rsquo;t know whether you saw that Crossref did <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">announce last year</a> that over the long term ROR will be the standard for identifying funders as well as other research organizations. We&rsquo;re in the process of doing a lot of curation in ROR to make sure that all the funders are represented, and Crossref is working to make sure that ROR IDs can be accepted as funder identifiers in the Crossref schema. So just be aware that that&rsquo;s coming. [See our <a href="https://ror.readme.io/docs/funder-registry">Funder Registry transition guide</a> for more information.]</p>


<h3 id="hahahugoshortcode-s28-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>We will make those changes. That&rsquo;s good to know.</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>MyST is a flavor of <a href="https://en.wikipedia.org/wiki/Markdown">Markdown</a>, correct? How is it different from regular Markdown?</p>


<h3 id="hahahugoshortcode-s30-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Yes, it is a flavor of Markdown. We&rsquo;re pretty loose in the way that we talk about the MyST ecosystem, because there&rsquo;s really three parts of it. First, there&rsquo;s the flavor of Markdown &ndash; the actual syntax that you write and see. Second, there is the structure of that after it has been parsed. That piece is much more similar to <a href="https://jats.nlm.nih.gov/">JATS</a> &ndash; there&rsquo;s basically a one to one mapping of that next-generation scholarly structured data to what is in JATS XML today. There&rsquo;s a <a href="https://mystmd.org/spec">MyST spec</a> describing the structure of these documents, which we&rsquo;re working on in the MyST community. Third, there are the command line tools, which make it very, very easy to <a href="https://mystmd.org/guide/quickstart">get up and going</a>. So MyST is more than the markup language: again, you paste a DOI in there and it pulls in the references, it talks to various external services, and MyST brings that authoring experience fully together.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. Thanks for that.</p>


<h3 id="hahahugoshortcode-s32-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Currently, there&rsquo;s 15% adoption of the MyST language in the scientific documentation community, and that&rsquo;s actually quite large.</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s quite good, yes!</p>


<h3 id="hahahugoshortcode-s34-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>That&rsquo;s active documentation for projects, and there&rsquo;s a couple of caveats in there, but that&rsquo;s 15% right now. We&rsquo;re working with the Python community to bring them over into this MyST ecosystem. That&rsquo;s one of the entry points to a large part of the scientific computation ecosystem: teaching people how to author in this markup language, providing metadata in a structured way, and then coming all the way back. Over the coming years, MyST will be a default language in the Jupyter ecosystem that&rsquo;s available. We&rsquo;re starting to integrate MyST into the fabric of how scientists are authoring computational documents and computational articles.</p>


<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It&rsquo;s very smart to connect to things like Jupyter &ndash; tools that people use all the time. At one point, Jupyter was probably itself just a tool that nobody knew about, just used by a few different people. And then it was everywhere, and it became something everyone was using all the time, because it filled a need.</p>


<h3 id="hahahugoshortcode-s36-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Yeah. There&rsquo;s a great TED talk called &ldquo;How to Start a Movement.&rdquo; It&rsquo;s a dance party. The take-home there is that you need to join the things that are already existing and put your effort behind them. It takes a different kind of leadership to join these movements rather than start your own thing.</p>

<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/V74AxCqOTvg" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="How to start a movement | Derek Sivers"></iframe>
</div>

<p>This is very much our approach, to reinforce existing communities, to bring new experiences to life inside of those and be that strengthening and connecting tissue. I think that&rsquo;s somewhat lacking in how scientists experience the scholarly ecosystem in their day to day work. They have DOIs, they use Google Scholar, they see the list of papers that cite them, but they don&rsquo;t have that direct connection to adding metadata in the way that they should. It should be tangible. They should get a hit of dopamine.</p>


<h3 id="hahahugoshortcode-s38-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>The UI you&rsquo;ve made is actually really fabulous, because you can see the metadata, right? You can see all the connections underneath. People who don&rsquo;t code or people who don&rsquo;t really work with a lot of metadata often have a hard time understanding the importance of persistent identifiers. I think it helps when people can see, visually, that the identifier is connected to this knot of information.</p>


<h3 id="hahahugoshortcode-s39-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s40-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>How was ROR to integrate, technically? Was it difficult?</p>


<h3 id="hahahugoshortcode-s41-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p><strong>It took less than 45 minutes, and then we were up and going.</strong></p>


<h3 id="hahahugoshortcode-s42-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Amazing. That&rsquo;s great to hear. What else would you like to say about ROR or about your work?</p>


<h3 id="hahahugoshortcode-s43-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>The next few years in scientific publishing are going to look very different. I think there&rsquo;s a wave that&rsquo;s been building up over this time. We&rsquo;ve been working for five years on Curvenote, MyST and the underlying authoring tools, and the scientific publishing ecosystem and the players in that space don&rsquo;t see that coming, because they&rsquo;re not really paying attention to educational tutorials that people are publishing by themselves, or blog posts that are using this, or scientific documentation.</p>
<p>This is where people are doing their work. That&rsquo;s where people work every single day. And if we are extending the publishing ecosystem out from that, and deeply integrating it into the fabric of how people are working daily, that is a very different future than how we think about scientific publishing today. It&rsquo;s much more about science communication. How are you getting feedback as fast as possible? How are you directing micro-expertise? How are you thinking about how to integrate with services like <a href="https://prereview.org/">PREreview</a> to bring that attention closer to when you&rsquo;re doing a statistical analysis? You can get your figure out in front of people. You can get your methods section out in front of people.</p>
<p>And that&rsquo;s a possibility that is being enabled by these various communities working together. That is a radically different reimagining of how the science infrastructure is supporting the work and communication of scientists. It can help support today&rsquo;s publishing infrastructure, but it doesn&rsquo;t have to just be that. <strong>We can work and share and do that as fast as possible and that can be supported by the scientific and scholarly infrastructure as well. And that again includes DOIs, and ORCID, and ROR, and things like that. I think it&rsquo;s an exciting vision, and it&rsquo;s happening.</strong></p>


<h3 id="hahahugoshortcode-s44-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thanks so much for speaking with us.</p>


<h3 id="hahahugoshortcode-s45-hbhb-rowan-cockett"><figure class="round-figure"><img src="/img/blog/curvenote/rowan-cockett.png"
         alt="Avatar of Rowan Cockett"/>
</figure>
 Rowan Cockett 
</h3>
<p>Thank you.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this interview with Curvenote cofounder Rowan Cockett, we envision a world in which an authoring and publication platform helps scientists collaborate earlier, publish faster, and easily use structured metadata to create fully connected and highly interactive publications and portfolios.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">How Good Is Your Matching?</title><id>https://doi.org/10.71938/vq19-1577</id><link rel="alternate" href="https://ror.org/blog/2024-11-06-how-good-is-your-matching/"/><link rel="related" href="https://doi.org/10.13003/ief7aibi"/><published>2024-11-06T20:21:55-05:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Dominika Tkaczyk</name><uri>https://orcid.org/0000-0001-5055-7876</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>The fourth blog post about metadata matching by ROR’s Adam Buttrick and Crossref’s Dominika Tkaczyk explains how to measure the quality of different matching strategies with an evaluation dataset and metrics. Read all posts in the <a href="/tags/matching">series on metadata matching</a>.</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/matching-how-good/string-id-matching-crown.png"
         alt="An elaborate crown on top of the words string = id on a bright blue background"/>
</figure>

<p>In our <a href="/blog/2024-08-28-the-myth-of-perfect-metadata-matching/">previous blog post</a> in this <a href="/tags/matching">series</a>, we explained why no metadata matching strategy can return perfect results. Thankfully, however, this does not mean that it&rsquo;s impossible to know anything about the quality of matching. Indeed, we can (and should!) measure how close (or far) we are from achieving perfection with our matching. Read on to learn how this can be done!</p>
<p>How about we start with a quiz? Imagine a database of scholarly metadata that needs to be enriched with identifiers, such as ORCIDs or ROR IDs. Hopefully, by this point in our series this is recognizable as a classic matching problem. In searching for a solution, you identify an externally-developed matching tool that makes one of the below claims. Which of the following would demonstrate satisfactory performance?</p>
<ol>
<li>It is a cutting-edge, state-of-the-art, intelligent-as-they-come, bullet-proof technology! All the big players are using it. You won&rsquo;t find anything better!</li>
<li>The tool was tested on the metadata of 10 articles we authored, and many identifiers were matched.</li>
<li>The quality of our matching is 98%.</li>
</ol>
<p>Okay, okay, trick question. The correct answer here is to opt for secret answer #4: &ldquo;I wouldn&rsquo;t be satisfied by any of these claims!&rdquo; Let&rsquo;s dig in a bit more to why this is the correct response.</p>


<h2 id="the-importance-of-the-evaluation">The importance of the evaluation 
</h2>
<p>Before we decide to integrate a matching strategy, it is important to understand as much as possible about how it will perform. Whether it is used in a semi or fully automated fashion, metadata matching will result in the creation of new relationships between things like works, authors, funding sources, and institutions. Those relationships will then, in turn, be used by the consumers of this metadata to guide their understanding and perhaps even to make important decisions about those same entities. As organisations providing scholarly infrastructure, we must therefore take it as our paramount responsibility to understand any caveats or shortcomings of the scholarly metadata we make available, including that resulting from matching.</p>
<p>Proper evaluation is what allows us to do this, as it is impossible to know how well a given matching strategy will perform in its absence. This is true no matter how simple or complex a matching strategy may seem. Complex methods can be tailored to data with specific characteristics and might fail when faced with something different from this. Simple methods might be only appropriate for clean metadata or a narrow set of use cases.</p>
<p>Beyond complexity, matching strategies themselves vary widely in character, inheriting biases from their design, training data, or how a problem has been formulated. Some prioritise avoiding false negatives, while others focus on minimising false positives. Even a generally high-performing strategy might not be perfectly aligned with your specific needs or data. In some cases, the task also itself might be too challenging, or the available metadata too noisy, for any matching strategy to perform adequately.</p>
<p>Evaluation is, again, how we understand these nuances and make informed decisions about whether to implement matching or avoid it altogether. By now, it should also be clear that the notion &ldquo;we don&rsquo;t need to evaluate&rdquo; is far from ideal! Given its importance, let&rsquo;s explore how evaluation is actually done.</p>


<h2 id="evaluation-process">Evaluation process 
</h2>
<p>In general, a proper evaluation procedure should follow the following steps:</p>
<ol>
<li>Preparation of an evaluation dataset containing many examples of matching inputs and the corresponding expected outputs.</li>
<li>Applying the strategy to all inputs from the dataset and recording the responses.</li>
<li>Comparing the expected outputs with the outputs from the strategy.</li>
<li>Converting the results of the above comparison into evaluation metrics.</li>
</ol>
<p>From this accounting, we can see that there are two primary components for the evaluation process: an evaluation dataset and metrics.</p>


<h3 id="evaluation-dataset">Evaluation dataset 
</h3>
<p>It&rsquo;s useful to conceive an evaluation dataset as the specification for an ideal matching strategy, describing what would be returned from our forever-elusive perfect matching. When creating such a dataset, what this means in practice is that it should contain a number of real-world, example inputs, along with the corresponding ideal or expected outputs, and that all data should be in the same format as the strategy is expected to process. The outputs should themselves also confirm the strategy&rsquo;s overall requirements, for example, by being consistent with its cardinality, meaning whether zero, one, or multiple matches should be returned and under what circumstances. In terms of size, it&rsquo;s generally useful to calculate the ideal number of evaluation examples using a sample size calculator or using <a href="https://doi.org/10.1520/E0122-17R22">standardised measures</a>, but as a quick rule of thumb: less than 100 examples is probably insufficient, more than 1,000 or 2,000 is generally acceptable.</p>
<p>It is also important that the evaluation dataset be representative of the data to be matched in order to ensure reliable results. Using unrepresentative data, even if convenient, can lead to biassed or misleading evaluations. For example, if matching affiliations from various journals, building an evaluation dataset solely from one journal that already assigns ROR IDs to authors&rsquo; affiliations might be tempting. The data, having been already annotated, allow us to avoid the tedious work of labelling, and we might even know that it is produced by a high-quality source.  This is still, unfortunately, a flawed approach. In practice, such datasets are unlikely to represent the entire range of affiliations to be matched, potentially leading to a significant discrepancy between the evaluated quality and the actual performance of the matching strategy, when applied to the full dataset. To assess a matching strategy&rsquo;s effectiveness, we have to resist shortcuts and instead do our best to create truly representative evaluation datasets to be confident that we&rsquo;ve accurately measured their performance.</p>


<h3 id="evaluation-metrics">Evaluation metrics 
</h3>
<p>Evaluation metrics are what allow us to summarise the results of the evaluation into a single number. Metrics give us a quick way to get an estimation of how close the strategy was to achieving perfect results. They are also useful if we want to compare different strategies with each other or decide whether the strategy is sufficient for our use case, removing the need to compare countless evaluation examples from different strategies against one another.</p>
<p>The simplest metric is <a href="https://en.wikipedia.org/wiki/Accuracy_and_precision">accuracy</a>, which can be calculated as the fraction of the dataset examples that were matched correctly. While a commonsense benchmark, accuracy can be misleading, and we generally do not recommend using it. To understand why, let&rsquo;s consider the following small dataset and the responses from two strategies:</p>
<table>
<thead>
<tr>
<th>Input</th>
<th>Expected output</th>
<th>Strategy 1</th>
<th>Strategy 2</th>
</tr>
</thead>
<tbody>
<tr>
<td>string 1</td>
<td>ID 1</td>
<td>ID 1</td>
<td>ID 1</td>
</tr>
<tr>
<td>string 2</td>
<td>ID 2</td>
<td>ID 3</td>
<td>Empty output</td>
</tr>
<tr>
<td>string 3</td>
<td>Empty output</td>
<td>Empty output</td>
<td>Empty output</td>
</tr>
</tbody>
</table>
<p>Both strategies achieved the same accuracy, 0.67, making one mistake each on the second affiliation string. However, a closer examination reveals that these error types are distinct. The first strategy matched to an incorrect identifier, while the second refused to return any value illustrating the limitation of accuracy as a measure: it generally fails to capture important nuances in strategy behaviour. In our example, the first strategy appears more permissive, returning matches even in unclear circumstances, while the second is more conservative, withholding them when uncertain. Although using such a small dataset would preclude drawing any definitive conclusions, it highlights how relying on accuracy alone can obscure differences in performance.</p>
<p>For evaluating matching strategies, we instead recommend using two metrics: <a href="https://en.wikipedia.org/wiki/Precision_and_recall">precision and recall</a>. To recap from our previous blog post:</p>
<ul>
<li>Precision is calculated as the number of correctly matched relationships resulting from a strategy, divided by the total number of matched relationships. It can also be interpreted as the probability that a match is correct. Low precision indicates a high rate of false positives, which are incorrect relationships created by the strategy.</li>
<li>Recall is calculated as the number of correctly matched relationships resulting from a strategy, divided by the number of true (expected) relationships. It can also be interpreted as the probability that a true (correct) relationship will be created by the strategy. Low recall means a high rate of false negatives, which are relationships that should have been created by the strategy but were not made.</li>
</ul>
<p>Applying these measures to our prior example, the strategies achieved the following results:</p>
<ul>
<li>Strategy 1: accuracy 0.67, precision 0.5, recall 0.5</li>
<li>Strategy 2: accuracy 0.67, precision 1.0, recall 0.5</li>
</ul>
<p>As we can see, while both strategies have the same accuracy, using precision and recall better describes the difference between the two sets of results. Strategy 1&rsquo;s lower precision indicates it made false positive matches, while Strategy 2&rsquo;s perfect precision shows that it made none. The identical recall scores show both identified half of the possible matches.</p>
<p>Of course, results calculated using such a small dataset are not very meaningful. If we obtained these scores from a large, representative evaluation dataset, it would indicate to us that Strategy 1 risks introducing many incorrect relationships, while Strategy 2 would be unlikely to do so. In both cases, we would still expect approximately half of the possible relationships to be missing from the strategies&rsquo; outputs.</p>
<p>Which one is more important to prioritise, precision or recall? It depends on the use case. As a general rule, if you want to use the strategy in a fully automated way, without any form of manual review or correction of the results, we recommend paying more attention to precision. Privileging precision will allow you to better control the number of incorrect relationships added to your data. If you want to use the strategy in a semi-automated fashion, where there is a manual examination of and a chance to correct the results, pay more attention to recall. Doing so will guarantee that enough options are presented during the manual review stage and fewer relationships will be missed as a result.</p>
<p>To get a more balanced estimation of performance, we can also consider both precision and recall at the same time using a measure called <a href="https://en.wikipedia.org/wiki/F-score">F-score</a>. F-score combines precision and recall into a single number, with variable weight given to either aspect. There are three commonly used types, each calculated as the weighted <a href="https://en.wikipedia.org/wiki/Harmonic_mean">harmonic mean</a> of precision and recall:</p>
<ul>
<li>F0.5: Precision is weighted more heavily. It can be understood as a score that is 50% more sensitive to precision than recall. A high F0.5 score indicates a measure of performance that minimises false positives.</li>
<li>F1: Equal weight is given to both precision and recall. It can be interpreted as the most balanced score in this set. High F1 indicates good overall performance, with both false positives and false negatives being minimised equally.</li>
<li>F2: Recall is weighted more heavily. It can be understood as a score that is 50% more sensitive to recall than precision. A high F2 score indicates a measure of performance where false negatives are minimised.</li>
</ul>
<p>Each of these variants allows for fine-tuning the evaluation metric to align with your expectations for a specific matching task. Choose whichever reflects the relative importance of precision versus recall for your use case.</p>
<p>To summarise, to avoid falling prey to misleading sales pitches or silly quizzes, it is important to have a good understanding of the performance of any strategies you are building or integrating. With thorough evaluation, including a representative dataset and carefully considered metrics, we can estimate the quality of matching and, by extension, its resulting relationships.</p>
<p>Now that we&rsquo;ve covered how to evaluate effectively, we can move on to some other aspects of metadata matching. Our next blog post will take a final, more holistic view of matching, exploring some complementary considerations to all of the preceding. Stay tuned for more!</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:adam@ror.org">adam@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The fourth blog post about metadata matching by ROR’s Adam Buttrick and Crossref’s Dominika Tkaczyk explains how to measure the quality of different matching strategies with an evaluation dataset and metrics.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Welcome Riley Marsh, Metadata Manager</title><id>https://doi.org/10.71938/aavf-pb62</id><link rel="alternate" href="https://ror.org/blog/2024-09-03-welcome-riley-marsh/"/><published>2024-09-03T00:00:00+00:00</published><updated>2025-11-05T16:17:42-08:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/lion-closeup-1200.jpg"
         alt="Closeup of a lion&#39;s face"/>
</figure>

<p>We are thrilled to introduce a new member of the ROR pride: Riley Marsh joined the ROR team in August as our new Metadata Manager.</p>
<p>As Metadata Manager, Riley will support the day-to-day activities of curating the ROR registry, including triaging incoming requests and processing changes for new registry releases. Riley will work closely with ROR’s <a href="https://ror.org/registry/#curation-advisory-board">Curation Advisory Board</a> to maintain ROR’s metadata policies and practices as well as working to understand and support various community use cases that might influence ROR&rsquo;s curation processes and procedures. Based at <a href="https://crossref.org">Crossref</a>, which is one of ROR’s three operating organizations, Riley will also contribute to metadata schema development activities on the Crossref side.</p>
<p>Riley brings a strong background to this work, most recently having worked at Tulane University Libraries in the Digital Scholarship &amp; Initiatives department, supporting digital collections, electronic theses and dissertations, and various digitization projects across campus and beyond.</p>
<p>Former Metadata Curation Lead Adam Buttrick continues to be involved in ROR in his <a href="https://ror.org/blog/2024-05-22-new-positions/">new role as persistent identifiers product manager at California Digital Library</a>.</p>
<p>With Riley joining the team, it is an opportunity to reflect on and emphasize the importance of the workflows, policies, infrastructure, and above all the people that ensure ROR&rsquo;s metadata is carefully curated and actively maintained so that our global user base has access to clean, comprehensive, and accurate records.</p>
<p>As a reminder, anyone can suggest registry additions and updates via <a href="https://curation-request.ror.org">this form</a>. All suggestions are processed according to ROR’s curation policies, which are developed with our Curation Advisory Board, and tracked openly on <a href="https://github.com/orgs/ror-community/projects/19">GitHub</a> so you can see what has been suggested and follow suggestions as they go through the process. Registry updates are published approximately once a month, and the latest data is available on <a href="https://zenodo.org/doi/10.5281/zenodo.6347574">Zenodo</a> and in the <a href="https://ror.readme.io/docs/rest-api">ROR API</a>.</p>
<p>For more information about curation, read more on the <a href="https://ror.org/registry/">registry web page</a> and in the <a href="https://github.com/ror-community/ror-updates?tab=readme-ov-file#ror-updates">ror-updates GitHub repository</a>.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We are thrilled to introduce a new member of the ROR pride: Riley Marsh joined the ROR team in August as our new Metadata Manager.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">How Should ROR Handle API Client Identification?</title><id>https://doi.org/10.71938/rgyq-pk44</id><link rel="alternate" href="https://ror.org/blog/2024-08-29-api-client-identification/"/><published>2024-08-29T09:19:41-04:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Liz Krznarich</name><uri>https://orcid.org/0000-0001-6622-4910</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>We&rsquo;re looking for your feedback on a <a href="https://bit.ly/ror-client-id-proposal">draft proposal for managing ROR API client identification</a> with &lsquo;mailto&rsquo; parameters or API keys, and comments are open <strong>through October 4th, 2024</strong>. Which option would you prefer? Does either option present a significant barrier for your use of the ROR API? Let us know!</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/clientid/sven-finger-XaBE3eokg88-unsplash.jpg"
         alt="Lion-shaped door knocker from a house in Portugal. Photo by Sven Finger on Unsplash."/>
</figure>

<p>Since ROR’s launch in 2019, the ROR API has been provided at no cost and with no registration or credentials required. While requiring no registration or identification reduces barriers to entry, it makes managing the health and stability of the API challenging amid constant and growing use.</p>
<p>In recent months, “impolite” use of the ROR API has presented a particular challenge, which causes degraded performance for all API users. Examples of “impolite” use include:</p>
<ul>
<li>Sustained high levels of requests just below the rate limit from multiple coordinated IP addresses</li>
<li>Use of proxies and frequent IP changing to evade rate-limiting</li>
</ul>
<p>IP-based rate limiting, which ROR <a href="https://ror.readme.io/v2/docs/rest-api#registration-and-rate-limits">already has in place at a limit of 2,000 requests in every 5 minutes</a>, is not an effective solution to the behaviors listed above. Additionally, monitoring and blocking impolite behavior based on IP address is not a scalable solution as ROR API use continues to grow. <strong>ROR is therefore seeking to implement a lightweight solution to allow monitoring and managing API traffic based on criteria other than IP address.</strong></p>


<h2 id="goals">Goals 
</h2>
<ul>
<li>
<p>Maintain stability and usability of ROR API by enabling throttling/blocking based on some sort of client identification rather than IP address</p>
</li>
<li>
<p>Allow ROR team to better support users by enabling ROR team to contact users whose requests generate lots of errors, are incorrectly formatted, etc and provide them with assistance fixing their requests</p>
</li>
<li>
<p>Allow ROR team more insight into API usage patterns than can be gathered from IP addresses alone, which help to guide technical decisions and architecture approaches</p>
</li>
</ul>
<div class='callout orange'>
	<div class="row">
		<div class="col-md-12"><span>ROR is <strong>not</strong> considering removing access to any API services/resources for anonymous requests. Anonymous requests will be allowed, but they will receive a lower rate limit.</span></div>
	</div>
</div>


<h2 id="scope">Scope 
</h2>
<p>ROR is seeking feedback on the the following proposed change:</p>
<p><strong>The potential addition of API client identification in order to receive a rate limit of 2000 requests per 5 minute period. Requests without identification will receive a lower rate limit of 50 requests per 5 minute period.</strong></p>
<p>Any additional revisions to ROR’s API are out of scope for this proposal, but can be submitted for consideration to <a href="https://github.com/ror-community/ror-roadmap">ROR’s roadmap</a>.</p>


<h2 id="proposed-changes">Proposed changes 
</h2>
<p>In keeping with ROR’s objective of maintaining low barriers to entry, two options for identifying API clients have been developed. Both approaches involve providing an additional URL parameter or request header with each API request. Neither of these approaches are intended to provide authentication or authorization, therefore values provided are not secret and can be passed and stored as plain text. <strong>Full details of the proposed changes, pros and cons, privacy considerations, examples, and implementation timing are available <a href="https://bit.ly/ror-client-id-proposal">in the proposal draft</a>.</strong></p>


<h2 id="giving-feedback">Giving feedback 
</h2>
<p>To give feedback, please visit the <a href="https://bit.ly/ror-client-id-proposal">proposal document</a> and suggest changes to the text or add comments in the margin by <strong>October 4th, 2024</strong>. Feedback about the &ldquo;Questions to consider&rdquo; noted in the document is particularly appreciated. We&rsquo;ll give a summary of the responses and discuss implementation at the <a href="https://ror.org/events/#ror-community-call-november-2024">ROR Community Call in November</a>.</p>
<p>Let us know what you think! We appreciate your time and all that you do to help us make ROR the best service it can be.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Write <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We're looking for your feedback on a draft proposal for managing ROR API client identification with 'mailto' parameters or API keys, and comments are open through October 4th, 2024.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">The Myth of Perfect Metadata Matching</title><id>https://doi.org/10.71938/ec54-6370</id><link rel="alternate" href="https://ror.org/blog/2024-08-28-the-myth-of-perfect-metadata-matching/"/><link rel="related" href="https://doi.org/10.13003/pied3tho"/><published>2024-08-28T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Dominika Tkaczyk</name><uri>https://orcid.org/0000-0001-5055-7876</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>The third blog post about metadata matching by ROR’s Adam Buttrick and Crossref’s Dominika Tkaczyk discusses a few common myths about metadata matching that are often encountered when interacting with users, developers, integrators, and other stakeholders. Read all posts in the <a href="/tags/matching">series on metadata matching</a>.</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/matching-perfect/false-positives-negatives.png"
         alt="False positives and false negatives"/>
</figure>

<p>In our previous installments of the <a href="/tags/matching">blog series about matching</a> (see <a href="/blog/2024-05-16-metadata-matching-101/">part 1</a> and <a href="/blog/2024-06-27-anatomy-of-metadata-matching/">part 2</a>), we explained what metadata matching is, why it is important and described its basic terminology. In this entry, we will discuss a few common beliefs about metadata matching that are often encountered when interacting with users, developers, integrators, and other stakeholders. Spoiler alert: we are calling them myths because these beliefs are not true! Read on to learn why.</p>
<p>If you have stuck with us this far in our series, hopefully, you are at least a bit excited about the possibility of creating new relationships between the works, authors, institutions, preprints, datasets, and myriad other objects in our existing scholarly metadata. Who would not want all of these to be better connected?</p>
<p>We have to pause for a moment and be honest with you: metadata matching is a complex problem, and doing it correctly requires significant effort. What is worse, even if we do everything right, our matching won&rsquo;t be perfect. This may be counterintuitive. Perhaps you&rsquo;ve heard that matching is not a hard problem, or have encountered people surprised that a matching strategy returned a wrong or incomplete answer. Sometimes, it is obvious to a person from looking at some specific example that a match should (or should not) have been made, so they naturally assume that a change to account for this has to be simple.</p>
<p>Misconceptions like these can be problematic. They create confusion around matching, drive users&rsquo; expectations to unreasonable levels, and make people drastically underestimate the effort needed to build and integrate matching strategies. So let&rsquo;s dive right in and debunk a few common myths about metadata matching.</p>


<h2 id="myth-1-a-metadata-matching-strategy-should-be-100-correct">Myth #1: A metadata matching strategy should be 100% correct 
</h2>
<p>Anyone who has built or supported a matching strategy has likely encountered the following belief: it is possible to develop a perfect strategy, meaning one that always returns the correct results, no matter the inputs. The unfortunate truth is that while one&rsquo;s aim should always be to design matching strategies that return correct results, once we move beyond the simplest class of problems or artificially clean data, no strategy can achieve this outcome. In thinking through why this is the case, some inherent constraints become obvious:</p>
<p>The inputs to matching are often strings in human-readable formats, which can vary wildly in their structure, order and completeness. Since they&rsquo;re intended to be parsed by people, instead of machines, they&rsquo;re inherently lossy and frequently unstructured, anticipating that a person can infer from the source context what is being referenced. Matching strategies, although built to make sense of unstructured data, unfortunately, don&rsquo;t have the luxury of this flexibility. A strategy has to account for translating a messy, partial, or inconsistent input into a correct and structured match.</p>
<p>Consider, for example, the following inputs to an affiliation matching strategy:</p>
<pre tabindex="0"><code>1. &#34;Department of Radiology, St. Mary&#39;s Hospital, London W2 1NY, UK&#34;
2. &#34;Saint Mary&#39;s Hospital, Manchester University NHS Foundation Trust&#34;
3. &#34;St. Mary&#39;s Medical Center, San Francisco, CA&#34;
4. &#34;St Mary&#39;s Hosp., Dublin&#34;
5. &#34;St Mary&#39;s Hospital Imperial College Healthcare NHS Trust&#34;
6. &#34;聖マリア病院&#34;
</code></pre><p>In order to correctly identify the organisations mentioned here, the matching strategy must be able to distinguish between different ways of representing the same institution, disambiguate multiple institutions that have similar names, and handle variant forms for the parts of each name (Saint/St./St), identify the same name in different languages (&ldquo;聖マリア病院&rdquo; is Japanese for &ldquo;St. Mary&rsquo;s Hospital&rdquo;), and make assumptions about partial or ambiguous locations translating to more precise references. While a person reviewing each of these strings might be able to accomplish these tasks, even here there are some challenges. Does &ldquo;St Mary&rsquo;s Hosp., Dublin&rdquo; refer to the hospital in Ireland or a separate hospital in one of the many cities that share this name? Should we presume that because &ldquo;聖マリア病院&rdquo; is in Japanese, this refers to a hospital in Japan?  Would someone, by default, be aware that St. Mary&rsquo;s Hospital in London is part of the Imperial College Healthcare NHS Trust, such that inputs one and five refer to the same organisation?</p>
<p>An additional challenge lies in the quality of the data, which in the context of matching, encompasses both the input and the dataset being matched against. In real world circumstances, no dataset is fully accurate, complete, or current and certainly not all three. As a result, there will always be functionally random differences between inputs to the strategy and the entities to be matched. A theoretically perfect matching strategy would thus need to distinguish between inconsequential discrepancies resulting from gaps, errors, and variable forms of reference and actual, meaningful differences indicating an incorrect match. As one might imagine, this would require near total knowledge of the meaning and context for all inputs and outputs, a nigh-on impossible task for any person or system!</p>
<p>As a consequence, no metadata matching strategy will ever be perfect. It is unreasonable for us to expect them to be. This does not mean, of course, that all strategies are equally flawed or destined to forever return middling results. Some are better than others and we can improve them over time. Which brings us to the next myth:</p>


<h2 id="myth-2-it-is-always-a-good-idea-to-adapt-the-matching-strategy-to-a-specific-input">Myth #2: It is always a good idea to adapt the matching strategy to a specific input 
</h2>
<p>Matching strategies are not static. They can &ndash; and should &ndash; be improved. There is, however, a deceptive trap that one can fall into when attempting to improve a matching strategy. Whenever we encounter an incorrect or missing result for a specific input, we treat this problem like a software bug and try to adapt the strategy to work better for it, without considering all other cases.</p>
<p>The more complicated reality is that the quality of matching results is controlled through a complex set of trade-offs between <a href="https://en.wikipedia.org/wiki/Precision_and_recall">precision and recall</a> that determine the kind and number of relationships created between items:</p>
<ul>
<li>Precision is calculated as the number of correctly matched relationships resulting from a strategy, divided by the total number of matched relationships. It can also be interpreted as the probability that a match is correct. Low precision indicates a high rate of false positives, which are incorrect relationships created by the strategy.</li>
<li>Recall is calculated as the number of correctly matched relationships resulting from a strategy, divided by the number of true (expected) relationships. It can also be interpreted as the probability that a true (correct) relationship will be created by the strategy. Low recall means a high rate of false negatives, which are relationships that should have been created by the strategy but were not made.</li>
</ul>
<figure class="blog-figure"><img src="/img/blog/matching-perfect/false-positives-negatives.png"
         alt="False positives and false negatives"/><figcaption>
            <p>The diagram depicts false negatives and false positives. The ideal outcome would be that the ellipses are identical, matched relationships are exactly the same as true relationships, and there are no false negatives or false positives. In practice, we try to make the intersection as big as possible.</p>
        </figcaption>
</figure>

<p>The tradeoff between precision and recall roughly means that modifying the strategy to improve recall will decrease precision, and vice versa.</p>
<p>Imagine, for example, we received a report about a relationship that was missed by matching because of a partial, noisy, or ambiguous input. We might be tempted to resolve this issue by relaxing our matching criteria. Unfortunately, this will have a cost of a higher overall rate of false positive matches.</p>
<p>Conversely, if we encounter a case where the matching has returned an incorrect match, we might attempt to make the matching strategy stricter to avoid this result. We should remember, however, that this may have the consequence of causing the strategy to skip many perfectly valid matches.</p>
<figure class="blog-figure"><img src="/img/blog/matching-perfect/precision-recall-tradeoff.png"
         alt="The tradeoff between precision and recall"/><figcaption>
            <p>The tradeoff between precision and recall. (a) A strict strategy prioritises precision over recall resulting in more false negatives. (b) A relaxed strategy prioritises recall over precision resulting in more false positives.</p>
        </figcaption>
</figure>

<p>Striking this balance becomes even more difficult when attempting to address multiple issues at once, or considering constraints like the time and resources consumed by each aspect of the strategy. Each choice can compound the individual effects in unanticipated and expensive ways. The aim of matching ultimately then can&rsquo;t be to achieve perfect results for every single case. Fixing one particular situation might not be desirable, as it can result in breaking multiple other cases. Instead, we have to find a locally optimal balance that optimises the strategy&rsquo;s utility, relative to these inherent limitations. This means accepting some level of imperfection as not just inevitable, but necessary for implementing a workable strategy. When you consider all this, you might conclude that …</p>


<h2 id="myth-3-we-shouldnt-do-large-scale-unsupervised-matching">Myth #3: We shouldn&rsquo;t do large-scale, unsupervised matching 
</h2>
<p>Imperfect matching strategies, when applied automatically to real-world large datasets, might:</p>
<ul>
<li>Fail to discover some relationships (false negatives), an outcome that may not be terribly problematic. In the worst case scenario, we have wasted a great deal of effort developing matching strategies that do not improve our metadata.</li>
<li>Create incorrect relationships between items (false positives), what seems like a potentially larger problem, where we have added incorrect relationships to the metadata.</li>
</ul>
<p>Many have the instinct to avoid false positives at any cost, even if this means missing many additional correct relationships at the same time. They might come to the conclusion that if we cannot have 100% precision (see our previous myth), we simply should not allow matching strategies to act in an automated, unsupervised way on large datasets. While there might be circumstances where this belief is rational, in the context of the scholarly record, this notion is seriously flawed.</p>
<p>First, if you are dealing with any medium to large-sized dataset, it almost certainly contains errors, even before you apply any automated processing to it. Even if data is submitted and curated by users, they can still make mistakes, and might themselves be using automated tools for extracting the data from other sources, without your knowledge. It is thus not entirely obvious that applying an (imperfect) matching strategy to create more relationships would actually make the data quality worse.</p>
<p>Second, while we cannot eliminate all matching errors, we can place a high priority on precision when developing strategies, with the aim of keeping the number of incorrectly matched results as low as possible. We can also make use of additional mechanisms to easily correct for incorrectly matched results, for example doing so manually, in response to error reports.</p>
<p>Finally, the results of matching should always contain provenance information to distinguish them from those that have been manually curated. This way, the users can make their own decisions about whether to use and trust the matching results, relative to their use case.</p>
<p>By applying those additional checks, we can minimize the negative effects of incorrect matching, while at the same time reap the benefits of filling gaps in the scholarly record.</p>


<h2 id="myth-4-we-can-only-ever-guess-at-the-accuracy-of-our-matching-results">Myth #4: We can only ever guess at the accuracy of our matching results 
</h2>
<p>In attempting to determine the correctness of our matching, we immediately encounter a number of inherent limitations. The sheer amount of entries in many datasets prevents a thorough, manual validation of the results, but if instead, we use too few or specific items as our benchmarks, these are unlikely to be representative of overall performance. The unpredictable nature of future data adds another wrinkle: will our matching always be as successful as when we first benchmarked it or will its performance degrade relative to some change in the data?</p>
<p>With so many unknowns, are we then doomed? No! We have rigorous and scientific tools at our disposal that can help us estimate how accurate our matching will be. How do we use them? Well, that is a big and fairly technical topic, so we will leave you with this little cliffhanger. See you in the next post!</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:adam@ror.org">adam@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this post, we discuss a few common myths about metadata matching that are often encountered when interacting with users, developers, integrators, and other stakeholders.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: How and Why Optica Publishing Group Uses ROR</title><id>https://doi.org/10.71938/z07b-1n79</id><link rel="alternate" href="https://ror.org/blog/2024-08-23-case-study-optica-publishing-group/"/><published>2024-08-23T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/optica/opticapg-logo.png"
         alt="Optica Publishing Group logo"/>
</figure>

<p><a href="https://opg.optica.org/">Optica Publishing Group</a> was one of the earliest publishers to send ROR IDs to Crossref in DOI metadata. In this interview, we speak with them to learn more about their rigorous processes for cleaning organization names.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;As part of discussions leading to the release of Version 2 [of JATS4R], we talked a great deal about ROR being probably the dominant identifier for affiliations in the near future.&rdquo;</p>
<p>&ldquo;We want to be good citizens of our publishing community and contribute as much metadata as possible, as much <em>correct</em> metadata as possible, to Crossref as an exchange mechanism among publishers.&rdquo;</p>
<p>&ndash; Sasha Schwarzman, Content Technology Architect, Optica Publishing Group</p>
<hr>
<p>&ldquo;The messier the data, the more we need ROR.&rdquo;</p>
<p>&ldquo;ROR has been a really great tool, and it&rsquo;s really helped our efforts, so we really appreciate that it exists.&rdquo;</p>
<p>&ndash; Chris Iannicello, Director of Business and Content Intelligence, Optica Publishing Group</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s2-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Chris and Sasha, can you tell me your names, titles, and organizations?</p>


<h3 id="hahahugoshortcode-s3-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>Chris Iannicello, Director of Business and Content Intelligence, Optica Publishing Group.</p>


<h3 id="hahahugoshortcode-s4-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>My name is Sasha Schwarzman. My position is Content Technology Architect with the Optica Publishing Group.</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about Optica Publishing Group. When was it founded, what does it do, and what makes it unique?</p>


<h3 id="hahahugoshortcode-s6-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p><a href="https://www.optica.org">Optica</a> was founded in 1916 as the Optical Society of America, and the first issue of the <a href="https://opg.optica.org/josa/browse.cfm">Journal of the Optical Society of America</a> was published in 1917. In 2021, the organization&rsquo;s name was changed to Optica to reflect the society’s global membership, and <a href="https://opg.optica.org/">Optica Publishing Group</a> became its publishing arm.</p>


<h3 id="hahahugoshortcode-s7-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>The unit we work in, Optica Publishing Group, is responsible for publishing optics and photonics scientific journals and books. There&rsquo;s also an outreach program that&rsquo;s affiliated with the society, which is a nonprofit, and that involves hosting meetings and conferences. The society also engages with young scholars to promote the field of optics and photonics. We also get involved with legislation in Congress related to issues that affect scientific research and funding.</p>


<h3 id="hahahugoshortcode-s8-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. And how many journals and publications does Optica Publishing Group produce?</p>


<h3 id="hahahugoshortcode-s9-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>Optica Publishing Group&rsquo;s portfolio offers the largest peer-reviewed optics and photonics content collection. It includes 21 peer-reviewed publications, co-publications, and papers from more than 1,150 meetings and conferences, including associated videos. The content comprises more than 460,000 journal articles, conference papers, and videos.</p>


<h3 id="hahahugoshortcode-s10-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Gotcha. So when did you hear about ROR for the first time?</p>


<h3 id="hahahugoshortcode-s11-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>A few years ago, I was attempting to standardize our affiliation data a little better &ndash; clean it. And I came across <a href="https://grid.ac">GRID</a> at that time, maybe three or four years ago. I started using GRID IDs informally in my own cleaning exercises against our affiliation data. And by affiliation data, I mean both author institutions and also institutions that subscribe to our content. Both those lists require different degrees of cleaning. Then I heard the <a href="https://www.digital-science.com/news/grid-passes-torch-to-ror/">news that GRID was being sunset</a>, and I started learning about ROR. So about two years ago, I really started to use ROR exclusively, once I knew GRID was going away.</p>


<h3 id="hahahugoshortcode-s12-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>I think I heard about the ROR system four years ago, and I believe it was at a <a href="https://jats.nlm.nih.gov/jats-con/">JATS-Con</a> conference that I attended. At JATS-Con, we always talk about persistent identifiers. When ROR came along, of course, there was an announcement, and then there was, I believe, <a href="https://doi.org/10.6084/m9.figshare.8137961.v1">a presentation from the ROR folks</a>. And ever since, it has been on my radar. Then, when <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">Crossref announced that the Funder Registry is going to be sunset</a>, that really caught my attention. Also, I&rsquo;m on the <a href="https://jats4r.niso.org/authors-and-affiliations-v2">JATS4R Author and Affiliations working group</a>. <strong>As part of discussions leading to the release of Version 2, we talked a great deal about ROR being probably the dominant identifier for affiliations in the near future.</strong></p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Ah, yes, I&rsquo;m so glad you mentioned that, because we have looked at the JATS4R proposals about affiliations during the open comment period, and they looked good to us, so we&rsquo;re pleased to know that ROR is such a key part of that. I think we didn&rsquo;t leave any comments, because we were just pleased with them as they were.</p>


<h3 id="hahahugoshortcode-s14-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>We did retain examples with ISNI and Ringgold, because you have to include those.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes. Understandable. Chris, you mentioned that you were using first GRID and then ROR in your cleaning projects for both author affiliations and institutional subscribers. I&rsquo;m curious about what that cleaning involves and why it&rsquo;s important.</p>


<h3 id="hahahugoshortcode-s16-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>The second question is a lot shorter to answer than the first, so I&rsquo;m going to give the answer to the second question first, which is, <strong>the messier the data, the more we need ROR.</strong> For years, when we accepted submissions for manuscripts, which may or may not become a published article, the author could submit what institution, college, university, or laboratory they&rsquo;re affiliated with. It was a free-form field, and they could put as much text as they wanted. So we have data that may mention the university in it, but it also has the address, the PO box, the phone number, and it&rsquo;s one piece of metadata, it&rsquo;s not split up in our database. So the value of ROR to us is just to standardize the identity of all these places. We may have hundreds of different permutations of the same institution. To me, that&rsquo;s the big value: to have a way of quality checking the information we have internally.</p>
<p>The information we&rsquo;re asking for now is cleaner, because not only have we required cleaner data, but we&rsquo;ve also integrated the ROR API so that authors can choose a ROR affiliation when they submit a manuscript, although not a large percentage is doing that as of yet.</p>
<figure class="blog-figure"><img src="/img/blog/optica/Optica_ROR_Production.png"
         alt="ROR-powered organization pick list in Optica Publishing Group manuscript submission interface"/><figcaption>
            <p>ROR-powered organization pick list in Optica Publishing Group manuscript submission interface</p>
        </figcaption>
</figure>

<p>The answer to the first question about the cleaning process is that it&rsquo;s a very involved process where I take that metadata that I mentioned, which has a lot of information, and I attempt to extract the university or the laboratory or the the part of the metadata that is actually saying what the institution or affiliation is. And that is a multi-step cleaning process that has nothing to do with ROR: it just has to do with delimiting our data to the point where I can find the best possible part of the metadata of that might be an institution. Once I do that, I take that list and I put it against the ROR data. Sometimes I&rsquo;ll get a one-to-one text match, and we consider that a match. If it&rsquo;s not a one-to-one text match, then I run it against your API, line by line. And then I review those results manually.</p>


<h3 id="hahahugoshortcode-s18-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Which is fantastic, by the way. Not everybody does that much work to get to that level of quality. I have to say, too, you&rsquo;re by no means alone in having accepted affiliations as text strings for so long. I think that&rsquo;s an endemic problem in publishing that for many years, even after the move to digital, people had a free-text field for that kind of data.</p>


<h3 id="hahahugoshortcode-s19-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>When I get to the point of manual review, I have the cleaned affiliation string from our data, and the first thing I&rsquo;ll do is see if there&rsquo;s a URL involved, because that&rsquo;s a much cleaner way to find a match, since you don&rsquo;t have to worry about permutations of names and the URL changes a lot less often than the name does. If that doesn&rsquo;t work, then I&rsquo;ll just start Googling the name until I find something that looks like it has a ROR ID.</p>
<p>In that process, I&rsquo;m searching the harvested ROR data that I have taken from your <a href="https://ror.readme.io/v2/docs/data-dump">data dump</a>, which I put into a business intelligence tool called <a href="https://www.tableau.com/">Tableau</a>, and I&rsquo;ve included the hierarchy data, which helps. I try to run it up to the highest parent in the <a href="https://ror.readme.io/v2/docs/relationships">hierarchy chain</a> when I assign the ROR ID or name. Sometimes I&rsquo;m not able to do that, just because it&rsquo;s too time-consuming and might not be granular enough anyway (for example, the &ldquo;US Department of Defense” might be too broad as opposed to its children like &ldquo;US Army&rdquo; and &ldquo;US Navy&rdquo;).</p>
<figure><img src="/img/blog/optica/Optica_ROR_Tableau.png"
         alt="Optica Publishing Group&#39;s list of organizations and ROR IDs in Tableau."/><figcaption>
            <p>Optica Publishing Group&rsquo;s list of organizations and ROR IDs in Tableau.</p>
        </figcaption>
</figure>

<p>I have a mechanism where you can see the child or the parent: you can toggle between the two, so if you want to see lower numbers, more bundled, you toggle to the parent for things like article counts and author accounts. The granularity might not be perfect, because I only have two data points, the institution I found a match for and its respective parent, no matter how many generations it is up the chain to that parent. I don&rsquo;t specify that.</p>
<p>Once I do all that, then I can start reporting on the data and presenting it to stakeholders internally. I give them the choice between looking at the ROR ID or looking at the raw strings in the same row broken out if they ask for it, which obviously makes it a much longer report. Basically, those are the major steps.</p>


<h3 id="hahahugoshortcode-s21-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I see.</p>


<h3 id="hahahugoshortcode-s22-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>I don&rsquo;t do any automated matching, because the information we&rsquo;re using this for is too important. We&rsquo;re not doing broad trending with this data. We&rsquo;re doing reports that require very specific article counts, author affiliations, and articles per author affiliation per year. We need exact numbers or as close as we can get to it. I&rsquo;ve been on <a href="https://youtu.be/Tx5y7lX030U?si=U8DClGS2bOzEijp3">the webinars where you&rsquo;ve discussed automatic matching</a>, and I&rsquo;ve heard <a href="https://ror.readme.io/v2/docs/matching">all the techniques people are using</a>, and they&rsquo;re great, and I understand why they&rsquo;re doing well. But we just have to trust the data 100% for our use cases.</p>


<h3 id="hahahugoshortcode-s23-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What are those use cases? In other words, what&rsquo;s the purpose of your reports?</p>


<h3 id="hahahugoshortcode-s24-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>We&rsquo;re using them for prospective business arrangements. Sometimes it&rsquo;s with a single university, sometimes it&rsquo;s with a consortium, sometimes it&rsquo;s with a whole country for <a href="https://scholarlykitchen.sspnet.org/2019/04/23/transformative-agreements/">Read-and-Publish deals or Publish-only deals</a>. In those deals, when you&rsquo;re discussing the details, it&rsquo;s very important to know the historical counts of authors who have written articles through Optica Publishing Group, because those counts determine the basis for the amount of the deal.</p>
<p>If you&rsquo;re not far along in the negotiation process, you may just want an order of magnitude, so you can do a quick keyword search of affiliations that may have more than one name. Now that might be done against ROR data, or that might be done against our clean affiliation data, or it might be both. That&rsquo;s more informal. But when you get closer to finalizing a deal, then the exact counts become important.</p>
<p>But we also want to apply ROR against our internal customer data, against all of the Optica society members, all of whom have affiliation data. If we take a cross-section of all our members, what are the ten leading affiliations that they&rsquo;re from? I&rsquo;ve run the ROR API against those lists, and I&rsquo;ve taken fuzzy matches on that, and in the reports I&rsquo;ve specified that these are fuzzy. Maybe one-tenth of them are clean. I&rsquo;ve set up the reports so you can toggle between the fuzzy matches and the clean matches, so you can remove the fuzzy matches from the report completely, and then you&rsquo;re only looking at the ones that I manually checked. It&rsquo;s a toggle that says, &ldquo;manually checked&rdquo; or &ldquo;fuzzy match.&rdquo; And I uncheck the fuzzy match by default, because I know it&rsquo;s risky to report that. So, if they just want to see general trends, they can include fuzzy matches.</p>
<p>A related use case for that data is to try to identify organizations that have a lot of authors who publish articles with us but do not subscribe to our content.</p>


<h3 id="hahahugoshortcode-s25-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s a great use case, of course, because it&rsquo;s not even just a business opportunity, like &ldquo;Hey, maybe you should subscribe.&rdquo; That&rsquo;s good for research, because if those researchers are publishing in your journals, then they probably want to be reading them as well.</p>


<h3 id="hahahugoshortcode-s26-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>Yes, that&rsquo;s the whole sales pitch. Researchers from your institution have published thirty articles in the past two years in our journals, but if they go to read them in your libraries, they can&rsquo;t get to them, or at least not to the non-open-access ones, anyway.</p>


<h3 id="hahahugoshortcode-s27-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s28-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>I want to elaborate on one of the use cases that Chris mentioned, which is the case where we already use ROR IDs for funding purposes. Specifically, as far as open access is concerned, there are two business models. Either it&rsquo;s a funder who pays for open access, or it is an institution that pays for open access. When the institution pays, there are two variations. The first is a Read-and-Publish deal with a library consortium or with a particular institution that pays for open access. The second variation is a Publish-only deal, which can apply to open-access and hybrid journals.</p>
<p>Because, in both cases, it is an institution rather than a funder that pays for open access, it is crucial to identify the institution correctly, and we use ROR IDs to do so. We have already implemented that, and thanks to Chris&rsquo;s efforts, we have very strict one-to-one correspondence between an institution and a ROR ID. If the ROR ID belongs to a particular Read-and-Publish deal, we send the information to <a href="https://www.copyright.com/solutions-rightslink-scientific-communications/">RightsLink</a>, a Copyright Clearance Center service. They clear it with the institution, and then the publisher gets paid.</p>
<p>As part of the extensive quality control that each journal article undergoes, I use <a href="https://schematron.com/">Schematron</a> to check the correspondence between an eligible affiliation and its ROR ID. If they don&rsquo;t match, there is a QC option to reconcile them based on the ROR Registry. That is an actual use case where we already use ROR IDs for those Read-and-Publish or Publish-only deals.</p>
<p>Of course, as Chris said, we can use this data to know which institutions have published with us but haven&rsquo;t signed a deal with us yet. Or maybe the deal needs to be renegotiated because we&rsquo;re paid for this number of authors, but many more have published with us, so maybe we need to renegotiate the terms. That is very important from the business point of view.</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes. I&rsquo;m curious about RightsLink. Do they accept ROR IDs as an input, or do they only accept names?</p>


<h3 id="hahahugoshortcode-s30-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>Both. We can specify three different identifier types to verify if an accepted manuscript is a match on the RightsLink side. It&rsquo;s the name, the ROR ID, or the email domain. Or more than one if needed.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Interesting!</p>


<h3 id="hahahugoshortcode-s32-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>We&rsquo;re giving them the ROR ID and the ROR name. If there&rsquo;s a submitted manuscript that comes in from a region that we may have a deal with, where the author didn&rsquo;t choose a ROR value when they submitted, then I get an alert. And I&rsquo;ll go and manually match it,  which I can do in our internal production system. It happens pretty often.</p>
<p>Now, that doesn&rsquo;t mean they&rsquo;re part of the deal, but they&rsquo;re from an area where they might be. I have configured the system to get alerts for certain areas, and, worst-case scenario, they are not part of the deal, and now we just have one more matched ROR in our data. And if I had more time, I&rsquo;d go to every single paper that wasn&rsquo;t matched and do that, but that&rsquo;s not on my radar right now. We certainly could do that, because the <a href="https://ror.readme.io/v2/docs/rest-api">ROR API</a> is very helpful in that regard with the autofill and the drop down, and now we have the related organizations, so it&rsquo;s really a very useful tool just for matching purposes.</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s great. The other thing that I&rsquo;m really curious about, and I don&rsquo;t know if you know this, but Optica Publishing Group is one of the top contributors of ROR IDs in DOI metadata to <a href="https://crossref.org">Crossref</a>. We haven&rsquo;t yet talked about the fact that you&rsquo;re doing that, that you&rsquo;re making sure to include author affiliations with ROR IDs in Crossref metadata. Can you tell me a little bit more about why and how you do that?</p>
<figure class="blog-figure"><img src="/img/blog/optica/optica-crossref-prep.png"
         alt="Optica Publishing Group&#39;s Crossref Participation Report showing 59% of their current content has ROR IDs"/><figcaption>
            <p><a href="https://www.crossref.org/members/prep/285">Optica Publishing Group&rsquo;s Crossref Participation Report</a> showing that 59% of their current journal articles include ROR IDs</p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s35-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>As far as “why” is concerned, it&rsquo;s because of the culture of the OPG in general, and my responsibility in particular, is to ensure that all the metadata has integrity and quality. <strong>We want to be good citizens of our publishing community and contribute as much metadata as possible, as much <em>correct</em> metadata as possible, to Crossref as an exchange mechanism among publishers.</strong> We try to be as accurate as possible, and we don&rsquo;t see why we wouldn&rsquo;t contribute this data, since it goes through the rigorous quality control that Chris has described and through other processes to make sure our persistent identifiers are correct.</p>
<p>As far as “how,” we have a process called fulfillment. Our colleague Jennifer Mayfield ensures that all metadata that goes to Crossref is correct. If there are any problems, Crossref reports them, and then we redeposit our metadata so that everything is accurate.</p>


<h3 id="hahahugoshortcode-s36-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And I would imagine that you are probably doing <a href="https://www.crossref.org/documentation/register-maintain-records/direct-deposit-xml/">XML registration with Crossref</a>?</p>


<h3 id="hahahugoshortcode-s37-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>Yes, definitely. We use a subset of <a href="https://jats.nlm.nih.gov/">JATS</a> and then transform it to the XML that Crossref accepts.</p>


<h3 id="hahahugoshortcode-s38-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What&rsquo;s next for you with ROR? I think you&rsquo;re working on cleaning up your funder data and matching that to ROR IDs?</p>


<h3 id="hahahugoshortcode-s39-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>I&rsquo;ll start by explaining what we currently do, which will help clarify what we need to do moving forward and how ROR IDs will factor into that. We currently use the <a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry</a> to assign DOIs to funders. We go through the registry from top to bottom, categorizing funders into three main groups: those requiring a CC BY license, those participating in <a href="https://www.chorusaccess.org">CHORUS</a>, and the frequent funders of optics and photonics research. Our goal is to ensure we&rsquo;re capturing all relevant funders.</p>
<p>We maintain a list of top ancestors in each category, and this list evolves over time. For instance, we might learn that a funder now requires a CC BY license or there is a new CHORUS participant. The license information varies depending on these three categories. Each funder has descendants, sometimes quite a few. If an ancestor requires a CC BY license, then all its descendants do, too. So, each time a new release of the Open Funder Registry comes out, we go through the registry to find all the descendants in each category. Some funders are particularly dynamic. For example, the United States Department of Defense has frequent changes, and in the United Kingdom, certain funders in the CC BY category also change often.</p>
<p>We use Schematron technology, where we say, &ldquo;All right, we found this funder.&rdquo; We&rsquo;ve integrated the Open Funder Registry widget into our submission system, so the XML data usually includes the funder DOI. However, sometimes, an author can’t find a match for a funder and types something in that&rsquo;s not quite correct. Or, during production, an author may realize they forgot a funder and then provide its name but not the DOI. In these cases, using Schematron, we ensure that the funder&rsquo;s name matches its DOI. If there’s no match, we prompt the user with the most likely DOI, using “preferred label,” that is, the official name of the funder. This step is critical because it&rsquo;s important to have the correct license, like a CC BY license, or to know if it&rsquo;s a CHORUS funder, which may affect the embargo period and so on.</p>
<p>Now, with the news that the Open Funder Registry will sunset and be replaced by the ROR Registry, my focus in the coming months will be figuring out how to replicate or redesign this process to maintain the quality control we need. The first question is about coverage. Thanks to Chris, we have many years&rsquo; worth of data on all the funders of research published in our journals. Are they all in the ROR Registry? We’ve been working with you to identify any missing ones because we can’t switch until the ROR Registry covers a significant portion of what we have.</p>
<p>We also need to figure out how to manage this process. How do we find those descendants? What tools do we need? Should we use the ROR API or the data dumps, which I imagine will be similar to the Open Funder Registry releases? These are very different data sources. In the coming months, I’ll be exploring the ROR Registry to determine whether to use the API or the data dumps and how to recreate or maybe even invent new processes that give us the same level of control we currently have with the Funder Registry.</p>


<h3 id="hahahugoshortcode-s40-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, that&rsquo;s exactly what I wanted to hear. We&rsquo;re hoping to work quite closely with you to make sure that all of the funders that you need to have in ROR are in ROR, assuming they&rsquo;re in scope. We&rsquo;ve had some correspondence with Chris about this already.</p>
<p>The last I checked, <a href="https://ror.org/blog/2023-10-12-ror-funder-registry-overlap/">about 54% of Funder Registry records have equivalents in ROR</a>, and those are the most frequently used funders. We&rsquo;re continuing to go down the list of less frequently used funders and updating ROR to include them when possible.</p>
<p>We&rsquo;ve been talking with CHORUS, as well, and they are also in the process of building ROR into their workflows. They&rsquo;ve <a href="https://www.chorusaccess.org/chorus-workflow-for-japan-explained/">got it in some of their services</a> and in their public API. And on the <a href="https://doi.org/10.5281/zenodo.12549191">recent Crossref community call</a>, Crossref emphasized that they&rsquo;re prioritizing accepting ROR IDs in the schema where they currently accept Funder IDs.</p>


<h3 id="hahahugoshortcode-s41-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>By the way, this is in Chris&rsquo;s purview, but we have years and years and years of data about funding based on the DOIs of those funders, and then when we switch, there will be years and years of data based on ROR. And I wonder if we need to have business intelligence covering the range of years whether we will have to account for two identifiers for the same funder, which may or may not be a challenge.</p>


<h3 id="hahahugoshortcode-s42-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>I&rsquo;m not sure. I mean, it&rsquo;s pretty straightforward to look for one and then the other if the first one was missing. But as far as my part in this effort, we have about 13,000 different funder names, and maybe half of them have a DOI. And we&rsquo;re focusing on the ones that have a DOI for now, and of those 6000 or so that have a DOI, I&rsquo;ve matched about 4500 of them. Most of them were not matched manually: I was able to find a match through the DOI in ROR through an automatic name match. I was only able to manually match a couple hundred of those so far, but I haven&rsquo;t spent a lot of time on that part of it yet. The remaining 1400 or so I will probably match manually through the year. And the ones that don&rsquo;t have a DOI we may never match, as it&rsquo;s just very time-consuming to match it that way. Maybe we&rsquo;d apply a fuzzy match to it, if someone requests some kind of high-level reporting.</p>


<h3 id="hahahugoshortcode-s43-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Also, we are working on an additional service to match names to ROR IDs. We&rsquo;re doing that in conjunction with Crossref. <a href="https://ror.readme.io/v2/docs/api-affiliation">Our API</a>, as you&rsquo;ve mentioned, Chris, does do a good job of matching, but we&rsquo;re looking at developing a matching service that will do an even better job of that.</p>
<p>One question I&rsquo;d love to ask, and we&rsquo;ve touched on this already: What do you hope ROR does in the future? What problems could we fix for you, or what changes could we make that would make your lives easier?</p>


<h3 id="hahahugoshortcode-s44-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>Any type of improvement for matching would be great. Perhaps a mechanism to take in verified matches from me and others to use as training data for some kind of machine learning tool. We&rsquo;ve dabbled in that a little bit, internally, and the results were not good, so it may be a non-starter. I&rsquo;m assuming that it won&rsquo;t be a non-starter forever; if the AI is not up for the challenge now, I&rsquo;m sure it will be soon enough. That might be something to pursue down the line, because I can provide about 5,000 manually matched records.   There might be a mistake in there, but I&rsquo;ve looked at each one. It could be a basis for all kinds of matching down the line, perhaps.</p>


<h3 id="hahahugoshortcode-s45-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>No, I think you&rsquo;re really right. We have done some of that with the American Physical Society. They also did a lot of heavy curation where you can be really sure that this particular variant of this name matches this ROR ID. We have <a href="https://github.com/ror-community/affiliation-matching-experimental">a GitHub repository where we store both the code and the data for that matching service</a> that I mentioned, and in the <code>test_data</code> folder, there are some datasets. The APS dataset has some verified matches that they have manually curated in the same way that you do. The Crossref publisher assertions are from the Crossref API where the publisher has asserted that this name matches this ROR ID. We think that&rsquo;s probably less reliable than one that was manually reviewed by the publisher. I&rsquo;ll have to see if we could use some additional datasets for that project or if it&rsquo;s too far along.</p>


<h3 id="hahahugoshortcode-s46-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>Sounds good.</p>


<h3 id="hahahugoshortcode-s47-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>One particular challenge is dealing with Chinese funders. Funders there change rather frequently, and because of translation challenges it&rsquo;s not always clear whether the author specified the official funder&rsquo;s name and what funder that is. That is, I think, a particular challenge in dealing with Chinese funders.</p>


<h3 id="hahahugoshortcode-s48-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, that&rsquo;s been a priority for us. As you know, in our March community call we had <a href="https://youtu.be/jRic-l8Nivs?si=EzZQ4VWc56FeibIq&amp;t=1112">a presentation from Jackson Huang</a>, who was a LEADING Fellow with us and is now a member of our Curation Board, and who is a Chinese speaker who&rsquo;s done some work on that. The focus wasn&rsquo;t specifically on funders; but more like Chinese-language organizations in ROR generally.</p>
<p>What else would you like to say about ROR, or about the Optica Publishing Group, or about metadata, or about anything at all?</p>


<h3 id="hahahugoshortcode-s49-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>I don&rsquo;t have anything else other than <strong>it&rsquo;s been a really great tool, and it&rsquo;s really helped our efforts, so we really appreciate that it exists.</strong></p>


<h3 id="hahahugoshortcode-s50-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>I&rsquo;m looking forward to unifying the IDs for the funders and affiliations. It does make sense. I realize it will be challenging, but in the long run, I think it&rsquo;s the right thing to do, especially because the ROR Registry is actively and openly curated and has an active user base. So I am looking forward to this development and to this challenge.</p>


<h3 id="hahahugoshortcode-s51-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. Yes. I&rsquo;m glad to hear that. I think it&rsquo;s going to be very exciting to see everything that comes out of this. Thank you both for taking the time to talk to me.</p>


<h3 id="hahahugoshortcode-s52-hbhb-chris-iannicello"><figure class="round-figure"><img src="/img/blog/optica/chris-iannicello.jpg"
         alt="Avatar of Chris Iannicello"/>
</figure>
 Chris Iannicello 
</h3>
<p>And we thank you so much.</p>


<h3 id="hahahugoshortcode-s53-hbhb-sasha-schwarzman"><figure class="round-figure"><img src="/img/blog/optica/sasha-schwarzman.jpg"
         alt="Avatar of Sasha Schwarzman"/>
</figure>
 Sasha Schwarzman 
</h3>
<p>Thank you very much!</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Optica Publishing Group was one of the earliest publishers to send ROR IDs to Crossref in DOI metadata. In this interview, we speak with them to learn more about their rigorous processes for cleaning organization names.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Using ROR for Funder Identification</title><id>https://doi.org/10.71938/xfhb-sm73</id><link rel="alternate" href="https://ror.org/blog/2024-08-06-using-ror-for-funder-identification/"/><published>2024-08-06T00:00:00+00:00</published><updated>2025-10-21T08:59:33-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<figure class="featured-figure"><img src="/img/blog/use-case-funders/ror-for-funders.png"
         alt="ROR logo and funder icon"/>
</figure>

<p>Several widely used scholarly systems are using ROR IDs to identify and disambiguate funding organizations. In this post, we’ll give you a glimpse of how and why these systems use ROR IDs as funder identifiers.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-info'></i></div><div class="col-md-10"><span>For help with switching to ROR for funder identification in your own system, visit our guide to the <a href="https://ror.readme.io/docs/funder-registry">Transition from Open Funder Registry to ROR.</a></span></div>
	</div>
</div>
<p>You might already know that for the last ten years, <a href="https://www.crossref.org/services/funder-registry/">the Crossref Open Funder Registry</a> (formerly FundRef) has been the default identifier for funding organizations. You might also know that last fall, Crossref <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">announced</a> &ldquo;a long-term plan to deprecate the Open Funder Registry&rdquo; in favor of ROR so that publishing systems and workflows can use one open registry, not two, to identify organizations.</p>
<p>If you&rsquo;re interested in learning more about that transition, you can consult our guide to the <a href="https://ror.readme.io/docs/funder-registry">Transition from Open Funder Registry to ROR</a> or read <a href="https://ror.org/tags/funders/">previous ROR blog posts</a>. But if you&rsquo;re interested in learning more about <strong>early adopters who are already using ROR to identify funders now</strong>, read on.</p>


<h2 id="repositories-are-ahead-of-the-curve">Repositories are ahead of the curve 
</h2>
<p>In general, <strong>repository systems are doing better than publishing systems</strong> in terms of adopting ROR for identifying funding organizations. One major factor in this phenomenon is undoubtedly <strong>DataCite&rsquo;s early support</strong> for this use case. <a href="https://datacite.org">DataCite</a>, the DOI registration agency that most data repositories use, has <a href="https://datacite.org/blog/identify-your-affiliation-with-metadata-schema-4-3/">supported ROR identifiers in the <code>fundingReferences</code> element of its metadata schema</a> since August of 2019. This early schema change encouraged data repositories to send ROR IDs to DataCite in DOI metadata. Note, too, that DataCite is now <a href="https://datacite.org/blog/guidance-for-registering-datacite-dois-for-awards/">encouraging funders to register DOIs for awards</a>, and ROR is DataCite&rsquo;s recommended funder identifier in award metadata.</p>
<p>DataCite&rsquo;s API shows that <strong>nearly one-third of DataCite DOIs with an identifier in the funding reference use ROR</strong> as the funder identifier. Out of a total of about 1.4 million DataCite DOIs with funding identifiers, over 432,000 of them include ROR IDs.</p>
<figure><img src="/img/blog/use-case-funders/datacite-funder-ids-by-type.png"
         alt="Funder identifiers by ID type in DataCite"/><figcaption>
            <p>DataCite records that have a value in <code>fundingReferences.funderIdentifier</code> as of July 15, 2024, broken down by <code>fundingReferences.funderIdentifier.Type</code>. See <a href="https://docs.google.com/spreadsheets/d/e/2PACX-1vQU_zvKDRXOFY56Zq6gAZyWBXWivQxPyDgAVP18bYDpNTgrM2pQFz4EzNc6I44kgTmhoVqkTJfef-HN/pubhtml">ROR DOI stats</a> spreadsheet for queries and data.</p>
        </figcaption>
</figure>

<p>Another factor that has driven early adoption of ROR for funder identification in repository systems is the U.S. National Institutes of Health (NIH) <a href="https://datascience.nih.gov/data-ecosystem/generalist-repository-ecosystem-initiative">Generalist Repository Ecosystem Initiative (GREI)</a>, whose primary aim is to develop &ldquo;a common set of cohesive and consistent capabilities, services, metrics, and social infrastructure across generalist repositories&rdquo; so that NIH-funded researchers can more easily discover and share their research data. Seven generalist repository systems are participating: <a href="https://dataverse.org/">Dataverse</a>, <a href="https://datadryad.org/stash">Dryad</a>, <a href="https://figshare.com">Figshare</a>, <a href="https://data.mendeley.com">Mendeley Data</a>, the <a href="https://osf.io">Open Science Framework (OSF)</a>, <a href="https://vivli.org">Vivli</a>, and <a href="https://zenodo.org">Zenodo</a>. All seven have <a href="https://medium.com/@blog-grei/kicking-off-greis-third-year-b8709ab45a48">agreed to implement ROR for author affiliations</a>, and some have also taken the opportunity to implement ROR for funder identification sooner rather than later. Dryad, Figshare, Zenodo, and Vivli implemented ROR to identify funders in 2023; Dataverse and OSF are actively planning to use ROR for funders soon, and Mendeley Data will do so at some point in the future. When data repositories use ROR as a standard funder identifier, it enables all funders, including the NIH, to track research datasets resulting from their funding more easily.</p>
<p>DataCite&rsquo;s API shows that all but one of the top five providers of ROR IDs for funders are GREI participants: Dryad, Zenodo, and Figshare together have registered over 100,000 DOIs that use ROR IDs in funding references. An additional repository operated by the Leibniz Institute DSMZ, <a href="https://straininfo.dsmz.de/">StrainInfo</a> (a collection of data about microorganisms and cell cultures), has used ROR IDs for funder identification in over 300,000 DataCite DOIs.</p>
<figure class="blog-figure"><img src="/img/blog/use-case-funders/datacite-funder-ids-by-client.png"
         alt="Funder IDs by client in DataCite"/><figcaption>
            <p>DataCite&rsquo;s API shows the top providers of ROR IDs in funding references. Note that <code>figshare.ars</code> includes figshare.com, Figshare+, and other Figshare repositories. Query: <code>https://api.datacite.org/dois?query=fundingReferences.funderIdentifierType:ROR&amp;page[size]=0</code></p>
        </figcaption>
</figure>

<p>Let&rsquo;s take a look at each of these systems to see how they&rsquo;ve implemented ROR for funders.</p>


<h2 id="dryad-is-the-first-to-switch-to-ror-for-funder-ids">Dryad is the first to switch to ROR for funder IDs 
</h2>
<p>As the <a href="https://ror.org/blog/2019-07-10-ror-ing-together-with-dryad/">very first system to adopt ROR</a> in 2019, Dryad has been using ROR for author affiliations longer than anyone else, and Dryad was almost as quick to adopt ROR for funding identification. Like many publishing and repository systems, Dryad originally used Crossref&rsquo;s Open Funder Registry to identify funders, but with the announcement that the Open Funder Registry is headed for deprecation, Dryad moved speedily to switch from Funder IDs to ROR IDs, completing the transition in <a href="https://github.com/datadryad/dryad-product-roadmap/issues/2840">October 2023</a>. This transition involved four main tasks:</p>
<ul>
<li>Changing Dryad&rsquo;s submission form funder lookup to use ROR instead of the Funder Registry</li>
<li>Converting existing Funder IDs in Dryad&rsquo;s data to ROR IDs</li>
<li>Enabling the capacity to send ROR IDs in new metadata deposits to DataCite</li>
<li>Updating the funder identifiers in Dryad&rsquo;s existing DataCite DOI records</li>
</ul>
<p>We can see the result of Dryad&rsquo;s work in their submission form. Now, in Dryad, the typeahead fields for &ldquo;Institutional affiliation&rdquo; and &ldquo;Granting organization&rdquo; both use ROR, and both helpfully reveal organization names, acronyms, and locations based on what the user types, just as we recommend in our <a href="https://ror.readme.io/v2/docs/forms">form guidelines</a>.</p>
<figure><img src="/img/blog/use-case-funders/ror-dryad.gif"
         alt="Dryad&#39;s affiliation and funder fields use ROR to power typeahead lookups."/><figcaption>
            <p>Dryad&rsquo;s affiliation and funder fields use ROR to power typeahead lookups.</p>
        </figcaption>
</figure>

<p><strong>Dryad is the only system of those we&rsquo;re discussing here to have transitioned entirely from using the Crossref Open Funder Registry for funder identification to using ROR for funder identification</strong>: other systems have either started fresh with ROR or continue to use multiple registries.</p>


<h2 id="zenodos-funder-model-is-based-on-ror">Zenodo&rsquo;s funder model is based on ROR 
</h2>
<p>Also in October 2023, the generalist data repository <a href="https://zenodo.org">Zenodo</a> completed its <a href="https://blog.zenodo.org/2023/10/13/2023-10-13-zenodo-rdm/">migration</a> to an open source research data management system called <a href="https://inveniosoftware.org/products/rdm/">InvenioRDM</a>. InvenioRDM was architected from the beginning with ROR for both author affiliations and funders, and other repositories that use InvenioRDM (e.g., <a href="https://data.caltech.edu">CaltechData</a>) therefore have similar capabilities as Zenodo.</p>
<p>Zenodo takes an &ldquo;award-first&rdquo; approach to connecting items to funders, asking the user to search for and select a specific award and then using the award identifier to pull in information about the funder who granted that award. Users also have the option to filter awards by a limited number of funders who participate in <a href="https://www.openaire.eu/">OpenAIRE</a>. Zenodo&rsquo;s &ldquo;standard award&rdquo; field pulls award and funder data from the <a href="https://doi.org/10.5281/zenodo.10488385">OpenAIRE Graph dataset</a>, which uses ROR as one of several organization identifier schemes.</p>
<figure><img src="/img/blog/use-case-funders/ror-zenodo-standard.gif"
         alt="Zenodo&#39;s award lookup pulls in both award and funder information from OpenAIRE"/><figcaption>
            <p>Zenodo&rsquo;s award lookup pulls in both award and funder information from OpenAIRE. See the InvenioRDM documentation of <a href="https://inveniordm.docs.cern.ch/customize/vocabularies/funding/#awards-openaire">Awards (OpenAIRE)</a> for more information.</p>
        </figcaption>
</figure>

<p>Users can also add a custom award – for instance, one that isn&rsquo;t provided by OpenAIRE – and the custom award lookup in Zenodo is powered exclusively by ROR data to suggest funders in the award lookup funder field. Zenodo shows both the country code and the unique portion of the ROR identifier next to the name of the funder in the list of suggestions.</p>
<figure><img src="/img/blog/use-case-funders/ror-zenodo-custom.gif"
         alt="Zenodo&#39;s custom award lookup uses ROR data to suggest funders."/><figcaption>
            <p>Zenodo&rsquo;s custom award lookup uses ROR data to suggest funders.</p>
        </figcaption>
</figure>

<p>Zenodo communities can also be hosted by funders, and these funders can also be identified by ROR IDs, as is the case for the GREI community managed by NIH.</p>
<figure><img src="/img/blog/use-case-funders/ror-zenodo-grei.png"
         alt="The NIH-funded GREI initiative maintains a community on Zenodo in which the NIH is associated with its ROR ID."/><figcaption>
            <p>The NIH-funded GREI initiative maintains a community on Zenodo at <a href="https://zenodo.org/communities/grei/">https://zenodo.org/communities/grei/</a> in which the NIH is associated with its ROR ID.</p>
        </figcaption>
</figure>

<p>Zenodo also exposes funder information in its API at <code>https://zenodo.org/api/funders</code>.</p>
<p>It&rsquo;s important to reiterate that this functionality isn&rsquo;t confined to Zenodo: the ROR-powered funder lookup is built into the system that runs Zenodo, <a href="https://inveniosoftware.org/products/rdm/">InvenioRDM</a>, so that anyone who uses InvenioRDM can also use ROR to identify funders. InvenioRDM&rsquo;s excellent documentation explains in detail how award and funder lookups work in the system and gives more detail about the ROR-based data model for funders.</p>
<figure><img src="/img/blog/use-case-funders/ror-inveniordm.png"
         alt="Documentation of the ROR-based funder model and award lookup in InvenioRDM, the open source system that runs Zenodo"/><figcaption>
            <p>Documentation of the ROR-based funder model and award lookup in InvenioRDM, the open source system that runs Zenodo, available at <a href="https://inveniordm.docs.cern.ch/customize/vocabularies/funding/">https://inveniordm.docs.cern.ch/customize/vocabularies/funding/</a>.</p>
        </figcaption>
</figure>

<p><strong>InvenioRDM is the first new repository system to launch with ROR as the basis of its organizational data model</strong>, but it surely won&rsquo;t be the last new repository system to do so.</p>


<h2 id="figshare-imports-ror-from-dimensions">Figshare imports ROR from Dimensions 
</h2>
<p>Both Figshare&rsquo;s central repository at <a href="figshare.com">figshare.com</a> and Oxford University&rsquo;s Figshare-powered institutional repository at <a href="https://portal.sds.ox.ac.uk/">https://portal.sds.ox.ac.uk/</a> (also available via <a href="https://oxford.figshare.com">https://oxford.figshare.com</a>) are contributing a significant number of ROR IDs as funder identifiers to DataCite metadata. Figshare, like Zenodo, asks the user to enter an award number or title and then pulls in funding information based on that award; in this case, Figshare pulls the grant title, grant ID, funder name, and ROR ID from <a href="https://www.dimensions.ai/">Dimensions</a> grant data and includes it in the Figshare item’s metadata record.</p>
<figure><img src="/img/blog/use-case-funders/ror-figshare.gif"
         alt="Figshare&#39;s award lookup pulls data from Dimensions, which includes ROR IDs."/><figcaption>
            <p>Figshare&rsquo;s award lookup pulls data from Dimensions, which includes ROR IDs.</p>
        </figcaption>
</figure>

<p>ROR was <a href="https://ror.org/about/#launch-of-the-ror-mvr">originally seeded</a> with the GRID organization identifiers and data used by Dimensions, so there is a <a href="https://grid.ac/">close relationship between ROR IDs and GRID IDs</a>, and most ROR and GRID records are <a href="https://ror.readme.io/v2/docs/ror-data-structure#external_ids">mapped to each other</a>. Dimensions&rsquo;s support documentation <a href="https://dimensions.freshdesk.com/support/solutions/articles/23000025993-how-are-organizations-represented-in-dimensions-">explains how ROR IDs are incorporated into its data</a>:</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-12"><span><p><strong>How are new ROR IDs added to Dimensions organization records?</strong></p>
<p>As part of every new organization data release we identify new ROR records, and examine each one of these for relevance to Dimensions, according to our inclusion criteria. Records that meet these criteria are added to Dimensions along with the relevant metadata.</p>
<p><strong>How is the ROR metadata maintained in Dimensions?</strong></p>
<p>We are also regularly checking for changes to ROR records which are already mapped to Dimensions organization records. When a mapped record changes in ROR, we examine if that change is also relevant for Dimensions and update relevant metadata as applicable.</p>
</span></div>
	</div>
</div>
<p>The curation work that Dimensions performs means that organization information in Figshare, including ROR metadata for funding organizations, remains up to date.</p>


<h2 id="straininfo-uses-ror-for-funders">StrainInfo uses ROR for funders 
</h2>
<p>The largest provider of ROR IDs in DataCite funding references is a repository called <a href="https://straininfo.dsmz.de/">StrainInfo</a>, which is &ldquo;a service developed to provide a resolution of microbial strain identifiers by storing culture collection numbers, their relations, and culture-associated data.&rdquo; StrainInfo is currently supported by the <a href="https://nfdi4microbiota.de/">NFDI4Microbiota</a> consortium, which is funded by the <a href="https://gepris.dfg.de/gepris/projekt/460129525?language=en">German Research Foundation (DFG)</a>, and is developed and maintained by <a href="https://www.dsmz.de/">Leibniz Institute DSMZ</a>.</p>
<figure><img src="/img/blog/use-case-funders/ror-straininfo.png"
         alt="Example of a strain catalogued in StrainInfo. Note the DataCite DOI in the upper left."/><figcaption>
            <p>Example of a strain catalogued in StrainInfo. Note the DataCite DOI in the upper left.</p>
        </figcaption>
</figure>

<p>StrainInfo indexes more than 290,000 microbial strains, each of which has its own DOI registered with DataCite, and each of which contains a funding reference in the metadata to award 460129525 issued by the German Research Foundation (DFG), whose ROR ID is <a href="https://ror.org/018mejw64">https://ror.org/018mejw64</a>.</p>
<figure class="blog-figure"><img src="/img/blog/use-case-funders/ror-straininfo-datacite.png"
         alt="DataCite metadata for https://doi.org/10.60712/SI-ID34969.2"/><figcaption>
            <p>DataCite metadata for <a href="https://doi.org/10.60712/SI-ID34969.2">https://doi.org/10.60712/SI-ID34969.2</a> available at <code>https://api.datacite.org/dois/10.60712/SI-ID34969.2</code>.</p>
        </figcaption>
</figure>

<p>Thus, while StrainInfo is currently responsible for the largest number of DOI records with ROR IDs in the <code>funderIdentifier</code> field to DataCite, all 300,000+ of those records (including versions with updated data for several strains) reference a single funder with a single ROR ID. As Ted Habermann noted in his 2019 piece <a href="https://doi.org/10.59350/4tbaw-m9382">How Many RORs Do We Need?</a>, it&rsquo;s good news for ROR adoption that &ldquo;some DataCite members only need to know a small number of RORs,&rdquo; since this can make implementation of ROR quite a bit easier.</p>


<h2 id="many-more-systems-use-ror-for-funders">Many more systems use ROR for funders 
</h2>
<p>The systems reviewed here are by no means the only ones using ROR IDs to identify funders.  Here&rsquo;s an alphabetical list of additional systems that are currently using ROR IDs for funder identification:</p>
<ul>
<li><a href="https://www.chorusaccess.org">CHORUS Dashboard and Reporting Services</a></li>
<li><a href="https://chronoshub.io/">ChronosHub Open Access Management Platform</a></li>
<li><a href="https://www.osti.gov/award-doi-service/">DOE Award DOI Service</a></li>
<li><a href="https://wiki.lyrasis.org/display/DSDOC8x/DSpace+8.x+Documentation">DSpace 8</a></li>
<li><a href="https://wiki.lyrasis.org/display/DSPACECRIS">DSpace-CRIS</a></li>
<li><a href="https://fairsharing.org">FAIRsharing</a></li>
<li><a href="https://gold.jgi.doe.gov/index">Genomes OnLine Database (GOLD)</a></li>
<li><a href="https://japanlinkcenter.org/">Japan Link Center (JALC)</a></li>
<li><a href="https://www.datasalon.com/web/index.htm">MasterVision and PaperStack by DataSalon</a></li>
<li><a href="https://oa.report">OA.Report</a></li>
<li><a href="https://openalex.org">OpenAlex</a></li>
<li><a href="https://orcid.org">ORCID</a></li>
<li><a href="https://proposalcentral.com/">ProposalCentral by Altum</a></li>
<li><a href="https://radar.products.fiz-karlsruhe.de/en/radarabout/ueber-radar">RADAR</a></li>
<li><a href="https://www.sherpa.ac.uk/">Sherpa Services</a></li>
<li><a href="https://vivli.org">Vivli Center for Global Clinical Research Data</a></li>
</ul>
<p>See the <a href="https://bit.ly/ror-integrations">full list of ROR Integrations</a> to see more systems that are using or planning to use ROR for all kinds of purposes beyond funder identification – author affiliations, Open Access deal management, publisher identification, internal statistics, and more. If you&rsquo;re managing a system that uses ROR for funder identification and you don&rsquo;t see it listed here, <a href="https://bit.ly/ror-integration-form">let us know</a>!</p>


<h2 id="whats-next">What&rsquo;s next 
</h2>
<p><strong>More and more systems will be using ROR to identify funders</strong> in the coming months and years as the deprecation of the Open Funder Registry nears. Crossref <a href="https://www.crossref.org/blog/roring-ahead-using-ror-in-place-of-the-open-funder-registry/">has been working hard</a> on building support for publishers to deposit ROR IDs as funder identifiers, and this capacity will be released before the end of the year. Moreover, as more and more funders register DOIs for awards using <a href="https://support.datacite.org/docs/registering-datacite-dois-for-awards">DataCite</a> or <a href="https://www.crossref.org/documentation/research-nexus/grants/">Crossref</a>, they will undoubtedly begin by using ROR IDs to identify their own organization as well as other funding institutions.</p>
<p>&ldquo;<strong>Open funding metadata is arguably the next big thing</strong>,&rdquo; <a href="https://www.crossref.org/blog/open-funding-metadata-community-workshop-report/">avers a recap of a funding metadata workshop</a> convened by Crossref last year, for many reasons: &ldquo;meta-researchers need this information in order to analyze research on research, editors are concerned with research integrity, including funding trends, and funders themselves need to track the reach and return of their support.&rdquo; This year alone, several lively discussions of how to improve funding metadata through the use of persistent identifiers in published research have been held, including:</p>
<ul>
<li>ROR Annual Meeting, <a href="https://ror.org/events/2024-01-31-why-we-all-need-good-funding-metadata/">Why We All Need Good Funding Metadata</a>, January 2024</li>
<li>Altum webinar, <a href="https://pages.altum.com/persistent-identifier-webinar">Advancing Research with Persistent Identifier (PID) Innovations</a>, February 2024</li>
<li>CHORUS Forum, <a href="https://www.chorusaccess.org/events/chorus-forum-navigating-the-future-of-persistent-identifiers-pids-in-scholarly-publishing-challenges-risks-and-opportunities/">Navigating the Future of PIDs in Scholarly Publishing</a>, May 2024</li>
</ul>
<p>The other commonly mentioned &ldquo;next big thing,&rdquo; AI, or more precisely machine learning, also has a bearing on funding metadata: Crossref and ROR are collaborating on an exciting project to build and evaluate robust, large-scale <a href="https://ror.org/tags/matching">matching</a> strategies that can reliably connect tens of thousands of funder names in Crossref metadata to ROR IDs. Nevertheless, because any such automated post-processing of metadata is bound to have some errors, it&rsquo;s important that good funding metadata be created and reviewed by humans as far upstream as possible, ideally during the manuscript submission process or publishing production process.</p>
<p><strong>So, in short, if you&rsquo;re in charge of a scholarly publishing system or workflow that&rsquo;s using the Open Funder Registry or one that isn&rsquo;t using funder identifiers at all, you should be thinking about using ROR.</strong> We&rsquo;re here to help!</p>
<ul>
<li>Are you using the Open Funder Registry? Start by consulting our <a href="https://ror.readme.io/docs/funder-registry">Funder Registry Transition Guide</a>.</li>
<li>Are you new to persistent identifiers and/or ROR? Start by browsing through our <a href="https://ror.readme.io">technical documentation</a>.</li>
</ul>
<p>ROR is a responsive, open, community-led endeavor, and we&rsquo;re happy to work with you to integrate ROR IDs into your scholarly publication metadata.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Still have questions? Write to us at <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Several widely used scholarly systems are using ROR IDs to identify and disambiguate funders. In this post, we’ll give you a glimpse of how and why these systems use ROR IDs for funder identifiers.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Re-introducing Participation Reports to Encourage Best Practices in Open Metadata</title><id>https://doi.org/10.71938/021v-z817</id><link rel="alternate" href="https://ror.org/blog/2024-07-25-re-introducing-participation-reports/"/><link rel="related" href="https://doi.org/10.64000/wxjpp-20570"/><published>2024-07-25T11:32:47-04:00</published><updated>2025-10-22T06:05:03-04:00</updated><author><name>Lena Stoll</name><uri>https://orcid.org/0009-0008-8562-7748</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Crossref, with the help of <a href="https://www.cwts.nl/">CWTS Leiden</a>, has just released an exciting update to their participation reports, adding metrics for both affiliations in general and ROR IDs in particular. Now Crossref members can easily see how well they&rsquo;re doing in providing open affiliation metadata. Read the full text of the Crossref announcement below.</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/prep/prep-2024-ror.png"
         alt="University of Szeged ROR ID percentage shown in Crossref PREP"/>
</figure>

<p>We’ve just released an update to our <a href="https://crossref.org/members/prep">participation report</a>, which provides a view for our members into how they are each working towards best practices in open metadata. Prompted by some of the signatories and organizers of the <a href="https://barcelona-declaration.org/">Barcelona Declaration</a>, which Crossref supports, and with the help of our friends at <a href="https://www.cwts.nl/">CWTS Leiden</a>, we have fast-tracked the work to include an updated set of metadata best practices in participation reports for our members. The reports now give a more complete picture of each member’s activity.</p>


<h2 id="what-do-we-mean-by-participation">What do we mean by ‘participation’? 
</h2>
<p>Crossref runs open infrastructure to link research objects, entities, and actions, creating a lasting and reusable scholarly record. As a not-for-profit with over 20,000 members in 160 countries, we drive metadata exchange and support nearly 2 billion monthly API queries, facilitating global research communication.</p>
<p>To make this system work, members strive to provide as much metadata as possible through Crossref to ensure it is openly distributed throughout the scholarly ecosystem at scale rather than bilaterally, thereby realizing the collective benefit of membership. Together, our membership provides and uses a rich nexus of information — known as <a href="https://crossref.org/documentation/research-nexus">the research nexus</a> — on which the community can build tools to help progress knowledge.</p>
<p>Each member commits to certain terms, such as keeping metadata current, updating links for their DOIs to redirect to, linking references and other objects, and preserving their content in perpetuity. Beyond this, we also encourage members to register as much rich metadata as is relevant and possible.</p>
<p>Creating and providing richer metadata is a key part of participation in Crossref; we’ve long encouraged a more complete scholarly record, such as through <a href="https://metadata2020.org">Metadata 20/20</a>, and through supporting or leading initiatives for specific metadata, like open citations (I4OC), open abstracts (I4OA), open contributors (<a href="https://orcid.org">ORCID</a>), and open affiliations (<a href="https://ror.org">ROR</a>).</p>


<h2 id="which-metadata-elements-are-considered-best-practices">Which metadata elements are considered best practices? 
</h2>
<p>Alongside basic bibliographic metadata such as title, authors, and publication date(s), we encourage members to register metadata in the following fields:</p>
<figure class="blog-figure"><a href="https://crossref.org/members/prep/5401"><img src="/img/blog/prep/prep-2024.png"
         alt="screenshot of Crossref participation report for member University of Szeged showing the 11 best practice metadata fields"/></a><figcaption>
            <p>Example participation report for Crossref member University of Szeged</p>
        </figcaption>
</figure>



<h3 id="references">References 
</h3>
<p>A list of all the references used by a work. This is particularly relevant for journal articles but the references can include any type of object, including datasets, versions, preprints, and more. Additionally, we encourage these to be added into <a href="https://crossref.org/documentation/principles-practices/best-practices/relationships/">relationships</a>, where relevant.</p>


<h3 id="abstracts">Abstracts 
</h3>
<p>A description of the work. These are particularly useful for discovery systems that will promote the work, and are often used in downstream analyses such as for detecting integrity issues.</p>


<h3 id="contributor-ids-orcid">Contributor IDs (ORCID) 
</h3>
<p>All authors should be included in a work’s metadata, ideally alongside their verified ORCID identifier.</p>


<h3 id="affiliations--affiliation-ids-ror">Affiliations / Affiliation IDs (ROR) 
</h3>
<p>Members are able to register contributor affiliations as free text, but we are encouraging everyone to add ROR IDs for affiliations as the recommended best practice, as this differentiates and avoids mistyping. These two fields have newly been added to the participation reports interface in the most recent update.</p>


<h3 id="funder-ids-ofr">Funder IDs (OFR) 
</h3>
<p>Acknowledging the organization(s) that funded the work. We encourage the inclusion of <a href="https://crossref.org//documentation/funder-registry/">Open Funder Registry</a> identifiers to make the funding metadata more usable. This will evolve into <a href="https://crossref.org//blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">an additional use case for ROR</a> over time.</p>


<h3 id="funding-award-numbers--grant-ids-crossref">Funding award numbers / Grant IDs (Crossref) 
</h3>
<p>A number or identifier assigned by the funding organization to identify the specific award of funding or other support such as use of equipment or facilities, prizes, tuition, etc. The <a href="https://crossref.org//services/grant-linking-system/">Crossref Grant Linking System</a> includes a unique persistent link that can be connected with outputs, activities, people, and organizations.</p>


<h3 id="crossmark">Crossmark 
</h3>
<p>The <a href="https://crossref.org/services/crossmark/">Crossmark service</a> gives readers quick and easy access to the current status of a record, including any corrections, retractions, or updates, via a button embedded on PDFs or a web article. Openly adding corrections, retractions, and errata is critical part of publishing, and the button provides readers with an easy in-context alert.</p>


<h3 id="similarity-check-urls">Similarity Check URLs 
</h3>
<p>The <a href="https://crossref.org/services/similarity-check/">Similarity Check service</a> helps editors to identify text-based plagiarism through our collective agreement for the membership to access to Turnitin’s powerful text comparison tool, iThenticate. Specific fill-text links are required to participate in this service.</p>


<h3 id="license-urls">License URLs 
</h3>
<p>URLs pointing to a license that explains the terms and conditions under which readers can access content. These links are crucial to denote intended downstream use.</p>


<h3 id="text-mining-urls">Text mining URLs 
</h3>
<p>Full-text URLs that help researchers in meta-science easily locate your content for text and data mining.</p>


<h2 id="what-is-a-participation-report">What is a participation report? 
</h2>
<p>Participation reports are a visualization of the data representing members’ participation to the scholarly record which is available via our open REST API. There’s a separate participation report for each member, and each report shows what percentage of that member’s metadata records include 11 key metadata elements. These key elements add context and richness, and help to open up members work to easier discovery and wider and more varied use. As a member, you can use participation reports to see for yourself where the gaps in your organization’s metadata are, and perhaps compare your performance to others. Participation reports are free and open to everyone - so you can also check the report for any other members you are interested in.</p>
<p>We first <a href="https://crossref.org/blog/321-its-lift-off-for-participation-reports/">introduced</a> participation reports in 2018. At the time, Anna Tolwinska and Kirsty Meddings wrote:</p>
<blockquote>
<p>Metadata is at the heart of all our services. With a growing range of members participating in our community—often compiling or depositing metadata on behalf of each other—the need to educate and express obligations and best practice has increased. In addition, we’ve seen more and more researchers and tools making use of our APIs to harvest, analyze and re-purpose the metadata our members register, so we’ve been very aware of the need to be more explicit about what this metadata enables, why, how, and for whom.</p>
</blockquote>
<p>All of that still rings true today. But as the research nexus continues to evolve, so should the tools that intend to reflect it. For example, in 2022, we removed the <em>Open references</em> field from participation reports after <a href="https://crossref.org/blog/amendments-to-membership-terms-to-open-reference-distribution-and-include-uk-jurisdiction/">a board vote to change our policy and update the membership terms</a> meant that <em>all</em> references deposited with Crossref would be open by default. And now we’ve expanded the list of fields again, adding coverage data for contributor affiliation text and ROR identifiers.</p>


<h2 id="putting-it-in-practice">Putting it in practice 
</h2>
<p>To find out how you measure up when it comes to participation, type the name of your member organization into the <a href="https://www.crossref.org/members/prep/">search box</a>. You may be surprised by what you find—we often speak to members who thought they were registering a certain type of metadata for all their records, only to learn from their participation report that something is getting lost along the way.</p>
<p>You can only <a href="https://crossref.org/documentation/register-maintain-records/maintaining-your-metadata/updating-your-metadata/">address</a> gaps in your metadata if you know that they exist.</p>
<p>More information, as well as a breakdown of the now 11 key metadata elements listed in every participation report and tips on improving your scores, is available in our <a href="https://crossref.org/documentation/reports/participation-reports/">documentation</a>.</p>
<p>And if you have any questions or feedback, come talk to us on the <a href="https://community.crossref.org/">community forum</a> or request a metadata Health Check by emailing the <a href="mailto:feedback@crossref.org?subject=Participation%20reports%20and%20metadata%20health%20checks">community team</a>.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Write the Crossref Community team at <a href="mailto:feedback@crossref.org?subject=Participation%20reports%20and%20metadata%20health%20checks">feedback@crossref.org</a> with any questions.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Crossref, with the help of CWTS Leiden, has just released an exciting update to their participation report, adding metrics for both affiliations in general and ROR IDs in particular. Now Crossref members can easily see how well they are doing in providing open affiliation metadata.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">American Physical Society Becomes Largest Society Publisher to Adopt ROR</title><id>https://doi.org/10.71938/8vkr-ae64</id><link rel="alternate" href="https://ror.org/blog/2024-07-23-aps-adopts-ror/"/><link rel="related" href="https://www.aps.org/about/news/2024/07/research-organization-registry"/><published>2024-07-23T00:00:00+00:00</published><updated>2025-10-22T06:05:03-04:00</updated><author><name>American Physical Society</name><uri>https://ror.org/05j8hg602</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>The American Physical Society (APS) has become the largest society publisher to adopt ROR, incorporating it into their manuscript submission process and sending ROR IDs in author affiliation metadata to Crossref. &ldquo;We&rsquo;ve been eagerly supporting ROR since before it was ROR,&rdquo; said APS Lead Data Analyst Arthur Smith. Read the whole announcement from APS below, then watch Arthur&rsquo;s marvelous presentation about the APS integration from last week&rsquo;s <a href="/events/2024-07-16-ror-community-call">ROR Community Call</a>.</span></div>
	</div>
</div>


<h2 id="american-physical-society-becomes-largest-society-publisher-to-adopt-research-organization-registry-identifiers">American Physical Society Becomes Largest Society Publisher to Adopt Research Organization Registry Identifiers 
</h2>
<figure class="featured-figure"><img src="/img/blog/aps/aps-logo.png"
         alt="APS logo"/>
</figure>

<p>The <a href="https://www.aps.org/homepage-launch">American Physical Society</a> has adopted the <a href="https://ror.org">Research Organization Registry</a> identification system, assigning persistent identifiers to authors&rsquo; affiliations for every paper published in the <a href="http://journals.aps.org">Physical Review journals</a>. The action comes as a response to challenges faced by the scientific community regarding metadata tracking.</p>
<p>Researchers, institutions, and libraries alike rely on high quality, accurate metadata to monitor publication output and impact. However, institutional metadata has lacked standardization across the publishing landscape. By assigning openly available ROR persistent identifiers, APS provides a solution to this problem and more. These identifiers will help authors and institutions keep track of the scope of their work.</p>
<p>APS also sends ROR identifiers to <a href="https://crossref.org">Crossref</a>, an online record for scholarly metadata. The initiative makes relevant metadata discoverable across all search tools, with implications for funding and for participating in open access models.</p>
<p>APS is one of the first publishers to incorporate ROR identifiers in published metadata, demonstrating the society’s commitment to advancing open science.</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Contact the APS Press Office at <a href="mailto:media@aps.org">media@aps.org</a> with questions.</span></div>
	</div>
</div>
<hr>


<h2 id="watch-aps-discuss-their-adoption-of-ror">Watch APS discuss their adoption of ROR 
</h2>
<p>Last week at the <a href="/events/2024-07-16-ror-community-call">ROR Community Call</a>, APS Lead Data Analyst Arthur Smith explained why and how APS adopted ROR IDs for author affiliation identification.</p>

<div class="blog-video">
  <iframe src="https://www.youtube.com/embed/qH1JXbAxbYU" allowfullscreen title="Featured ROR integrator American Physical Society"></iframe>
</div>

<p>The American Physical Society may be the largest society publisher so far to adopt ROR, but we know they won&rsquo;t be the last. Kudos to APS for helping to lead the way.</p>
<hr>
<div class='callout orange'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-bullhorn'></i></div><div class="col-md-10"><span>Will you be the next publisher to adopt ROR for author affiliations and funders? <a href="https://ror.readme.io/v2/docs/">Learn how.</a></span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The American Physical Society (APS) has become the largest society publisher to adopt Research Organization Registry Identifiers. Institutional metadata in the Physical Review journals are now easier to find, track, and reference — a move that strengthens open science and research transparency.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ID Ideas: Adding New External Identifiers to ROR</title><id>https://doi.org/10.71938/apfd-qw54</id><link rel="alternate" href="https://ror.org/blog/2024-07-18-id-ideas/"/><published>2024-07-18T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>We&rsquo;re looking for your feedback on a <a href="https://bit.ly/ror-proposal-external-ids-draft">draft proposal for criteria to include additional external identifiers in ROR records</a>, and comments are open <strong>through August 16th</strong>. ROR records already include mappings to Crossref Funder IDs, Wikidata, ISNI, and GRID. What additional external IDs do you feel should be added? What criteria do you think other identifiers should meet in order to be added to ROR?</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/id-ideas/light-bulb-purple.png"
         alt="Light bulb"/>
</figure>

<p>Now that we&rsquo;ve <a href="https://ror.org/blog/2024-04-15-announcing-ror-v2/">released v2 of the ROR schema</a>, we have the power to add external identifiers to ROR records in the <code>external_ids</code> field <a href="https://ror.readme.io/v2/docs/schema-versions#changes-that-require-versioning">without making a major schema change</a> – and since we have the power, we want to use it! We&rsquo;re therefore issuing a request for feedback on the requirements we should set for including additional external organizational identifiers in ROR records.</p>
<p>The four external identification schemes that are currently included in ROR are all <strong>global identifiers</strong> – they are unique identifiers that are used around the world. We have, however, received several requests to include <strong>regional and national identifiers</strong> in ROR records as well as other globally unique identifiers; one important aspect of the draft external identifier proposal is to establish the criteria that such &ldquo;non-global&rdquo; organizational identifiers should meet.</p>
<p>Note that we are asking chiefly for feedback on the <strong>characteristics</strong> that new organizational identifiers should have in order to be added to ROR, although we are also more than happy to accept suggestions of organizational identifier schemes to add to ROR. To suggest that ROR add a particular new organizational identifier that would be useful for you, <a href="https://github.com/ror-community/ror-roadmap/issues/new/choose">submit an issue</a> to the <a href="https://github.com/ror-community/ror-roadmap/">ROR roadmap</a> and choose the issue template &ldquo;Add External ID.&rdquo;</p>
<p>To give feedback, please visit the <a href="https://bit.ly/ror-proposal-external-ids-draft">proposal document</a> and suggest changes to the text or add comments in the margin by <strong>August 16th</strong>, and do feel free to share the document with anyone you think might be interested. We&rsquo;ll give a summary of the comments and a plan for moving ahead at the <a href="https://ror.org/events/#ror-community-call-september-2024">ROR Community Call in September</a>.</p>
<p>We rely on the ROR community to keep us headed in the right direction – thanks in advance for contributing your time and expertise!</p>
<div class='callout '>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Write <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We're looking for your feedback on a draft proposal for criteria to include additional external identifiers in ROR records. Comments are open through August 16th. ROR records already include mappings to Crossref Funder IDs, Wikidata, ISNI, and GRID. What additional external IDs do you feel should be added? What criteria do you think other identifiers should meet in order to be added to ROR?</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">The Anatomy of Metadata Matching</title><id>https://doi.org/10.71938/q0sx-4772</id><link rel="alternate" href="https://ror.org/blog/2024-06-27-anatomy-of-metadata-matching/"/><link rel="related" href="https://doi.org/10.13003/zie7reeg"/><published>2024-06-27T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Dominika Tkaczyk</name><uri>https://orcid.org/0000-0001-5055-7876</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>The second blog post about metadata matching by ROR&rsquo;s Adam Buttrick and Crossref&rsquo;s Dominika Tkaczyk describes some basic matching-related terminology and the components of a matching process, then poses some typical product questions to consider when developing or integrating matching solutions. Read more about this important work here and in other posts in the <a href="/tags/matching/">matching</a> series.</span></div>
	</div>
</div>
<figure class="featured-figure"><img src="/img/blog/matching-anatomy/yellow-matching-figure-02.png"
         alt="Input, matching strategy, output"/>
</figure>

<p>In our <a href="/blog/2024-05-16-metadata-matching-101-cross-post/">previous blog post about metadata matching</a>, we discussed what it is and why we need it (tl;dr: to discover more relationships within the scholarly record). Here, we will describe some basic matching-related terminology and the components of a matching process. We will also pose some typical product questions to consider when developing or integrating matching solutions.</p>


<h2 id="basic-terminology">Basic terminology 
</h2>
<p>Metadata matching is a high-level concept, with many different problems falling into this category. Indeed, no matter how much we like to focus on the similarities between different forms of matching, matching affiliation strings to ROR IDs or matching preprints to journal papers are still different in several important ways. At Crossref and ROR, we call these problems <strong>matching tasks</strong>.</p>
<p>Simply put, a matching task defines the kind or nature of the matching. Examples of matching tasks are bibliographic reference matching, affiliation matching, grant matching, or preprint matching.</p>
<p>Every matching task has an <strong>input</strong>, which is all the data that is needed to perform the matching. Input data can come in many shapes and forms, depending on the matching task. For example, all of the following could be inputs to a matching task:</p>
<hr>
<p><code>Department of Molecular Medicine, Sapporo Medical University, Sapporo 060-8556, Japan</code></p>
<hr>
<pre tabindex="0"><code>&lt;fr:program xmlns:fr=&#34;http://www.crossref.org/fundref.xsd&#34; name=&#34;fundref&#34;&gt;
  &lt;fr:assertion name=&#34;fundgroup&#34;&gt;
    &lt;fr:assertion name=&#34;funder_name&#34;&gt;
      European Union&#39;s Horizon 2020 Research and Innovation Program through Marie Sklodowska Curie
      &lt;fr:assertion name=&#34;funder_identifier&#34;&gt;http://dx.doi.org/10.13039/501100000780&lt;/fr:assertion&gt;
    &lt;/fr:assertion&gt;
    &lt;fr:assertion name=&#34;award_number&#34;&gt;721624&lt;/fr:assertion&gt;
  &lt;/fr:assertion&gt;
&lt;/fr:program&gt;
</code></pre><hr>
<p><code>Everitt, W. N., &amp; Kalf, H. (2007). The Bessel differential equation and the Hankel transform. Journal of Computational and Applied Mathematics, 208(1), 3–19.</code></p>
<hr>
<pre tabindex="0"><code>{
  &#34;title&#34;: &#34;Functional single-cell genomics of human cytomegalovirus infection&#34;,
  &#34;issued&#34;: &#34;2021-10-25&#34;,
  &#34;author&#34;: [
    {&#34;given&#34;: &#34;Marco Y.&#34;, &#34;family&#34;: &#34;Hein&#34;},
    {&#34;given&#34;: &#34;Jonathan S.&#34;, &#34;family&#34;: &#34;Weissman&#34;, &#34;ORCID&#34;: &#34;http://orcid.org/0000-0003-2445-670X&#34;}
  ]
}
</code></pre><hr>
<p>Every matching task also has an <strong>output</strong>. For our purposes, this is almost exclusively zero or more matched identifiers. In the context of a specific matching task, output identifiers may be of a specific type (e.g. we might match to a ROR ID, and never to an ORCID ID). In some cases, there can be a certain target set as well (i.e. matching only to DataCite DOIs). The output identifiers can have different cardinality depending on the task, meaning that the matching task might allow for zero, one, or more identifiers as a result of matching to a single input.</p>
<p>A <strong>matching strategy</strong> defines how the matching is done. Multiple strategies can exist for a specific matching task. Compound strategies can run other strategies and combine their outcomes into a single result.</p>
<p>In some cases, we may also want the matching strategy to output a confidence score for each matched identifier. A confidence score represents the degree of certainty or likelihood that the matched identifier is correct, typically expressed as a value between 0 and 1. This score may help with post-processing or further interpretation of the results.</p>
<p>To summarise, the anatomy of the matching task can be diagrammed as follows:</p>
<figure class="noborder"><img src="/img/blog/matching-anatomy/matching-task-anatomy.png"
         alt="Input, matching strategy, output"/>
</figure>



<h2 id="how-to-specify-a-matching-task">How to specify a matching task 
</h2>
<p>Whenever we plan the development or integration of a matching solution, it is good to begin by answering a few basic questions:</p>
<ol>
<li>
<p>What problem do we plan to solve with our matching task? What would we call our matching task and how would we describe it?</p>
</li>
<li>
<p>What do we expect as the input for this matching task? Which input formats do we need to be able to accept? What information do we expect to find in this input?</p>
</li>
<li>
<p>What kind of identifiers should be output? Is there a target set of identifiers? Can our matching output zero/one/or multiple identifiers, and under what conditions might that occur?</p>
</li>
</ol>
<p>These sound fairly simple, but the answers to these questions can be remarkably complex. Once one tries to apply these concepts to real-world problems, they might encounter several non-obvious challenges.</p>
<p>For example, one common concern is at what level we should define each matching task. Consider the following problems:</p>


<h4 id="matching-bibliographic-reference-strings-to-dois">Matching bibliographic reference strings to DOIs. 
</h4>
<p>Example input:</p>
<hr>
<p><code>Everitt, W. N., &amp; Kalf, H. (2007). The Bessel differential equation and the Hankel transform. Journal of Computational and Applied Mathematics, 208(1), 3–19.</code></p>
<hr>


<h4 id="matching-structured-bibliographic-reference-to-dois">Matching structured bibliographic reference to DOIs. 
</h4>
<p>Example input:</p>
<hr>
<pre tabindex="0"><code>{
  volume: &#34;208&#34;,
  author: &#34;Everitt&#34;,
  journal-title: &#34;J. Comput. Appl. Math.&#34;,
  article-title: &#34;The Bessel differential equation and the Hankel transform&#34;,
  first-page: &#34;3&#34;,
  year: &#34;2007&#34;,
  issue: &#34;1&#34;
}
</code></pre><hr>
<p>Are those discrete matching tasks (unstructured reference matching vs. structured reference matching), or are they the same task (reference matching) that can accept different types of inputs (unstructured or structured)?</p>
<p>Similarly, let’s compare the following tasks:</p>


<h4 id="matching-affiliation-strings-to-ror-ids">Matching affiliation strings to ROR IDs. 
</h4>
<p>Example input:</p>
<hr>
<p><code>Department of Molecular Medicine, Sapporo Medical University, Sapporo 060-8556, Japan</code></p>
<hr>


<h4 id="matching-funder-names-to-ror-ids">Matching funder names to ROR IDs. 
</h4>
<p>Example input:</p>
<hr>
<p><code>Alexander von Humboldt Foundation</code></p>
<hr>
<p>Are these different matching tasks (affiliation matching vs. funder matching), or the same task with different inputs (organisation matching)?</p>
<p>Defining the boundaries of a matching task can also be difficult. Consider, for example, the need to obtain ROR IDs for organisations mentioned in the acknowledgements section of a full-text academic paper. To begin, one may first extract the acknowledgement section from the full text, then run something like a named entity recognition (NER) tool to isolate the organisation names from the extracted text, and finally match these names to ROR IDs. Is this entire process matching, with the input being the full text of a paper? Or perhaps matching starts with the acknowledgement section as the input? Instead, is it only the last phase, where we try to match the extracted name to the ROR ID, that constitutes the matching task, with the extraction phases being completely separate processes?</p>
<p>There are also important questions related to the expected behaviour of a matching strategy. Consider, for example, developing an affiliation matching strategy where we define our input as “an affiliation string”. What should happen when the strategy gets something else on the input, for example, song lyrics? Perhaps the strategy should simply return no matches, or an error, or we could say that in such a situation the behaviour is undefined and it simply doesn’t matter what is returned. But what should happen if in this input we have the lyrics of <a href="https://www.azlyrics.com/lyrics/roxymusic/streetlife.html">Street Life by Roxy Music</a>, a song that mentions the names of a few universities that happen to have ROR IDs?</p>
<p>It is likewise important to consider what should happen if different parts of the input match to different identifiers, like in the following example:</p>
<hr>
<p><code>Department of Haematology, Eastern Health and Monash University, Box Hill, Australia</code></p>
<hr>
<p>Here, “Eastern Health” matches to <a href="https://ror.org/00vyyx863">https://ror.org/00vyyx863</a> and “Monash University” to <a href="https://ror.org/02bfwt286">https://ror.org/02bfwt286</a>. Should the matching strategy return all the identifiers, one of them (if so, which one?), or nothing at all?</p>
<p>Similar questions arise when it is possible to match to multiple versions (or duplicates) in the target identifier set. This can happen, for example, in the context of bibliographic reference matching or preprint matching. Multiple matches may occur when there are different editions, reprints, or variations of the same publication in the target dataset, each with its own unique identifier.</p>
<p>If you are waiting for an answer to these questions, we unfortunately must disappoint you here. These can only be answered in the context of a specific problem, considering who the users are and what it is they need and expect.</p>
<p>Did you notice any other subtleties related to metadata matching and its concerns? Are there other non-obvious questions that should be considered when planning to develop or integrate metadata matching strategies? Let us know—we’d love to hear from you!</p>
<div class='callout grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:adam@ror.org">adam@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The second blog post about metadata matching by ROR's Adam Buttrick and Crossref's Dominika Tkaczyk describes some basic matching-related terminology and the components of a matching process, then poses some typical product questions to consider when developing or integrating matching solutions.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Change, Continuity, and New Positions on the ROR Team</title><id>https://doi.org/10.71938/7rd7-4r81</id><link rel="alternate" href="https://ror.org/blog/2024-05-22-new-positions/"/><published>2024-05-22T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>ROR Leadership Team</name><uri>https://ror.org/about/team/#operations-team</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is fortunate to have a small but mighty <a href="/about/team/#core-team">team</a> that strives every day to make ROR as good as it can be and to support the diverse needs of our global community.</p>
<p>Ever since ROR was established, we’ve continually adjusted our resourcing and staffing to keep up with ROR’s needs and to align ROR activities with the needs and priorities of ROR’s operating organizations. The ROR team has therefore changed and grown over the years since the registry’s initial launch, evolving from an <em>ad hoc</em> project team to a more formal structure with dedicated staff members. This was made possible initially through startup funds from ROR’s operating organizations, contributions from community supporters, and multi-year grants.</p>
<p>As ROR began to move beyond its startup phase, ROR’s three operating organizations recognized the need to have long-term dedicated resourcing, and thus <a href="https://ror.org/blog/2022-10-10-strengthening-sustainability/">in 2022 they committed to supporting ROR’s core expenses on an ongoing basis</a>. A shared staffing model is a critical part of this agreement, ensuring that ROR has dedicated personnel to carry out essential activities. Each organization essentially “contributes” one or more staff members to ROR on a dedicated basis, and we balance these contributions across our three organizations.</p>
<p>Staff arrivals and departures are an inevitable development in any team or organization, and ROR is no exception in this regard. Since the beginning of 2024, the ROR team has been experiencing a few changes – but at the same time, many things have stayed the same. We wanted to take this opportunity to update the community on how the team has been evolving.</p>
<p>In January of this year, long-time ROR Director <a href="https://datacite.org/blog/welcome-maria-gould-datacites-new-product-director/">Maria Gould left her position at California Digital Library to assume a new role as Director of Product at DataCite.</a> With DataCite being one of ROR’s operating organizations, and ROR a core piece of DataCite’s overall strategic vision, it was natural for Maria to continue as Director of  ROR, although in a slightly more limited capacity in order to fulfill the responsibilities of her new role.</p>
<p>As of this month, long-time Metadata Curation Lead Adam Buttrick has left his position at Crossref to fill the vacancy at CDL left by Maria. The CDL position is a senior-level Product Manager role focused on leading a portfolio of persistent identifier services, of which ROR is a key element. Like Maria&rsquo;s time previously, Adam’s time will be split across ROR and other PID services and activities. With ROR’s metadata being a core product offering for CDL, Adam will continue to support metadata curation activities while also driving further product development work on ROR’s overall services and infrastructure.</p>
<p><strong>Adam’s departure from Crossref means that there is a new vacancy to fill!</strong> ROR’s curation activities continue apace, receiving approximately 700 requests every month to add or update registry records and publishing new registry releases approximately every 2-4 weeks. Curation activities are also heavily focused on the incorporation of the Funder Registry into ROR, with ongoing reconciliation work to support all those who have switched - or are planning to switch - to using ROR for funding organization identification. As the scope of curation activities and product development has grown with wider adoption of ROR, we need to ensure we have adequate resourcing and staffing to keep up with these developments. <strong>We are now hiring a new ROR team member - Metadata Manager</strong> - at Crossref to coordinate ROR metadata curation as well as support related metadata initiatives at Crossref. <a href="https://www.crossref.org/jobs/2024-05-21-metadata-manager/">Read the position announcement</a> and please consider applying yourself or spreading the word to your networks!</p>
<p>All of these developments reflect how ROR continues to stabilize and mature, and they underscore the importance of continuing to grow the team to support ongoing needs as ROR evolves. At the same time, we fully intend to keep our operations small and nimble in order to stay focused on key objectives and remain flexible enough to adapt to future changes.</p>
<div class='callout grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:info@ror.org">info@ror.org</a>. See also ROR&rsquo;s <a href="/jobs">jobs page</a>.</span></div>
	</div>
</div>
<p>Photo by <a href="https://unsplash.com/@sens_design">Jansen Yang</a> on <a href="https://unsplash.com/photos/concrete-man-beside-lion-statue-QtVRBR2QrfI?utm_content=creditCopyText&amp;utm_medium=referral&amp;utm_source=unsplash">Unsplash</a></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR Curation Lead Adam Buttrick is staying at ROR but moving from Crossref to CDL, which means there's a new position to fill at Crossref. Read more about the Metadata Manager position and consider applying yourself!</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Metadata Matching 101: What Is It and Why Do We Need It?</title><id>https://doi.org/10.71938/gzcq-5q83</id><link rel="alternate" href="https://ror.org/blog/2024-05-16-metadata-matching-101/"/><link rel="related" href="https://doi.org/10.13003/aewi1cai"/><published>2024-05-16T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Dominika Tkaczyk</name><uri>https://orcid.org/0000-0001-5055-7876</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>ROR Curation Lead Adam Buttrick has been working with Crossref Head of Strategic Initiatives Dominika Tkaczyk to explore improvements in automatic metadata matching strategies that can result in better metadata for everyone. Read more about this important work here and in other posts in the <a href="/tags/matching/">matching</a> series.</span></div>
	</div>
</div>
<p>At Crossref and ROR, we develop and run processes that match metadata at scale, creating relationships between millions of entities in the scholarly record. Over the last few years, we&rsquo;ve spent a lot of time diving into details about metadata matching strategies, evaluation, and integration. It is quite possibly <a href="https://crossref.org/blog/matchmaker-matchmaker-make-me-a-match/">our</a> <a href="https://crossref.org/blog/reference-matching-for-real-this-time/">favourite</a> <a href="https://crossref.org/blog/what-if-i-told-you-that-bibliographic-references-can-be-structured/">thing</a> to <a href="https://www.youtube.com/watch?v=Tx5y7lX030U">talk</a> and <a href="https://crossref.org/blog/follow-the-money-or-how-to-link-grants-to-research-outputs/">write</a> <a href="https://crossref.org/blog/discovering-relationships-between-preprints-and-journal-articles/">about</a>! But sometimes it is good to step back and look at the problem from a wider perspective. In this blog, the first one in a series about metadata matching, we will cover the very basics of matching: what it is, how we do it, and why we devote so much effort to this problem.</p>


<h2 id="what-is-metadata-matching">What is metadata matching? 
</h2>
<p>Would you be able to find the DOI for the work referenced in this citation?</p>
<p><code>Everitt, W. N., &amp; Kalf, H. (2007). The Bessel differential equation and the Hankel transform. Journal of Computational and Applied Mathematics, 208(1), 3–19.</code></p>
<p>We bet you could! You might begin, for example, by pasting the whole citation, or only the title, into a search engine of your choice. This would probably return multiple results, which you would quickly skim. Then you might click on the links for a few of the top results, those that look promising. Some of the websites you visit might contain a DOI. Perhaps you would briefly compare the metadata provided on the website against what you see in the citation. If most of this information matches (see what we did there?), you would conclude that the DOI from that website is, in fact, the DOI for the cited paper.</p>
<p>Well done! You just performed metadata matching, specifically, bibliographic reference matching. Matching in general can be defined as the task or process of finding an identifier for an item based on its structured or unstructured &ldquo;description&rdquo; (in this case: finding a DOI of a cited article based on a citation string).</p>
<p>But matching doesn&rsquo;t have to just be about citations and DOIs. There are many other instances of matching we can think of, for example:</p>
<ul>
<li>finding the ROR ID for an organisation based on an affiliation string,</li>
<li>finding the ORCID ID for a researcher based on the person&rsquo;s name and affiliation,</li>
<li>finding the ROR ID for a funder based on the acknowledgements section of a research paper,</li>
<li>finding the grant DOI based on an award number and a funder name.</li>
</ul>
<p>Matching doesn&rsquo;t have to be done manually. It is possible to develop fully automated strategies for metadata matching and employ them at scale. It is also possible to use a hybrid approach, where automated strategies assist users by providing suggestions.</p>
<p>Developing automated matching strategies is not a trivial task, and if we want to do it right, it takes a great deal of time and effort. This brings us to our next question: is it worth it?</p>


<h2 id="why-do-we-need-matching">Why do we need matching? 
</h2>
<p>In short, metadata matching gives us a more complete picture of <a href="https://crossref.org/documentation/research-nexus/">the research nexus</a> by discovering missing relationships between various entities within and throughout the scholarly record:</p>
<figure class="blog-figure"><img src="/img/blog/matching101/matching-101-relationships.png"
         alt="Example relationships in the scholarly record"/>
</figure>

<p>These relationships are very powerful. They provide important context for any entity, whether it is a research output, a funder, a research institution, or an author. Imagine for a moment the scholarly record without any such relationships, where all bibliographic references, affiliations (institution names and addresses), and funding information (funder names and grant titles) are provided as unstructured strings only. In such a world, how would you calculate the number of times a particular research paper was cited? How would you get a list of research outputs supported by a specific funder? It would be incredibly challenging to navigate, summarise, and describe research activities, especially considering the scale. Thankfully, these and many other questions can be answered thanks to metadata matching that discovers relationships between entities in the scholarly record.</p>
<p>There are two primary ways we can use metadata matching in our workflows: as semi-automated tools that help users look up the appropriate identifiers or as fully automated processes that enrich the metadata in various scholarly databases.</p>
<p>The first approach is quite similar to the example we described at the beginning. If you are submitting scholarly metadata, for example of a new article to be published, you can use metadata matching to look up identifiers for the various entities and include these identifiers in the submission. For example, with the help of metadata matching, instead of submitting citation strings, you could provide the DOIs for works cited in the paper and instead of the name and address of your organisation, you could provide its ROR ID. To make this easier for people, metadata submission systems and applications sometimes integrate metadata matching tools  into user interfaces.</p>
<p>The second approach allows large, existing sources of scholarly metadata to be enriched with identifiers in a fully automated way. For example, we can match affiliation strings to ROR IDs using a combination of machine learning models and <a href="https://ror.readme.io/docs/api-affiliation">ROR&rsquo;s default matching service</a>, effectively adding more relationships between people and organisations. We can also <a href="https://crossref.org/blog/discovering-relationships-between-preprints-and-journal-articles/">compare journal articles and preprints metadata</a> in the Crossref database by calculating similarity scores for titles, authors, and years of publication to match them with each other and provide more relationships between preprints and journal articles. This automated enrichment can be done at any point in time, even after research outputs have been formally published.</p>
<p>There are fundamental differences between these two approaches. The first is done under the supervision of a user, and for the second, the matching strategy makes all the decisions autonomously. As a result, the first approach will typically (although not always) result in better quality matches. By contrast, the second approach is much faster, generally less expensive, and scales to even very large data sources.</p>
<p>In the end, no matter what approach is used, the goal is to achieve a more complete accounting of the relationships between entities in the scholarly record.</p>
<p>This blog is the first one in a series about metadata matching. In the coming weeks, we will cover more detail about the product features related to metadata matching, explain why metadata matching is not a trivial problem, and share how we can develop, assess, compare, and choose matching strategies. Stay tuned!</p>
<div class='callout grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:adam@ror.org">adam@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this blog, the first one in a series about metadata matching, we will cover the very basics of matching: what it is, how we do it, and why we devote so much effort to this problem.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Announcing Version 2 of the ROR Schema and API!</title><id>https://doi.org/10.71938/v406-1m30</id><link rel="alternate" href="https://ror.org/blog/2024-04-15-announcing-ror-v2/"/><published>2024-04-15T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>ROR Core Team</name><uri>https://ror.org/about/team/#core-team</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span><strong>Today, we are delighted to announce the launch of version 2 of the ROR schema and API! This new version of our schema and API will serve as a rock-solid foundation for everything ROR users want to accomplish now and in the future.</strong></span></div>
	</div>
</div>
<p>Today marks a red-letter (teal-letter?) day for ROR: the official public launch of <strong>version 2 of the ROR schema and API</strong>! Version 2 of ROR, developed and beta-tested over many months with <a href="https://ror.readme.io/docs/feedback-docs">the generous input and help of the ROR community</a>, is more streamlined and more powerful than version 1, including some restructuring that will make ROR data easier to manage and use and some new fields that ROR users have been asking for. Read on to find out more.</p>


<h2 id="whats-new---summary">What&rsquo;s new - summary 
</h2>
<p>Here&rsquo;s a synopsis of what&rsquo;s new and improved.</p>
<ul>
<li>
<p>We&rsquo;ve added a field for <strong>administrative information</strong> that includes the date a record was created and the date it was last modified. This crucial change enables users to retrieve only the most recently added and updated records from the ROR registry, which has been a much-requested feature.</p>
</li>
<li>
<p>We&rsquo;ve simplified <strong>name information</strong> and have made changes to ensure that organization names in ROR work for a global, multilingual community of users. By adding a place to store language codes for all name variants and treating names in different languages as equivalent, we&rsquo;ve made ROR metadata less Anglocentric and more useful for countries with multiple official languages.</p>
</li>
<li>
<p>We&rsquo;ve removed redundant and overly granular <strong>location information</strong>, keeping only the most useful and globally relevant fields: country name, country code, latitude, longitude, specific location (city), and <a href="https://geonames.org">GeoNames</a> ID. These changes make location information much easier to read, understand, and work with.</p>
</li>
<li>
<p>We&rsquo;ve made organization <strong>website information</strong> more useful by adding a field for an organization&rsquo;s web domains, creating a single container field for the organization&rsquo;s website and its Wikipedia page, and removing deprecated fields for IP address and email address that were inherited from GRID and never populated. Web-related metadata is now more compact and better organized, and users will now be able to disambiguate organizations by domain.</p>
</li>
<li>
<p>We&rsquo;ve restructured <strong>external identifier information</strong> and made it more flexible in order to make it easier to manage current and potential future mappings to other organization identifiers.</p>
</li>
<li>
<p>We&rsquo;re adding an additional <strong>organization type</strong>, &ldquo;funder,&rdquo; which can be combined with existing organization types in ROR such as &ldquo;nonprofit&rdquo; and &ldquo;government.&rdquo; With <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">ROR set to become the standard identifier for funders</a>, this change will make it easier for ROR users to recognize and select funding organizations.</p>
</li>
<li>
<p>The <strong>ROR API</strong> now exists in two parallel versions, and we&rsquo;ve made sure not to break any code that current ROR API users have written. Version 1 of the ROR API is now available at <a href="https://api.ror.org/v1/organizations">https://api.ror.org/v1/organizations</a> and version 2 of the ROR API is now available at <a href="https://api.ror.org/v2/organizations">https://api.ror.org/v2/organizations</a>, but version 1 is still available at the original location without the version in the path: <a href="https://api.ror.org/organizations">https://api.ror.org/organizations</a>. Version 1 of the ROR API will remain the default for at least another year, through April 2025 and perhaps beyond.</p>
</li>
<li>
<p>All the same records will always be available at the same time in both versions of the API, and all the same functionality of version 1 of the ROR API remains available in version 2. Users of version 2 of the ROR API, however, will be able to send <strong>additional queries based on version 2 metadata</strong>, such as queries based on created and last modified dates.</p>
</li>
<li>
<p>The <strong>ROR data dump</strong> now includes JSON and CSV files formatted according to both schema v1 and schema v2. This means that there will now be 4 files in each data release instead of 2, and users can choose to use the format and schema that best suits their needs. Take a look for yourself by downloading the most recent ROR data dump from Zenodo at <a href="https://zenodo.org/doi/10.5281/zenodo.6347574">https://zenodo.org/doi/10.5281/zenodo.6347574</a>.</p>
</li>
<li>
<p>The <strong>ROR web search</strong> at <a href="https://ror.org/search">https://ror.org/search</a> now uses version 2 of the ROR API, and we&rsquo;ve taken the opportunity to make a few improvements. Organization names are now identified by type and language so that it&rsquo;s more apparent what&rsquo;s an alias, what&rsquo;s an acronym, and what&rsquo;s in an alternate script. Additionally, each individual landing page for a ROR record now includes a link to the JSON view of the record, which enables users to see the underlying data behind the interface.</p>
</li>
</ul>
<div class='callout mustard'>
	<div class="row">
		<div class="col-md-12"><span><p>Not all new fields and subfields in version 2 of ROR have values yet, but we expect to add most or all of this information by the end of Q2 2024, proceeding carefully in order to ensure that added values are accurate.</p>
<p><strong>Created/last modified dates</strong> have been added to all records, using actual dates from GRID and ROR data releases.</p>
<p><strong>Language codes</strong> for names are currently only included for names inherited from the &ldquo;labels&rdquo; field in the version 1 schema. Language codes have not yet been added for other names.</p>
<p><strong>Domains</strong> have not yet been added.</p>
<p>The <strong>funder</strong> organization type has not yet been added to funder records.</p>
</span></div>
	</div>
</div>


<h2 id="compare-a-v1-record-with-a-v2-record">Compare a v1 record with a v2 record 
</h2>
<p>Compare the ROR record for Victoria University in ROR v1 with the ROR record for Victoria University in ROR v2 to see the improvements we&rsquo;ve made in the metadata.</p>
<div class='centered'><p>Victoria University, ROR v1</p>
<figure><img src="/img/ror-v1-victoria-university.png"
         alt="The version 1 ROR record for Victoria University"/><figcaption>
            <p>The version 1 ROR record for Victoria University</p>
        </figcaption>
</figure>
</div>
<div class='centered'><p>Victoria University, ROR v2</p>
<figure><img src="/img/ror-v2-victoria-university.png"
         alt="The version 2 ROR record for Victoria University"/><figcaption>
            <p>The version 2 ROR record for Victoria University</p>
        </figcaption>
</figure>
</div>


<h2 id="whats-new---more-details">What&rsquo;s new - more details 
</h2>
<p>If you&rsquo;d like to see even more details about what&rsquo;s new in version 2 of the ROR API and schema, take a look at our <a href="https://ror.readme.io/changelog/2024-04-11-schema-api-v2">changelog</a> and at our <a href="https://ror.readme.io/v2/docs/">v2 documentation</a>. In our documentation of version 2 of ROR you&rsquo;ll find all the new v2 fields and subfields listed by name, sample v2 API queries and results, and fully updated guides to implementing ROR using version 2. You can easily toggle between v1 and v2 documentation.</p>
<figure><img src="/img/ror-v2-docs-toggle.gif"
         alt="How to switch between version 1 and version 2 of ROR documentation"/><figcaption>
            <p>How to switch between version 1 and version 2 of ROR documentation</p>
        </figcaption>
</figure>



<h2 id="thanks-to-all-who-made-this-possible">Thanks to all who made this possible! 
</h2>
<p>The launch of version 2 of ROR ranks right up there with other <a href="/about/history">major milestones for ROR</a>, including the stakeholder meeting in Girona, Spain in January 2018; the &ldquo;Minimum Viable registry&rdquo; launch in January 2019; and the first independently-curated release in March 2022. We&rsquo;re very grateful to all those who made this watershed moment possible, including all the members of the ROR community who gave us <a href="https://ror.readme.io/v2/docs/feedback-docs">feedback on our proposals</a> and participated in our <a href="https://ror.org/blog/2023-09-14-beta-test/">beta test</a>.</p>
<p>If you&rsquo;d like to participate in future ROR developments, please get involved with the ROR <a href="/community">community</a>!</p>
<ul>
<li><a href="http://eepurl.com/gjkT9H">Subscribe to the quarterly ROR newsletter</a></li>
<li><a href="https://groups.google.com/a/ror.org/g/ror-tech">Sign up for the ROR Technical Forum</a></li>
<li>Join the ROR Slack</li>
<li>Attend a <a href="/events">ROR community call or other event</a></li>
</ul>
<div class='callout light-grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Want help switching to ROR version 2 or integrating it into your system for the first time? Email <a href="mailto:support@ror.org">support@ror.org</a> with any questions or book a meeting with ROR Technical Community Manager Amanda French at <a href="https://calendly.com/ror-chat">https://calendly.com/ror-chat</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Today, we are delighted to announce the launch of version 2 of the ROR schema and API! This new version of our schema and API will serve as a rock-solid foundation for everything ROR users want to accomplish now and in the future.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR's Fifth Anniversary: Highlights from the 2024 Annual Community Meeting</title><id>https://doi.org/10.71938/mt5h-7086</id><link rel="alternate" href="https://ror.org/blog/2024-02-20-ror-fifth-anniversary/"/><published>2024-02-20T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Every year <a href="/blog/2019-02-10-announcing-first-ror-prototype">since 2019</a>, we&rsquo;ve celebrated ROR&rsquo;s anniversary at the same time we hold our annual community meeting, updating everyone on what ROR has accomplished in the last year and expressing our gratitude for the help of the ROR community. This year, the 2024 annual meeting featured five online events: a <a href="/events/2024-01-30-annual-meeting-and-fifth-anniversary">general update and fifth anniversary celebration</a>, a <a href="/events/2024-01-30-community-showcase">community showcase</a>, a <a href="/events/2024-01-31-why-we-all-need-good-funding-metadata">panel on funding metadata</a>, and two informal (unrecorded) drop-in sessions.</p>
<p>We had over 300 unique registrants for our three recorded sessions, making this the most popular ROR annual community meeting ever! It was a true delight to see so many ROR users and supporters come together and to meet some people who are new to ROR. If you missed the meeting, here are the highlights.</p>


<h3 id="ror-annual-meeting-and-fifth-anniversary-celebration">ROR Annual Meeting and Fifth Anniversary Celebration 
</h3>
<p>The <a href="/events/2024-01-30-annual-meeting-and-fifth-anniversary">2024 edition of the main ROR annual meeting</a> was a special one! We took the opportunity to reflect on the last five years, remembering where we started and how we got here. We heard about where we’ve been and where we’re going in terms of our community of adopters, our curation activities, and our tech. Most importantly, we acknowledged ROR community contributors, without whom we’d never have come this far, and we did some brainstorming about what’s next.</p>


<h4 id="selected-highlights">Selected highlights 
</h4>
<ul>
<li>A look back at <strong>memories of ROR from 2016-2019,</strong> including a white paper on the concept of an organization identifier registry, a working group report, and pictures from in-person events</li>
<li>Stats of ROR adoption include a staggering <strong>1.48 million DOIs registered with DataCite that include ROR IDs</strong> to identify creators or contributors</li>
<li>The number of requests to add new records to ROR has surged from 50 per month in January 2022 to <strong>more than 700 per month</strong> in January 2024</li>
<li>Key technical accomplishments in 2023 include schema v2 development, implementation, and beta test; anticipated for 2024 is the <strong>public launch of v2 and a redesigned affiliation matching service</strong></li>
<li>ROR Community members are in strong alignment that we are on the right track and should keep going, specifically focusing on continuing our work on <strong>publisher adoption and the Crossref Open Funder Registry transition</strong> in 2024</li>
</ul>

<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/mg53Lt5kWYQ" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="ROR Annual Community Meeting and Fifth Anniversary Celebration January 2024"></iframe>
</div>



<h3 id="community-showcase">Community Showcase 
</h3>
<p>The <a href="/events/2024-01-30-community-showcase">Community Showcase</a> featured presentations on how ROR is used in some key global scholarly systems and workflows, including at <a href="https://openalex.org">OpenAlex</a>, <a href="https://www.4science.com/dspace-cris/">4Science / DSpace-CRIS</a>, <a href="https://orcid.org">ORCID</a>, <a href="https://www.springernature.com/">Springer Nature</a>, US Department of Energy&rsquo;s <a href="https://osti.gov">Office of Scientific and Technical Information (OSTI)</a>, and the ROR Implementation Task Group of the <a href="https://datascience.nih.gov/data-ecosystem/generalist-repository-ecosystem-initiative">NIH’s Generalist Repository Ecosystem Initiative (GREI)</a>.</p>


<h4 id="selected-highlights-1">Selected highlights 
</h4>
<ul>
<li>A <strong>live demo</strong> of OpenAlex&rsquo;s web interface, whose institutional data relies on ROR</li>
<li>ROR will be on the roadmap for <strong>DSpace 8.0</strong></li>
<li>ROR IDs now account for <strong>half of new affiliations created in ORCID</strong>; over 1.2 million ORCID records have a ROR ID and more than 50,000 unique ROR IDs are represented in ORCID</li>
<li><strong>Springer Nature currently has over 1.1 million ROR IDs in its metadata</strong> and its next steps are to add ROR IDs to already published content, provide ROR IDs to Crossref during metadata deposit, and evaluate other use cases for ROR</li>
<li><strong>OSTI will begin including ROR IDs in its metadata</strong> once they deploy their new internal authority as part of E-Link 2.0, their new research submission tool</li>
<li>Of the 7 repository systems participating in NIH&rsquo;s GREI initiative, <strong>5 have already adopted ROR for organizational affiliations and 3 have adopted ROR for funder associations</strong></li>
</ul>

<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/oXBgk3YWYUw" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="Community Showcase January 2024 - ROR Annual Meeting 2024"></iframe>
</div>



<h3 id="why-we-all-need-good-funding-metadata">Why We All Need Good Funding Metadata 
</h3>
<p>The session <a href="/events/2024-01-31-why-we-all-need-good-funding-metadata">Why We All Need Good Funding Metadata</a> addressed the issue that funding information is an increasingly important piece of metadata for many stakeholders, yet standardizing and using identifiers for funders remains a challenge. With ROR <a href="https://ror.org/blog/2023-09-07-open-funder-registry-transition-ror-cross-post/">set to become the standard persistent identifier for funders</a>, we convened a panel for ROR’s annual community meeting to hear diverse perspectives about how funding metadata is used and what might make it better. We heard from funders, publishers, metadata specialists, and more, including  <a href="https://chorusaccess.org">CHORUS</a>, the <a href="https://rsc.org">Royal Society of Chemistry</a>, <a href="https://datadryad.org/">Dryad</a>, <a href="https://www.crossref.org">Crossref</a>, and <a href="https://strategiesos.org/">Stratos</a> for <a href="https://parkinsonsroadmap.org/">Aligning Science Across Parkinson’s</a>.</p>


<h4 id="selected-highlights-2">Selected highlights 
</h4>
<p>All panelists were asked &ldquo;If you had a magic wand, what would be the one thing you&rsquo;d do to improve funding metadata?&rdquo; The answers:</p>
<ul>
<li>&ldquo;Make extracting PIDs and their associated metadata from articles, datasets and software <strong>simple, open and affordable</strong> for all!&rdquo;</li>
<li>&ldquo;All organisations and systems <strong>consistently using standard PIDs</strong> for people, organisations, grants, projects, data and publications throughout the funding, research and publication processes&rdquo;</li>
<li>&ldquo;Systems/tools can <strong>accurately extract &amp; identify funders from authors’ acknowledgements</strong> and match to funder IDs at the point of submission&rdquo;</li>
<li>&ldquo;All <strong>funder data available through APIs</strong>&rdquo;</li>
</ul>

<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/mDWpO8Jz2lM" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="Why We All Need Good Funding Metadata - ROR Annual Meeting 2024"></iframe>
</div>



<h3 id="lets-ror-some-more-in-2024">Let&rsquo;s ROR some more in 2024! 
</h3>
<p>ROR has come a long way in the last five years, and ROR is going even farther in the next five. Heartfelt gratitude is due to everyone in the community who has helped to make ROR a reality, including the <a href="/about/team/#operations-team">ROR Operations Team</a>; the <a href="/community/#curation-advisory-board">ROR Curation Advisors</a>; the <a href="/community/#steering-group">ROR Steering Group</a>; ROR <a href="/community/#supporters">Sustaining Supporters</a>; ROR <a href="https://bit.ly/ror-integrations">adopters and integrators</a>; members of the <a href="https://groups.google.com/a/ror.org/g/ror-tech">ROR technical forum</a> and ROR Slack; subscribers to the ROR <a href="http://eepurl.com/gjkT9H">quarterly newsletter</a>; our colleagues at <a href="https://www.crossref.org">Crossref</a>, <a href="https://datacite.org">DataCite</a>, and the <a href="https://cdlib.org">California Digital Library</a>; all the <a href="/about//team/#alumni-and-affiliates">alumni and affiliates</a> who helped build ROR; everyone who presents at a ROR community call and participates in our <a href="/categories/case-studies">case studies</a>; all those who attend ROR <a href="/events">events</a>; and the many, many people who <a href="https://github.com/ror-community/ror-updates/issues">ask for a change or addition to ROR records</a>. We wouldn’t be here without your engagement and support!</p>
<p>Hope to see you all throughout 2024 and at the next annual meeting in 2025.</p>
<figure class="blog-figure"><img src="/img/blog/recap2024/lets-ror-some-more-in-2024.png"
         alt="Thank you! Let&#39;s ROR some more in 2024!"/>
</figure>

<div class='callout grey'>
	<div class="row">
		<div class="col-md-12"><span>
<h3 id="previous-recaps-of-ror-annual-community-meetings">Previous recaps of ROR annual community meetings 
  <a href="#previous-recaps-of-ror-annual-community-meetings" aria-label="Previous recaps of ROR annual community meetings">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Previous recaps of ROR annual community meetings"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h3>
<ul>
<li><a href="/blog/2023-02-13-ror-turns-four/">2023 - ROR Turns Four</a></li>
<li><a href="/blog/2022-02-14-new-year-at-ror">2022 - A New Year at ROR</a></li>
<li><a href="/blog/2021-02-03-ror-annual-meeting">2021 - Unmute to ROAR!</a></li>
<li><a href="/blog/2020-02-10-ror-ing-in-portugal">2020 - ROR-ing Together in Portugal</a></li>
<li><a href="/blog/2019-02-10-announcing-first-ror-prototype/">2019 - Hear us ROR!</a></li>
</ul>
</span></div>
	</div>
</div>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Want to present at a ROR community meeting? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The ROR Community turned out in force for our 2024 annual community meeting. Here are the highlights from ROR's fifth anniversary celebration, a community showcase of ROR users, and a panel on the importance of funding metadata.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: The Curtin Open Knowledge Initiative and ROR</title><id>https://doi.org/10.71938/empz-p763</id><link rel="alternate" href="https://ror.org/blog/2024-01-17-coki-case-study/"/><published>2024-01-17T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Professor Cameron Neylon of Curtin University talks telephones, power outlets, chat services, persistent identifier education, federated versus centralized curation, providing actionable information to universities, and why the COKI Open Access Dashboard relies on ROR.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;But actually, in the end, with all of the architecture and systems we had to change and the move from Microsoft Academic to OpenAlex, the shift to ROR was one of the easiest ones. In fact, we just dropped the entire ROR dataset into an identifier column. It was very simple.&rdquo;</p>
<p>&ldquo;And with this broader open data ecosystem, if people feel responsible for contributing to collectively raising the quality of it and working with these community infrastructures, then the capacity for improving the data for everyone, so that everyone&rsquo;s data looks better, is just huge. And it&rsquo;s sometimes a challenge, because people are not used to feeling like they have authority. In many cases, data was always something that was done <em>to</em> them. And yet this is data about the stuff <em>we</em> are doing, and the organizations that we are.&rdquo;</p>
<p>&ldquo;Without the underlying infrastructure that ROR provides to help us triage data from multiple data sources and connect it to organizations at the right level, we couldn&rsquo;t do what we do. And as we move forward with what we&rsquo;re trying to do, and try and scale up and scale out the kind of stuff we&rsquo;re doing, it&rsquo;s great to see that ROR is quite frequently a couple of steps ahead. ROR is thinking about the same problems and trying to address them as we can see them coming down the road ahead of us.&rdquo;</p>
<p>&ndash; Professor Cameron Neylon, co-lead of the Curtin Open Knowledge Initiative</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for doing this with us. Can you please tell us your name, title, and organization?</p>


<h3 id="hahahugoshortcode-s2-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>I&rsquo;m Cameron Neylon, Professor of Research Communications at <a href="https://www.curtin.edu.au/">Curtin University</a>, in the <a href="https://www.curtin.edu.au/about/learning-teaching/humanities/school-of-media-creative-arts-and-social-inquiry/">School of Media, Creative Arts and Social Inquiry</a>, which is as mixed a school as it sounds like. We&rsquo;ve got everything from social sciences to critical literary studies to people doing painting and sculpture and journalists, so it&rsquo;s a really mixed and diverse group. And then within that, I&rsquo;m co-lead with <a href="https://staffportal.curtin.edu.au/staff/profile/view/lucy-montgomery-99ab1b13/">Professor Lucy Montgomery</a> of the <a href="https://openknowledge.community/">Curtin Open Knowledge Initiative</a>.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about the Curtin Open Knowledge Initiative. When did that get started and why?</p>


<h3 id="hahahugoshortcode-s4-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>COKI, as we call it, started probably in 2015 or 2016. Lucy and I both returned to Australia after having been in Europe and overseas, and we found the level of conversation about open scholarship in Australia to be quite backwards, and behind, and kind of frustrating. And it was particularly frustrating because it was clear that this wasn&rsquo;t for a lack of interest or desire to move things forward: it was in large part because the leadership of universities weren&rsquo;t engaging with these issues, because none of the information resources that they were using brought this up as an issue.</p>
<p>Australia is and has been particularly obsessed with university rankings, partly because the sector is very dependent on international students as a revenue source. But as we know, those rankings are incredibly narrow, backwards looking, conservative, opaque, and actually not very informative. And they certainly don&rsquo;t do anything to get a university to address questions of open access, data sharing, staff demographic diversity, or many of the kinds of issues that are bound up in this idea of open knowledge or open research.</p>
<p>What we eventually realized was that there&rsquo;s this gap in knowledge, this gap in understanding, and this gap in information resources. So the idea was to do a better job of providing actionable information in the form that universities are expecting to see it, a form that shows progress towards aspects of open knowledge and comparisons with other institutions, and then that would be one way of provoking change.</p>
<p>We were very lucky to pitch this to a receptive Deputy Vice Chancellor for Research who had just started, and as a result, we got a lot of support from the university to take this forward. That&rsquo;s all very abstract, of course. What we actually started to build was a collection of open data about institutional practice, about research outputs, open access, citations, traditional kinds of things, but also things like staff demographics and other kinds of information that we thought were relevant to this big picture.</p>
<figure><img src="/img/blog/coki/coki-website.png"
         alt="COKI website"/><figcaption>
            <p>COKI website at <a href="https://openknowledge.community">https://openknowledge.community</a></p>
        </figcaption>
</figure>

<p>We were very dependent on the growing set of open data resources that were developing, have continued to develop, and will continue to develop. And we got lucky in our timing, I guess, in the sense that we started at a point where those open data resources were just about good enough to compete with proprietary data sources. But they&rsquo;re accelerating and improving at such a rate that now they are vastly superior, in almost all cases.</p>
<p>That and various advances in cloud infrastructure and computing allowed us to bring all of these datasets together, and so with the goal of making the information actionable, we essentially brought the big open datasets together into a large database using cloud infrastructure. Then we set about building things that we thought might be useful. And as I said, we had the financial support and resources that gave us the luxury to do that and let us do it properly rather than doing small projects. That has let us build up this immense data resource, which is really powerful, but it has also let us build some of the kinds of information resources that we think are needed to help people make intelligent decisions about change.</p>


<h3 id="hahahugoshortcode-s6-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Amazing. So when and where did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s7-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>That&rsquo;s kind of hard to pin down. I have been invested for a long time in the idea that persistent identifiers are central to making use of open information at scale, and we&rsquo;ve learned that over and over again. When we started COKI, it was just before ROR existed, although there were conversations happening about heading in that direction. But <a href="https://www.digital-science.com/">Digital Science</a> had <a href="https://www.digital-science.com/news/digital-science-releases-global-research-identifier-database-grid-cc0-license/">made the GRID database available under an open license</a>, so we were happy to be able to use that as a source of information for indexing around institutions. And of course, the GRID IDs were used by <a href="https://www.microsoft.com/en-us/research/project/microsoft-academic-graph/">Microsoft Academic</a>. And Microsoft Academic was one of our central data sources.</p>
<p>Then, as ROR developed, first as a concept and then as a reality, we were watching that whole journey, waiting for the moment when the maturity of the systems was there to jump across, because we obviously preferred something that was organized through community infrastructures. The thing that really pushed us across in the end, of course, was <a href="https://www.microsoft.com/en-us/research/project/academic/articles/microsoft-academic-to-expand-horizons-with-community-driven-approach/">Microsoft Academic shutting down</a> and <a href="https://blog.ourresearch.org/were-building-a-replacement-for-microsoft-academic-graph/">OpenAlex stepping into that gap</a>, specifically. That coincided with a range of things we were doing. <strong>But actually, in the end, with all of the architecture and systems we had to change and the move from Microsoft Academic to <a href="https://openalex.org">OpenAlex</a>, the shift to ROR was one of the easiest ones. In fact, we just dropped the entire ROR dataset into an identifier column. It was very simple.</strong></p>


<h3 id="hahahugoshortcode-s8-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s great to hear. I&rsquo;m talking to an increasing number of people who are using that OpenAlex dataset, and thereby using ROR, because ROR is currently the <a href="/blog/2023-09-13-openalex-case-study/">only external institutional identifier that they use</a>.</p>


<h3 id="hahahugoshortcode-s9-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>What&rsquo;s central in our work is that we bring all these datasets together. We have OpenAlex, ROR, Crossref, Crossref Event Data, Open Citations, OpenAire, PubMed, and I&rsquo;m probably missing a few. We bring all of these datasets together, and we organize them around community governance and persistent identifiers. And rather than just saying, &ldquo;Okay, what can OpenAlex tell me about DOIs connected with this institution as identified by ROR?&rdquo; we can do that across multiple datasets. At the moment, that works most effectively for DOIs, because the DOI has the widest uptake as a standardized identifier across these systems.</p>
<p>But that also points us to what&rsquo;s really powerful in terms of being able to connect these datasets up. Increasingly, the first step we take with integrating a new dataset is if it doesn&rsquo;t have those standardized, persistent identifiers in it, we do the mapping across, because the dataset is just not very useful without it. PubMed is a good example of this; I was working with it the other day. It has a bunch of institutional identifiers, but actually mostly the affiliation information in PubMed appears to be in the form of strings. And we just can&rsquo;t leverage that dataset as well as we might for organizational information, because those links aren&rsquo;t there. We can leverage it really well in terms of the publications, and increasingly for datasets as well, because we&rsquo;ve got the DOIs. But for organizational information, it&rsquo;s just that much weaker, so things like <a href="https://openknowledge.community/dashboards/funder-countries/">getting all the information for funding decisions</a> is a lot harder than it needs to be.</p>
<figure><img src="/img/blog/coki/coki-research-funding-dashboard.png"
         alt="COKI Research Funding Dashboard"/><figcaption>
            <p>COKI Research Funding Dashboard at <a href="https://openknowledge.community/dashboards/funder-countries">https://openknowledge.community/dashboards/funder-countries</a></p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Are there any other entities that you wish were using ROR IDs more?</p>


<h3 id="hahahugoshortcode-s12-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>Oh, I mean, everyone. It&rsquo;s really difficult to describe clearly just how much of a difference it makes. And I think that we&rsquo;re rapidly moving towards a world where this is going to become self-evident. In the old days, if you wanted to ask a question, you first started with, &ldquo;Okay, which dataset should I use to answer this question, and what limitations does that dataset impose on me?&rdquo; With open data and cloud infrastructures and the availability of these datasets as a whole, that&rsquo;s not where we start to answer a question. We ask, &ldquo;Which datasets can we bring together to maximize the amount of information and the precision that we can apply to addressing this particular question or problem?&rdquo; The datasets that are easy to integrate are the ones that we use. Increasingly, if there isn&rsquo;t a ROR ID or the ORCIDs aren&rsquo;t properly handled, then we just don&rsquo;t use the datasets.</p>
<p>The other side of that is because we are operating at a fairly large scale and making these public information products, we get a lot of people coming back and saying, &ldquo;Oh, you&rsquo;re missing this,&rdquo; or &ldquo;That&rsquo;s not there,&rdquo; or &ldquo;There&rsquo;s something wrong here.&rdquo; We can feed that back, and we can feed it back really easily. An example of this exactly with ROR: there was a university and the librarians noticed on the <a href="https://open.coki.ac/">global open access website</a> that the counts of outputs seemed a bit down. They looked closely at name variants, and they discovered that there was a bookstore on campus, and OpenAlex had assigned a bunch of the outputs from that university to the bookstore. OpenAlex is still a work in progress, so that kind of thing does happen. And the bookstore wasn&rsquo;t listed in the hierarchy for the institution; if that had been the case, everything would have flowed up nicely.</p>
<figure><img src="/img/blog/coki/coki-open-access-dashboard.png"
         alt="COKI Open Access Dashboard"/><figcaption>
            <p>COKI Open Access Dashboard at <a href="https://open.coki.ac">https://open.coki.ac</a></p>
        </figcaption>
</figure>

<p>What that story illustrates is that we don&rsquo;t find these errors. The people working with the data at scale, making information products, we don&rsquo;t often find those kinds of errors. They&rsquo;re always found by people on the ground who know the context and know exactly what the numbers should look like. We get a couple of emails a week from various people saying &ldquo;This is wrong,&rdquo; &ldquo;This is under-counting,&rdquo; &ldquo;This is over-counting,&rdquo; &ldquo;These two organizations don&rsquo;t belong together,&rdquo; or &ldquo;These two organizations do belong together.&rdquo; And it&rsquo;s really easy for us to pass that information back up when there are good identifiers. And we can say, &ldquo;No, this DOI should link to this ROR,&rdquo; or &ldquo;This ROR should probably be a child of that ROR.&rdquo;</p>
<p>That flow of curation information into the system, combined with the ease of use when things are easy to interconnect, I think is ultimately going to make these little, isolated, increasingly fragmentary datasets that are difficult to work with because they don&rsquo;t have the coverage and they don&rsquo;t have the interconnection less relevant in the long term.</p>


<h3 id="hahahugoshortcode-s14-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Fascinating. Well, I hope you and your staff do refer those kinds of issues to ROR, because we are always happy to make those kinds of corrections. If there is a bookstore that should or should not be in the <a href="https://ror.readme.io/docs/relationships">relationships of a ROR record</a>, we&rsquo;re happy to add that or take it out.</p>


<h3 id="hahahugoshortcode-s15-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>It raises lots of really important questions about the importance of <a href="/about#governance">community governance</a>. In these cases, a lot of the time, when someone tells us something&rsquo;s wrong, sometimes that&rsquo;s a mistake we&rsquo;ve made, and we can correct that. Sometimes it&rsquo;s a mistake in someone else&rsquo;s data system, and then we can pass that on and explain it to them. And sometimes it&rsquo;s a thing where there&rsquo;s reasonable grounds for disagreement.</p>
<p>And in fact, what we said in this case was, &ldquo;It&rsquo;s not our place to say whether that bookstore is part of that university. That&rsquo;s a decision you guys have to make, and here&rsquo;s the <a href="https://curation-request.ror.org">ROR request form</a> for when you figure that out.&rdquo; And I think that&rsquo;s a really important part of the whole story of changing the way people think about corrections.</p>
<p>In the space of bibliometrics and scientometrics, almost every university in the world has people whose job it is to correct the things that are wrong in Web of Science and Scopus. This is like an extra full-time person on top of the hundreds of thousands of dollars that are spent on a subscription to these products, working to help correct them. But that data just goes into a black hole, which we can&rsquo;t reuse.</p>
<p><strong>And with this broader open data ecosystem, if people feel responsible for contributing to collectively raising the quality of it and working with these community infrastructures, then the capacity for improving the data for everyone, so that everyone&rsquo;s data looks better, is just huge. And it&rsquo;s sometimes a challenge, because people are not used to feeling like they have authority. In many cases, data was always something that was done <em>to</em> them. And yet this is data about the stuff <em>we</em> are doing, and the organizations that we are.</strong> I think it&rsquo;s a really interesting transition to try and navigate, to get everyone get more engaged in feeling like they have a stake in making these things better.</p>


<h3 id="hahahugoshortcode-s16-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It is interesting too, because ROR&rsquo;s curation model, as I&rsquo;m sure you know, is not the same as something like Wikipedia or Wikidata, in the sense that we curate everything. Anyone can request anything, but we evaluate that and approve or decline it centrally. Wikidata and Wikipedia have a decentralized, distributed editing model, whereas we have quite a centralized curation model.</p>
<p>But we also don&rsquo;t have a model in which only certain people can make requests, or where there is a particular person at every organization who is responsible for making sure that their information is correct in ROR. We will take a request from anybody; you don&rsquo;t have to be affiliated with that organization. And we <a href="https://github.com/ror-community/ror-updates/issues">evaluate that in the open</a> based on openly available information and on what makes sense for ROR.</p>
<p>And I think that model is unusual for people to understand. I think they tend to have a kind of a binary notion of authority over data. Either there&rsquo;s a centralized authority who is making decisions, and you are lucky if you have any input into that, or it&rsquo;s something that&rsquo;s entirely distributed. I really like our model, because I do think it has the best of both worlds. We will entertain any corrections that you wish to give us, but we do also review and curate that quite carefully.</p>


<h3 id="hahahugoshortcode-s17-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>I think I would agree, in the sense that there&rsquo;s clearly a tension here, right, between consistency of application and the scale at which the work can be done. With federated models, people assume that what you might lose in precision you gain by having this massive scale of work that can be done. But what you also do is create a massive scale of work that needs correction. It works, kind of, in some parts of Wikipedia or Wikidata, but mainly because of the absolutely vast scale of those enterprises. And anyone who&rsquo;s done anything on crowdsourcing knows that it&rsquo;s really, really hard to replicate that level of success at a smaller scale, so the tension there of needing to have a reasonable amount of effort and a reasonable application of resources but still get something out that&rsquo;s usable drives you toward the other end of that spectrum, toward centralization.</p>
<p>And indeed, I think that&rsquo;s again where the community governance and the model of transparency comes in. And it&rsquo;s got to be flexible, not rigid, because the other assumption people make is, &ldquo;Well, surely you can automate this or have some sort of system that just does all the checking for you.&rdquo; AI, wave hands, wave hands. And actually, none of those things work, in part because there are bad actors in the system, and you&rsquo;ve got to be on the lookout for that.</p>
<p>That&rsquo;s particularly true in terms of organizations and who they are and what they&rsquo;re connected with and who&rsquo;s connected with them. Those are exactly the areas where <a href="https://doi.org/10.1002/leap.1257">we&rsquo;ve suffered a lot of fraud in the scholarly system</a>, so that&rsquo;s got to be managed in some way that&rsquo;s ideally flexible and transparent, but also trustworthy. At the end of the day, that&rsquo;s where you&rsquo;ve got to be going, otherwise these things just aren&rsquo;t usable.</p>


<h3 id="hahahugoshortcode-s18-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Can I ask, do you teach?</p>


<h3 id="hahahugoshortcode-s19-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>I don&rsquo;t do a whole lot of teaching. My role is mainly research, at least at the moment. Our work doesn&rsquo;t neatly fit into a teaching program in the school that we&rsquo;re within. Like a lot of multidisciplinary groups, there are challenges in terms of where we are and what we&rsquo;re teaching. It&rsquo;s a peculiarity of the Australian system that there is a tendency to separate research and teaching, and that&rsquo;s driven by the financial model that the government imposes on Australian universities more than anything else. So, yes, I think there&rsquo;s always some challenges around that.</p>
<p>That said, we are in the school that contains the <a href="https://www.curtin.edu.au/study/offering/course-pg-graduate-diploma-in-information-and-library-science--gd-inflsc/">Information and Library Science program</a>, and so we do teach into that, and then more broadly, into what you might call Critical Information Studies. If you&rsquo;re over in the sciences, you&rsquo;d probably call it Data Science and have a little bit of ethics and critical theory applied. If you imagine the other side of that, we&rsquo;re <em>starting</em> from critical perspectives about what information is and what information does and whose benefit it serves, and then teaching a bit of the technical side so that people get an understanding of how that works out in practice: what the nitty-gritty of actually doing data analysis and visualization looks like, and why the tools are not neutral, and why it often looks like the entire world&rsquo;s data has been shoved through an Excel spreadsheet, because the limitations of Excel spreadsheets are what drive the things people do with data.</p>


<h3 id="hahahugoshortcode-s20-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Ha! Right. The reason I asked is, I imagine you have fellows or postdocs or graduate students or whatnot working with you on your projects, and I&rsquo;m just curious how you address persistent identifiers when you&rsquo;re passing knowledge along to the next generation, whether that&rsquo;s in a classroom setting or in an apprenticeship-slash-lab setting.</p>


<h3 id="hahahugoshortcode-s21-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>That&rsquo;s a good question. I would start by observing that we&rsquo;ve probably still got some work to do with the current and indeed past generations on the use of persistent identifiers.</p>
<p>Where we do teach it is we have a team, we bring people into that team, and where we educate people about this is through the practice. We have processes by which we write up documents and put them by default on Zenodo in <a href="https://zenodo.org/communities/coki">a COKI Zenodo community</a> so they get a DOI and so they can be automatically connected to people&rsquo;s ORCIDs. We use the <a href="https://doi.org/10.1371/journal.pone.0244611">Tenzing tool to apply CRediT taxonomy</a> when we prepare things, or at least we try to do that as much as possible.</p>
<p>At the social level, then, if you like, we try and embed it as much as possible into what we do and how we do it. And then at the technical level &ndash; and most of our team is pretty technical &ndash; it also goes into the sort of architectural principles we have for our data structures. In more traditional teaching, we teach some courses around data and telling stories with data.</p>
<p>And actually, introducing the concept of persistent identifiers and why they matter is always a challenge. Do you present it in kind of an abstract and theoretical sense of how well the system <em>could</em> work? That doesn&rsquo;t really speak to very many people. Or do you present it in a much more practical and concrete way? There, you run up against the fact that at least until recently many of these systems just didn&rsquo;t work as well as they might.</p>
<p>I&rsquo;m always reminded of the person who, when being asked to sort out their ORCID record, upon being told this was the last time they&rsquo;d need to sort out a personal profile as a researcher, observed &ldquo;Well, that was what I was told the last five times I had to do a personal profile as a researcher, and every one of those has fallen by the wayside, and now you want me to do another one, so how can you promise me another one&rsquo;s not going to come along next week?&rdquo; One of the big challenges with all of these systems is they will work really well when everyone&rsquo;s using them and when everyone doesn&rsquo;t even think about using them, but getting there is hard work.</p>


<h3 id="hahahugoshortcode-s22-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes. And the metaphor I always think of, which is possibly a bit too simplistic, is the telephone. You have telephone numbers, and everyone needs a unique telephone number, and telephones themselves increase in utility the more people have them. And the more people who have telephones, the more important it is that you have unique telephone numbers. And if I were in a research position at a university, I would probably start doing all kinds of research into party lines and alphanumeric exchanges and the emergence of cell phones and things like that, you know, looking at the history of how we got to the globally standardized system that we have today for telephone numbers.</p>
<figure><img src="/img/blog/coki/Portland_telephone_directory_1920_DPLA_page_44.jpg"
         alt="Portland, Indiana telephone directory from 1920, page 44"/><figcaption>
            <p>Page 44 of the Portland, Indiana telephone directory from 1920, available via DPLA <a href="https://commons.wikimedia.org/wiki/File:Portland_telephone_directory,_1920_-_DPLA_-_30aa55431c1287a061f1836b3d2a642f_(page_44).jpg">on Wikimedia Commons</a></p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s24-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>And I guess the challenge with that, thinking about this, is that if you take that analogy through to the current world of social media or streaming video platforms, everything&rsquo;s so vertically integrated that I don&rsquo;t know if very many people are aware of why it is that a phone number can be a standard thing, despite the fact that lots of different people make phones and lots of different carriers will carry phone calls. And similarly with email, because those things are the last residue of that in the world as a whole.</p>
<p>Most of the things we deal with don&rsquo;t have those standards, and that&rsquo;s why they&rsquo;re awful, right? It&rsquo;s why you&rsquo;ve got three streaming subscriptions, not one TV aerial, and why you&rsquo;ve got seventeen different direct message or personal messenger accounts across multiple different services, none of which you really like to use, but because there&rsquo;s one person who does like to use that particular one you have to have it. And you know, the seemingly obvious concept that you should be able to get those messages wherever you want to, and send them to anyone based on a standardized sort of protocol or system, has partly been lost. And that&rsquo;s part of the battle of making the case for how these things can work better, is, yes, recalling the fact that you can in fact make phone calls and you don&rsquo;t have to have an AT&amp;T phone and a British Telecom phone.</p>
<figure><img src="/img/blog/coki/chat_systems.png"
         alt="xkcd 1810 Chat Systems - diagram of dozens of chat systems expressed as circles with different cartoon people drawn in each circle"/><figcaption>
            <p>xkcd 1810, <a href="https://xkcd.com/1810/">Chat Systems</a></p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s26-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s right. I think that&rsquo;s such a good point about looking at the failures of standardization, and the takeover of non-open commercial services for these things. I suppose some other examples would be power systems, globally. You know, when you travel from country to country, the plugs are different in the United States than they are in Europe, and, as I discovered recently, also different in South Africa. Measurement systems, metric versus non-metric, not standard everywhere you go. Even driving: which side of the road you drive on and which side of the car the steering wheel is on. All of those things are small frictions, and yet they&rsquo;ve been standardized enough to where they work regionally, and we can see how well it would work if they were standardized globally.</p>
<p>Time zones, actually! Obviously we have to have time zones, just because of the fact of having a globe, but I keep thinking, &ldquo;Wouldn&rsquo;t it be nice if everybody got rid of Daylight Saving so that we didn&rsquo;t have the problem of wondering if such and such a country has gone on Daylight Saving?&rdquo; If it were a standard interval between each timezone and didn&rsquo;t change with the seasons it would make scheduling international meetings so much easier.</p>


<h3 id="hahahugoshortcode-s27-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>And also the weird time zones, as well. The power one is actually quite an interesting one, because there does seem to be an emerging plug standard. There are actually moves towards persuading everyone to just use one. I mean, it&rsquo;s a bit ridiculous, but given that a lot of devices that we use, with some exceptions, can run off 5 volts, an awful lot of things could just use USB plugs. There&rsquo;s a 5 volt / 12 volt / 24 volt issue, but a lot of things could use USB, and we&rsquo;re sort of iterating towards that. Interestingly, it&rsquo;s driven not really by thoughtful top-down standardization, but by frustration of people who are traveling and the bag full of adapters that you&rsquo;re carrying, if you&rsquo;re going to more than one place. I do actually carry around now just one or two USB adapters with swappable plugs, because that works reasonably well. Until I get to South Africa and realize that <a href="https://www.worldstandards.eu/electricity/plug-voltage-by-country/south-africa/">that particular plug is quite unusual</a>.</p>
<p>But yes, I think there&rsquo;s something really interesting there. The other example from the power world, of course, is that once upon a time you couldn&rsquo;t take any appliance from, say, the US at 110 volts and bring it to anywhere in the world running at 240 volts. But these days, I don&rsquo;t even remember that I should probably check before I plug something in with an adapter, because the transformers are all standardized to deal with both cases, so you don&rsquo;t usually have to think about needing a step down transformer to operate.</p>
<figure><img src="/img/blog/coki/Map_of_the_world_coloured_by_voltage_and_frequency.png"
         alt="Map of the world coloured by voltage and frequency"/><figcaption>
            <p>Conrad H. McGregor, Map of the Countries of the World Colored by the Nominal Voltage and Frequency They Use, available on <a href="https://commons.wikimedia.org/wiki/File:Map_of_the_world_coloured_by_voltage_and_frequency.png">Wikimedia Commons</a></p>
        </figcaption>
</figure>

<p>And that gets back to things like ROR as interchanges, where yes, there are legacy systems of other identifier systems out there, and there are systems that perhaps ROR shouldn&rsquo;t cover that maybe ISNI needs to, but at least if we&rsquo;ve got places to do lookups, and those are, again, centrally maintained and kept reasonably up to date, then we don&rsquo;t need to think too much about that as we translate between things.</p>
<p>What I wish we also had, and ROR does this well, and other identifier systems not as well yet, is exactly this thing of collections of objects. Most of the work we do involves taking sets of research outputs and bundling them up into groups where those groups might be, they might be the outputs of a person, they might be the outputs of a project or an institution or a country or a discipline, or whatever it might be. And in the institution case, where because of the <a href="/blog/2023-02-27-parents-children-and-other-relationships-in-ror/">parent-child relationships in the ROR schema</a>, we can go up and down those hierarchies and figure out the level we want to operate at. Whereas I&rsquo;d say with DOIs, that&rsquo;s not so easy, because there&rsquo;s a lot of complexity and inconsistency in relationships between DOIs for collections versus DOIs for objects. And that&rsquo;s before I start getting annoyed about ISBNs and book chapters.</p>
<p>Project IDs, I think, are an area that&rsquo;s missing a lot of space, and I think institutions really ought to do a much better job of providing ways of interconnecting between ORCIDs and institutions. There&rsquo;s lots of opportunities there. And thinking carefully about those collections kind of questions, I think, is really valuable.</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Maybe that&rsquo;s a good occasion for me to ask, what do you hope ROR does better in the future? What can we do that would make your life easier?</p>


<h3 id="hahahugoshortcode-s30-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>There&rsquo;s very little that we bump up against as a problem. At the moment, I think most of the immediate issues we&rsquo;re facing are to do with the challenges of actually making the interconnections, so the assignments of, say, outputs to two organizations, and the heuristics involved with that, which is a hard problem to solve.</p>
<p>The area where I can imagine the real benefits will arise, and this is already happening, is the expansion of the set of organizations being covered, and that&rsquo;s really welcome. When there&rsquo;s a <a href="https://github.com/ror-community/ror-updates/tags">new release</a>, that number clicks up and you think, &ldquo;Oh, my brain is about three releases behind, I thought it was still 95,000, and now it&rsquo;s 110, 115, 120, something like that.&rdquo; [<em>Note: It&rsquo;s currently <a href="https://github.com/ror-community/ror-updates/releases/tag/v1.38">over 107,000</a>.</em>] I think that&rsquo;s great. I&rsquo;m really glad that people are thinking hard about how to manage those questions of curation, not because I have opinions necessarily about how they should be done better, but because I know that they are really hard problems. I&rsquo;m glad smart people are thinking about them.</p>
<p>One of the questions I have, and it&rsquo;s an issue that, again, is a real challenge, both from a curation perspective and from a schema perspective, is how we can tackle the question of units within organizations and what that looks like. I&rsquo;ve seen that on both sides. I completely understand why people would want to run screaming in the opposite direction from even trying to deal with it. And I can also see from within institutions how important it is to have the capacity to be able to do that. I think that&rsquo;s an area that&rsquo;s really worth thinking through. And the answer might be something like component DOIs, so actually for ROR to not deal with it, but to allow institutions to self-manage. I can also see that going horribly wrong.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s32-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>I&rsquo;m not sure I&rsquo;ve got a better solution. But one of the things we see really frequently is exactly that question of how to reflect back to an institution what the parts of it are looking like while having to deal with the fact that only the institution itself has any sort of real knowledge of what that looks like. And it&rsquo;s changing so fast!</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, I know. <a href="/about/faqs/#why-doesnt-ror-include-university-departments">Departments</a>, in particular, will merge and change names and whatnot. And I have to tell you, I absolutely believe that there would be internal arguments at an institution about what their institutional hierarchy would be.</p>


<h3 id="hahahugoshortcode-s34-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>These are real issues, right? This is the thing we discovered, I mean, coming right back to the starting point of why we were doing this in the way we did. What a Vice-Chancellor or the University President sees is something like &ldquo;Here are the faculties, here are the number of outputs, here are the number the citations, here&rsquo;s the amount of research income.&rdquo; It&rsquo;s really at a very high level. And those numbers are garbage for the most part. They are appallingly bad. Years out of date, and often related to an organizational structure that hasn&rsquo;t existed for half a decade or more.</p>
<p>And so there&rsquo;s a desperate need for automating more of this so that information will be more up-to-date and more responsive and more flexible. But at the same time, if heads of schools are being judged on how much research revenue they&rsquo;re getting assigned to them, <em>and</em> they&rsquo;re being judged on how much they&rsquo;re collaborating across the university, are they going to quietly not argue about where that money got assigned to, even if it then got spread out across multiple faculties or departments?</p>
<p>It&rsquo;s a real issue on which people&rsquo;s livelihoods depend. They&rsquo;re really complicated questions, and getting them wrong can break institutions. Information has to be summarized, and summarization is a lossy process, and so you&rsquo;ve got to make decisions about what you lose and what matters.</p>
<p>And maybe that&rsquo;s the other piece, and I think this isn&rsquo;t just a challenge for ROR, but the question of literacy. How do we raise the literacy of all the players? Not to the extent that they need to know the nitty-gritty of all the details of how everything works, but so they can ask intelligent questions about how the information that they&rsquo;re getting has been processed, and whether at least in principle there&rsquo;s a transparent audit trail so they can go back and check and see how things might have looked if it was done differently. And again, that&rsquo;s something we build into pretty much everything we do now.</p>
<p>It&rsquo;s amazingly common in the proprietary world not to do this. University rankings are particularly bad at this. The thing you do if you&rsquo;re going to change something is you do one run the old way, and then you do your next run the new way, and you look at how different they are. And you make that clear and transparent, and ideally, you&rsquo;ve got open source code. It&rsquo;s critically important, but it&rsquo;s amazing how rarely it&rsquo;s done, and even where it is done, without community-owned open indexes and infrastructures and systems, it&rsquo;s really hard for that to be for that to be transparent.</p>
<p>These are the real challenges. People are exercised about artificial intelligence, and this, that, and the other, but the problems aren&rsquo;t really to do with the scale at which these computational systems can work. They&rsquo;re to do with the fact that people are making decisions so far away from the point where the context and on-the-ground information is that the quality of the information that&rsquo;s going through those systems needs to be a lot higher than it is.</p>
<p>And the consequences of getting it wrong is that entire departments or even entire universities get closed down. Whole programs of research cease to exist. Teaching programs that you might need tomorrow are axed, because apparently someone didn&rsquo;t get a job five years ago. Every time there&rsquo;s a major world event, there&rsquo;s a story that goes round about how someone three days earlier didn&rsquo;t get a grant about that particular thing, because it wasn&rsquo;t important. And suddenly it is.</p>


<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, absolutely. Well, we can wrap up. What else would you like to say about ROR or about your work?</p>


<h3 id="hahahugoshortcode-s36-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>At core what I would say is we couldn&rsquo;t do what we do without ROR. We&rsquo;re focused on organizations and institutions. <strong>Without the underlying infrastructure that ROR provides to help us triage data from multiple data sources and connect it to organizations at the right level, we couldn&rsquo;t do what we do. And as we move forward with what we&rsquo;re trying to do, and try and scale up and scale out the kind of stuff we&rsquo;re doing, it&rsquo;s great to see that ROR is quite frequently a couple of steps ahead. ROR is thinking about the same problems and trying to address them as we can see them coming down the road ahead of us.</strong> So, yes: keep on keeping on.</p>


<h3 id="hahahugoshortcode-s37-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We will! We&rsquo;re feeling good. We are hiring more curator help and thinking about ways to scale up that curation that you noticed is so crucial. This has been really marvelous. I only wish we could talk for another half an hour about infrastructure generally. Thanks so much.</p>


<h3 id="hahahugoshortcode-s38-hbhb-cameron-neylon"><figure class="round-figure"><img src="/img/blog/coki/cameron-neylon.png"
         alt="Avatar of Cameron Neylon"/>
</figure>
 Cameron Neylon 
</h3>
<p>Have a good day.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Professor Cameron Neylon of Curtin University talks telephones, power outlets, chat services, persistent identifier education, federated versus centralized curation, providing actionable information to universities, and why the COKI Open Access Dashboard relies on ROR.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR's Year in Review: 2023</title><id>https://doi.org/10.71938/r78t-8b94</id><link rel="alternate" href="https://ror.org/blog/2023-12-15-ror-year-in-review-2023/"/><published>2023-12-15T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Year in Review"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>As we near the end of 2023, we are taking a moment to reflect on what has been another significant year for ROR. We are thrilled that ROR is widely recognized and trusted as the leading standard for organization identifiers. This recognition is reflected in the skyrocketing use of ROR&rsquo;s API (which reached a high of 27 million requests per month), wide range of implementations across the research ecosystem, and non-stop activity in ROR&rsquo;s curation pipeline (currently handling approximately 700 requests per month). This momentum signals the broad adoption of ROR as well as the unequivocal need for something like ROR to fill crucial gaps in the landscape and address critical use cases around the tracking of research outputs. Heading into 2024, we expect that this growth will continue and that even more use cases will continue to develop.</p>
<p>Looking back on 2023, we wanted to highlight a selection of milestones from the past year that we are especially proud of and that particularly represent ROR’s unique journey so far.</p>


<h3 id="ror-as-a-new-solution-for-identifying-funders-and-addressing-key-use-cases-for-funding-metadata">ROR as a new solution for identifying funders and addressing key use cases for funding metadata 
</h3>
<p>In September, <a href="https://ror.org/blog/2023-09-07-open-funder-registry-transition-ror-cross-post/">Crossref and ROR announced that the Open Funder Registry would be merged into ROR</a>. With a significant number of Funder IDs <a href="https://ror.org/blog/2023-10-12-ror-funder-registry-overlap/">already represented in the registry</a>, ROR is well positioned to incorporate the Funder Registry and be used in research and publishing systems to identify funding organizations. Users are beginning to switch from the Funder Registry to ROR, and we’ll be sharing more updates on this progress as the transition continues into the new year.</p>
<figure class="blog-figure"><img src="/img/blog/funder-overlap/crossref-funder-overlap.png"
         alt="Overlap between Crossref Open Funder Registry and ROR as of October 2023"/><figcaption>
            <p>Aggregate overlap between Crossref Open Funder Registry and ROR as of October 2023</p>
        </figcaption>
</figure>



<h3 id="notable-ror-integrations-announced-updated-and-completed">Notable ROR integrations announced, updated, and completed 
</h3>
<p>ROR adoption in publishing platforms continues to grow, with recent developments in AAAS&rsquo;s Science Content Tracking System, Octopus, Janeway, <a href="https://www.youtube.com/playlist?list=PLagyLN5PZI3zVbJNYCNFmL1cJif_31ReV">ResearchEquals</a>, and ScienceOpen. This year saw more integrations in repository and CRIS systems such as <a href="https://knowledge.figshare.com/article/figshare-and-the-generalist-repository-ecosystem-initiative-grei-what-weve-achieved-and-where-were-heading">figshare for Institutions</a>, ChemrXiv, DSpace-CRIS, InvenioRDM, and <a href="https://blog.zenodo.org/2022/12/07/2022-12-07-zenodo-on-inveniordm/">Zenodo</a>; and in many other tools and indexes such as the COKI Open Access Dashboard and OA.Report. We&rsquo;re looking forward to hearing about and announcing even more ROR integrations in 2024. Throughout the year, the ROR blog featured in-depth conversations with ROR adopters about how and why they are using ROR to address key needs in their workflows. Check out the <a href="https://ror.org/categories/case-studies/">complete list of case studies</a>, and let ROR know if you have suggestions for upcoming features.</p>
<figure class="blog-figure"><img src="/img/blog/recap2023/ror-case-studies.png"
         alt="ROR adopter case studies"/><figcaption>
            <p>ROR adopter case studies</p>
        </figcaption>
</figure>



<h3 id="development-of-version-2-of-the-ror-schema-and-api">Development of version 2 of the ROR schema and API 
</h3>
<p>Following extensive community input, ROR developed the first major update to its schema and API (v2), which will incorporate key feedback from users and integrators and enable ROR to build upon the data model originally inherited from GRID. The <a href="https://ror.org/blog/2023-09-14-beta-test/">beta version of the new schema was released in September</a>, and the production release is expected early in the new year.</p>
<figure class="blog-figure"><img src="/img/blog/recap2023/ror-v2-elements.png"
         alt="Elements of ROR schema version 2"/><figcaption>
            <p>Elements of ROR schema version 2</p>
        </figcaption>
</figure>



<h3 id="enhancing-rors-global-coverage-and-metadata-quality">Enhancing ROR&rsquo;s global coverage and metadata quality 
</h3>
<p>This year marked significant growth in ROR&rsquo;s <a href="https://ror.readme.io/docs/updates#community-based-curation-model">community-based curation model</a> and included a number of dedicated efforts to improve the registry&rsquo;s coverage in specific areas. These efforts included&ndash;among many others&ndash;a collaboration with France&rsquo;s <a href="https://www.enseignementsup-recherche.gouv.fr">Ministère de l’enseignement supérieur et de la recherche</a> focused on representation of French research organizations and their research units, comprehensive reviews of Latin American and Caribbean organization records by members of <a href="https://www.lareferencia.info">LA Referencia</a>, and analyses of Spanish organizations by both <a href="https://www.fecyt.es">FECYT</a> and <a href="https://www.csic.es">CSIC</a> have contributed to enhancing the data quality and coverage for the country. In the United States, collaborations with government organizations such as the <a href="https://www.osti.gov">U.S. Department of Energy Office of Scientific and Technical Information</a>, <a href="https://www.noaa.gov/">NOAA</a>, and the <a href="https://discover.dtic.mil/">Defense Technical Information Center</a> have ensured more comprehensive and accurate representation of US government organizations. We are grateful for the numerous contributions provided by ROR’s community, which have greatly enhanced the registry&rsquo;s value and utility.</p>
<figure class="blog-figure"><img src="/img/blog/recap2023/ror-updates-projects.png"
         alt="ROR Curation Updates projects board on GitHub"/><figcaption>
            <p>ROR Curation Updates projects board on GitHub</p>
        </figcaption>
</figure>



<h3 id="participation-in-a-range-of-community-events">Participation in a range of community events 
</h3>
<p>In 2023 we joined community members in person and online at the Society for Scholarly Publishing annual meeting, the <a href="https://ror.org/blog/2023-05-09-pids-open-science-latin-america/">#PIDsLATAM workshop</a> in Buenos Aires, a workshop at the Open Repositories conference, the biannual meeting of the  <a href="https://www.issi-society.org/conferences/">International Society for Scientometrics and Informetrics</a> (ISSI), and a number of webinars with collaborators including ORCID, <a href="https://datacite.org/event/the-role-of-funders-in-building-a-robust-and-trustworthy-output-tracking-mechanism-using-pids-and-open-metadata/">DataCite</a>, <a href="https://africarxiv.pubpub.org/pub/h50au5wa/release/1">AfricArXiv</a>, and <a href="https://ror.org/events/2023-12-07-us-federal-guidance-and-pids/">DOE/OSTI</a>.</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-group-photo.jpg"
         alt="Participantes (¡Felices!) del evento PIDs y ciencia abierta en América Latina"/><figcaption>
            <p>Participantes (¡Felices!) del evento PIDs y ciencia abierta en América Latina</p>
        </figcaption>
</figure>

<p>We will be kicking off the new year with a celebration of ROR&rsquo;s fifth anniversary at the end of January. Every year since ROR&rsquo;s launch, we have celebrated the anniversary with an open community meeting to reflect on the past and plan for the future. The 2024 meeting will be an opportunity to mark five years of ROR and bring the community together to discuss the next five years and beyond. Stay tuned for more information about the celebration, and in the meantime please reserve the slots on your calendar by registering at the links on ROR&rsquo;s <a href="/events">Events page</a>.</p>
<p>ROR wouldn&rsquo;t be ROR without the community behind it. We express our sincere gratitude and appreciation for the many ways in which community members around the world have supported and engaged with ROR over the past year: showing up to community calls, giving us feedback on ROR’s technical development work, submitting registry updates, building ROR implementations, making financial contributions, and much much more.</p>
<p>Here&rsquo;s to 2023, and to ROR-ing more in 2024!</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Contact us at <a href="mailto:info@ror.org">info@ror.org</a> with any questions or comments.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">As we near the end of 2023, we are taking a moment to reflect on what has been another significant year for ROR.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: Clear Skies, Research Integrity, Data Science, and ROR</title><id>https://doi.org/10.71938/k0xk-2x70</id><link rel="alternate" href="https://ror.org/blog/2023-11-27-clear-skies-case-study/"/><published>2023-11-27T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Adam Day, CEO of Clear Skies Ltd., discusses how he works to improve research integrity with tools like Papermill Alarm, why such tools can assist but can&rsquo;t replace human investigators, and what&rsquo;s so important for data scientists about free and open identifers like ROR.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;Having institutional identifiers in there, it&rsquo;s part of the puzzle. And it&rsquo;s an important part of the puzzle. But there are limits to data analysis. Data analysis has got to help people to do the investigation – not try to do it for them.&rdquo;</p>
<p>&ldquo;If we&rsquo;re talking about misconduct, then you might need to be able to contact the institution that the author is from. On an individual manuscript, it doesn&rsquo;t matter if there&rsquo;s no identifier – an address will do. But if you find some signal that is on manuscripts at scale, and you&rsquo;ve got thousands of them, well, you need an identifier. You can&rsquo;t go through them and try and search for every single one of those institutions.&rdquo;</p>
<p>&ldquo;Having nice, clean, big, open datasets like this is really valuable, especially for a problem like this one, a problem which does require taking a big step back and looking at all of the data together, and not having barriers in the way of getting parts of it.&rdquo;</p>
<p>&ndash; Adam Day, CEO, Clear Skies Ltd.</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for agreeing to talk to me about ROR. Can you begin by telling us your name, title and organization?</p>


<h3 id="hahahugoshortcode-s2-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>Adam Day, CEO, <a href="https://clear-skies.co.uk/">Clear Skies, Ltd</a>.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Lovely. Tell us about Clear Skies Ltd. When was it founded? What does it do? What makes it special?</p>


<h3 id="hahahugoshortcode-s4-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>Clear Skies was founded in, let&rsquo;s see, the first quarter of 2022, so not very long ago. The whole idea behind Clear Skies is to take data from multiple sources, bring it together, and use it to help research integrity. That’s what makes us special – our data is completely unique. But I mean ‘research integrity’ in a broad sense: not just research integrity as in dealing with misconduct or papermills, but the whole process. What can we do by bringing data together to help make research better or more robust?</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I know that your flagship product is <a href="https://www.science.org/content/article/fake-scientific-papers-are-alarmingly-common">The Papermill Alarm</a>, but does Clear Skies have other products that you&rsquo;re working on?</p>


<h3 id="hahahugoshortcode-s6-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>We do. That being said, the Papermill Alarm has been a much bigger success than I had expected. Initially, the Papermill Alarm was one pipeline for detecting papermills, but then we came up with a bunch of other pipelines for that purpose. To avoid confusion, we rolled all of those services up into one product with simple, clear, output. That’s the Papermill Alarm.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>When and where did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s8-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>I was at a FORCE11 conference in Edinburgh, and one of the sessions was about ROR. I wanted to do analysis at an institutional level. So, you want to say, &ldquo;Okay, which institution is producing the most papers on some topic?&rdquo; or &ldquo;Which is the best-cited institution on some topic?&rdquo; or maybe &ldquo;Which institution in which country is growing the fastest?&rdquo; Those are the sorts of questions that you might want to ask, and if you only have, say, addresses to go on, that&rsquo;s a total nightmare. Completely impossible at industry-scale.</p>
<p>There are other systems for identifying institutions, but they are not easily connected with the data. When you look at a dataset like <a href="https://openalex.org">OpenAlex</a>, a very large part of it has these ROR IDs, and you go, “Great, I could just use that&quot; and suddenly all this institutional analysis is possible at any scale.</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And then for what purpose did you begin using ROR?</p>


<h3 id="hahahugoshortcode-s10-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>At Clear Skies, I wanted to examine the papermill situation at an institutional level. Initially, this was part of sense-checking the results coming from the Papermill Alarm development process. ROR showed which institutions trigger the Papermill Alarm most often – and it turns out that the same institutions already have retractions relating to papermills. I doubt that’s a coincidence!</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>How important do you think that organization correlation is in detecting papermills? What are the other factors that are important in that?</p>


<h3 id="hahahugoshortcode-s12-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>There are a lot of factors in there. Institution is part of the picture. The key metrics that have proven the value of the Papermill Alarm are things like:</p>
<ul>
<li>
<p>Recall of known papermill-products. When we train the Papermill Alarm, we show it examples of papermill-products, but not all of them. We keep some aside. Then, after training, we show some non-papermill-products and those known papermill-products. If the Papermill Alarm can accurately flag the known papermill-products and flags none of the non-papermill-products, then we have good evidence that it works.</p>
</li>
<li>
<p>Prediction of mass-retractions. There is a long and growing list of these. Given real-world data, the Papermill Alarm consistently identifies the potential for these events.</p>
</li>
</ul>
<p>In practice, the fundamental thing is that there has to be a human-in-the-loop in the analysis of every case. It would be fantastic if we could just have this wonderful, magical machine that could detect all of the fraud and deal with it for us with 100% accuracy. But the reality is that you need to have humans in there who can exercise judgment over each case.</p>
<p>Imagine a plagiarism case, where you find two papers that are basically identical. And so you say to the author, &ldquo;Hey, look, we found these two papers, your paper and this other one, and they&rsquo;re the same. Did you copy this paper?&rdquo; And the author might come back and say, &ldquo;No, I didn&rsquo;t. It just coincidentally happens that I wrote an identical paper to this other person who I have never met.&rdquo; That is an excuse that comes up sometimes. And, of course, there has to be someone there who can look at the evidence, weigh it up carefully, and say &ldquo;I don&rsquo;t believe you.&rdquo;</p>
<p>That&rsquo;s it. <strong>Having institutional identifiers in there, it&rsquo;s part of the puzzle. And it&rsquo;s an important part of the puzzle. But there are limits to data analysis. Data analysis has got to help people to do the investigation – not try to do it for them.</strong></p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What other persistent identifiers do you use in Papermill Alarm?</p>


<h3 id="hahahugoshortcode-s14-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>There are lots of things that come into the analysis. <a href="https://doi.org">DOIs</a>, frankly, are absolutely critical. Then there&rsquo;s <a href="https://orcid.org">ORCID</a>, which we use to some extent, although it is not universally adopted. There isn’t a perfect way to identify journals and publishers, but we get pretty far using a mix of OpenAlex’s IDs, ISSNs and Crossref’s member id.</p>
<p>Does the ROR identifier get added from the affiliation string?</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes! We actually just published <a href="https://ror.org/blog/2023-09-13-openalex-case-study/">a case study with OpenAlex</a>, which you may want to read. They did some pretty heavy lifting in terms of doing that string to ROR ID matching, too, and apparently it&rsquo;s working quite well. The large majority of OpenAlex materials do have ROR IDs, and they are using exclusively ROR IDs for organizations right now.</p>
<p>Can I ask, do you do this inductively, where you look for interesting patterns in the data? Or do you start out with a hypothesis and then see whether or not it checks out based on your analysis?</p>


<h3 id="hahahugoshortcode-s16-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>The original Papermill Alarm started out with a hypothesis about what was going to be detectable and what wasn&rsquo;t. And it turned out to be a lot more than I&rsquo;d expected. But there have been plenty of times when I&rsquo;ve been digging through data (something which I do more than I would care to admit). And then you&rsquo;ll see something and think, &ldquo;Oh, maybe I could look at this.&rdquo; And it turns out that the thing you&rsquo;re looking at is quite important. And also, more often than not, the thing you&rsquo;re looking at turns out to not be important. So hypotheses come up, you know, as you&rsquo;re digging through stuff, and you find things that way.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That sounds like good science, I&rsquo;d say. Going back to something we were talking about earlier, how do you think that persistent identifiers like ROR can help with the overall project of trying to ensure research integrity?</p>


<h3 id="hahahugoshortcode-s18-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>On the one hand, it&rsquo;s just about data integrity. ROR is a key part of the data. We need to connect these articles and these individuals to the institutions where they work. That needs to be done well. The same is true of all persistent identifiers – they allow you to connect up the data at scale. Getting persistent identifiers right means we have a complete and accurate picture of things.</p>
<p><strong>If we&rsquo;re talking about misconduct, then you might need to be able to contact the institution that the author is from. On an individual manuscript, it doesn&rsquo;t matter if there&rsquo;s no identifier – an address will do. But if you find some signal that is on manuscripts at scale, and you&rsquo;ve got thousands of them, well, you need an identifier. You can&rsquo;t go through them and try and search for every single one of those institutions.</strong></p>
<p>But in terms of integrity generally, there&rsquo;s also just this point about accountability. Who is accountable for the content of the paper? You&rsquo;ve got the individual researchers, and you can identify them. But then who else is accountable? Well, the publisher is, because they are the ones who peer-reviewed it, and they&rsquo;re putting their name on it. Then you&rsquo;ve got the institution as well, and they&rsquo;re also accountable. So when it comes to the difficult task of investigating misconduct, having the ROR institutional ID on there means you know who to call.</p>


<h3 id="hahahugoshortcode-s19-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Do you do any work with funders at all?</p>


<h3 id="hahahugoshortcode-s20-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>We don&rsquo;t. At least not yet.</p>


<h3 id="hahahugoshortcode-s21-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I almost think of funders as more the regulatory or enforcement part. OpenAlex actually uses ROR to identify funders in its funder model as well, because funders are in scope for ROR, and we have lot of them. You may know that Crossref runs the <a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry</a>, which has been curated by Elsevier, and we&rsquo;ve just <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">announced that the Open Funder Registry will eventually be deprecated in favor of ROR</a>. ROR IDs to identify funders is the future.</p>


<h3 id="hahahugoshortcode-s22-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>Oh, great! I like that!</p>


<h3 id="hahahugoshortcode-s23-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It may be a while. We&rsquo;re thinking on the scale of about two years or so. But we&rsquo;ve got <a href="https://ror.org/blog/2023-10-12-ror-funder-registry-overlap/">more blog posts coming out about that</a>.</p>


<h3 id="hahahugoshortcode-s24-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>I saw the announcement about <a href="https://www.crossref.org/blog/news-crossref-and-retraction-watch/">Crossref and Retraction Watch</a>. That&rsquo;s great news.</p>


<h3 id="hahahugoshortcode-s25-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, it is! Glad you saw it and that you think so. Now, let me give you a chance to make some feature requests. What would make your life easier if ROR did something differently?</p>


<h3 id="hahahugoshortcode-s26-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>Being able to go down to the departmental level, that would certainly be useful in cases where you&rsquo;re finding an institution where there are tons of problem papers going in, but it&rsquo;s a gigantic institution and they&rsquo;re putting out thousands of papers a month. When you&rsquo;re finding dozens of problem papers coming out in all of these thousands, that might not actually tell us all that much. But then if we were to look at that at a department level, and see that those dozens are all coming from one department, that&rsquo;s a really useful signal.</p>
<p>The last time I did some analysis on that, I think I wasn&rsquo;t able to drill down.  Maybe when some people have got their ROR ID, they&rsquo;re going to have either the top-level one or one that&rsquo;s lower down, not both.</p>


<h3 id="hahahugoshortcode-s27-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>All of those high-level <a href="https://ror.org/blog/2023-02-27-parents-children-and-other-relationships-in-ror/">relationships are in the ROR record</a>, but in their implementations or in their analyses, not everybody will do that linking. Some do, though. To use OpenAlex as an example again, they relatively recently released in their API a thing they call <a href="https://docs.openalex.org/api-entities/institutions/institution-object#lineage">institution lineage</a>, where you can do a query on an organization to find any parent organizations it may have.</p>


<h3 id="hahahugoshortcode-s28-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>Oh that sounds very useful – I will look into that! Thanks!</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What else do you want to tell us about ROR, or about your work, or about anything at all?</p>


<h3 id="hahahugoshortcode-s30-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>There is one thing I&rsquo;d like to say, which is that the availability and openness of data like this is really important. Because if it is free and open and available, then it&rsquo;s easy to work with. As soon as you get datasets where there are even little barriers to using them, that&rsquo;s an issue. Something like authentication doesn&rsquo;t really hold you back, but you can be using some tool that requires authentication, and then maybe the authentication breaks one day, and suddenly you don&rsquo;t have your data anymore. <strong>Having nice, clean, big, open datasets like this is really valuable, especially for a problem like this one, a problem which does require taking a big step back and looking at all of the data together, and not having barriers in the way of getting parts of it.</strong></p>
<p>That&rsquo;s one of issues around, say, analysis of full-text data. You can get some of it, because it&rsquo;s open access. There&rsquo;s a dataset that <a href="https://www.semanticscholar.org/">Semantic Scholar</a> pulled together and made available and things like that, but if you want all of it? No, that&rsquo;s just not even remotely possible.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Even abstracts are difficult. Crossref has done a lot of <a href="https://www.crossref.org/blog/open-abstracts-where-are-we/">work on open abstracts with its Initiative for Open Abstracts</a>.</p>


<h3 id="hahahugoshortcode-s32-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>It&rsquo;s a good thing to do, because we do use abstracts in papermill detection. Actually, one point about that, which is maybe a bit tangential, is that every now and then when someone finds a dodgy paper, they retract it, and they change the title to put the word &ldquo;retracted&rdquo; in. For me, if I&rsquo;m trying to use that for training a machine learning model, that&rsquo;s bad: I have to take that out. &ldquo;Retracted&rdquo; has to go. But that&rsquo;s okay, I can do that. I can make a little rule to do that. But what happens sometimes is that they change the abstract. The abstract disappears, and then it gets replaced with something like &ldquo;The publisher has removed this paper.&rdquo; Now that&rsquo;s not useful data anymore. It&rsquo;s just gone. The publisher retracted this one, but we can&rsquo;t help them stop the next one getting retracted. [For more on this topic, see the <a href="https://www.niso.org/standards-committees/crec">NISO Working Group on the Communication of Retractions, Removals, and Expressions of Concern</a>. Comment is open on the working group&rsquo;s best practices draft through December 2, 2023.]</p>
<p>That&rsquo;s the thing I think is really important and valuable about ROR, is that being open and being available, it just makes so much stuff possible that wouldn&rsquo;t be otherwise. It’s also one of the things to say to anyone wanting to get into stuff like this. If you want to do data science on published research, just go for it. You can use the ROR API, you can use the Crossref API, OpenAlex, all this stuff. They&rsquo;re free. Go and do it. And that&rsquo;s a great way for people to learn. I learned tons of stuff by just trying things like that. That freedom is one of the great things that we get from having resources like this.</p>
<p>Then there’s serendipitous discovery. If people are able to get into it and just mess around, they won’t just find what they are looking for. They’ll find other stuff as well. And who knows what that might be.</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I&rsquo;m glad you said that. Having the data be open is a really crucial part of what we think is important about ROR.</p>


<h3 id="hahahugoshortcode-s34-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>It&rsquo;s like serendipitous discovery, right? If people are able to get into it and just mess around, eventually, you&rsquo;ll get people just going, &ldquo;Oh, look, you can do this.&rdquo; And they would never have known if they didn&rsquo;t have the freedom to play around with it, so it&rsquo;s really important.</p>


<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s great. Well, thank you for speaking with me.</p>


<h3 id="hahahugoshortcode-s36-hbhb-adam-day"><figure class="round-figure"><img src="/img/blog/clearskies/adam-day.jpg"
         alt="Avatar of Adam Day"/>
</figure>
 Adam Day 
</h3>
<p>Cool. Thanks. Bye.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Adam Day, CEO of Clear Skies Ltd., discusses how he works to improve research integrity with tools like Papermill Alarm, why such tools can assist but can't replace human investigators, and what's so important for data scientists about free and open identifers like ROR</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">How ROR and the Open Funder Registry Overlap: A Closer Look at the Data</title><id>https://doi.org/10.71938/ke93-w861</id><link rel="alternate" href="https://ror.org/blog/2023-10-12-ror-funder-registry-overlap/"/><published>2023-10-12T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Following on the <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">announcement that Crossref&rsquo;s Open Funder Registry will be merging with ROR after 2024</a>, we&rsquo;d like to do a deep dive into the specifics of the evidence that ROR is ready to take on the important work that the Open Funder Registry has been doing: identifying research funders in a clean, consistent, comprehensive, and interoperable way. The main thing you need to know is that ROR&rsquo;s data is up to the challenge. As of today, <strong>there is a corresponding ROR ID for over 94% of Funder ID assertions in both DataCite and Crossref DOI records</strong>.<sup id="fnref:1"><a href="#fn:1" class="footnote-ref" role="doc-noteref">1</a></sup></p>
<p>Crossref and Elsevier have done the entire scholarly communication ecosystem a great service not just in creating and maintaining the <a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry</a> (formerly FundRef), but also in promulgating its use by publishers and other systems. Thanks to the Open Funder Registry, it has never been easier to match funders to research outputs using open connection metadata, and the current moment is one in which linking funding to research outputs is of critical importance to ensure transparency and research integrity.</p>
<p>As more and more systems have begun using ROR for contributor affiliations, however, it has become clear that also using ROR for the all-important task of identifying funders provides many benefits:</p>
<ul>
<li>Streamlining processes by using one registry instead of two</li>
<li>Reducing difficulty with ROR&rsquo;s easy-to-use REST API, clear documentation, and web-based landing pages</li>
<li>Being able to see and participate in ROR&rsquo;s open, transparent metadata curation processes</li>
<li>Supporting community-led and community-governed infrastructure</li>
</ul>
<p>Current users of the Open Funder Registry can begin planning now to enjoy these benefits by switching to ROR for funder identification.</p>
<p>ROR has been working throughout 2022 and 2023 to ensure that ROR has excellent coverage of funders, and we intend to continue this work through 2024. Read on for a detailed account of the current overlap between the Open Funder Registry and ROR and to learn more about the <strong><a href="https://rorfunderoverlap.streamlit.app/">ROR / Funder Registry Overlap tool</a></strong> we&rsquo;ve built to help you conduct your own analyses.</p>


<h2 id="comparing-at-the-record-level">Comparing at the record level 
</h2>
<p><strong>Much of the information included in ROR and Open Funder Registry records is similar.</strong> Both ROR records and Open Funder Registry records include a globally unique identifier for the organization, a name for the organization, alternate names and acronyms for the organization, the location of the organization, the organization&rsquo;s type, and relationships to other organizations. If we examine the record for UK Research and Innovation (UKRI) in both the Open Funder Registry and ROR, for instance, we can see that the associated metadata is similar.</p>
<figure class="blog-figure"><img src="/img/blog/funder-overlap/funder-id-ukri-01.png"
         alt="JSON data from the Open Funder Registry for the funding organization UKRI"/><figcaption>
            <p>The Open Funder Registry record for UKRI is available as JSON at <a href="http://doi.org/10.13039/100014013">http://doi.org/10.13039/100014013</a>.</p>
        </figcaption>
</figure>

<figure class="blog-figure"><img src="/img/blog/funder-overlap/ror-ukri.png"
         alt="The ROR record for the funding organization UKRI"/><figcaption>
            <p>The ROR record for UKRI is available on the web at <a href="https://ror.org/001aqnf71">https://ror.org/001aqnf71</a> and in the ROR API as JSON at <a href="https://api.ror.org/organizations/001aqnf71">https://api.ror.org/organizations/001aqnf71</a>.</p>
        </figcaption>
</figure>

<p>Note that the ROR record includes a reference to the corresponding Funder ID, and note that both the ROR record and the Open Funder Registry record list the same nine &ldquo;child&rdquo; organizations for UKRI (e.g., Research England). Each of those subsidiary organizations has its own identifier in both registries.</p>
<figure class="blog-figure"><img src="/img/blog/funder-overlap/funder-id-ukri-02.png"
         alt="JSON record for the funding organization UKRI from the Crossref API funders endpoint"/><figcaption>
            <p>The metadata from the Crossref API funders endpoint at <a href="https://api.crossref.org/funders/100014013">https://api.crossref.org/funders/100014013</a> shows the names of UKRI&rsquo;s nine child organizations. The funders endpoint of the Crossref API uses the Open Funder Registry to connect funders with the works they have funded. The web-based Open Funder Registry Search at <a href="https://search.crossref.org/funding">https://search.crossref.org/funding</a> uses the funders endpoint.</p>
        </figcaption>
</figure>

<p>One difference between the metadata in the Open Funder Registry and the metadata in ROR concerns organization type. The Open Funder Registry <a href="https://www.crossref.org/documentation/funder-registry/accessing-the-funder-registry/">gives each funder a type</a> of <code>government</code> or <code>private</code> with associated subtypes for each. Government funder subtypes include <code>federal</code> and <code>non-federal,</code> while private funder subtypes include <code>academic</code>, <code>corporate</code>, <code>foundation</code>, <code>international</code>, <code>professional associations and societies</code>, and <code>other non-profit.</code> ROR also <a href="https://ror.readme.io/docs/ror-data-structure#types">supports organization types</a> and currently allows eight options: <code>Archive</code>, <code>Company</code>, <code>Education</code>, <code>Facility</code>, <code>Government</code>, <code>Healthcare</code>, <code>Nonprofit</code>, and <code>Other</code>.</p>
<p>In <a href="https://ror.readme.io/docs/schema-v2">version 2.0 of the ROR metadata schema, dataset, and API</a>, which is due to be released early in 2024, ROR will <a href="https://github.com/ror-community/ror-roadmap/issues/159">add support for a &ldquo;funder&rdquo; organization type</a> and will add this type to all ROR records that include a Funder ID as an addition to the existing organization type value. This means that early in 2024, the version 2.0 ROR record for UKRI will include organization types of both <code>government</code> and <code>funder</code>. In the meantime, as we explain in <a href="https://ror.readme.io/docs/api-advanced-query">our documentation of the ROR API advanced query parameter</a>, the list of all active ROR records that currently include a Funder ID can be retrieved using the following query:</p>
<pre tabindex="0"><code>https://api.ror.org/organizations?query.advanced=_exists_:external_ids.FundRef.all
</code></pre><p>Version 2.0 of the ROR schema and API is <a href="https://ror.readme.io/docs/ror-schema-api-v2-beta">open for beta testing through October 16th, 2023</a> – current users of the Open Funder Registry are more than welcome to participate and to give us their feedback.</p>
<p>At the record level, then, ROR and the Open Funder Registry are far more similar than they are different. Let&rsquo;s look now at the general overlap between the two registries to learn how well-suited ROR data is for the purpose of identifying funders.</p>


<h2 id="comparing-at-the-registry-level">Comparing at the registry level 
</h2>
<p><strong>Most of the Funder Registry records that see the most use already have equivalent records in ROR</strong>, which is good news for users of the Funder Registry. Throughout 2022 and into 2023, the ROR team has been <a href="https://github.com/ror-community/ror-roadmap/issues/131">working diligently to reconcile the Open Funder Registry with ROR</a>, and this reconciliation work has resulted in the addition of 631 new ROR records identifying funders. Moreover, we have modified 6,080 ROR records to add Funder ID mappings and to ensure that these records include key metadata from the Open Funder Registry.</p>
<p>The project to reconcile ROR with the Open Funder Registry has been focused on Funder IDs that have existing uses, or &ldquo;assertions,&rdquo; in Crossref records, and the work has been prioritized on the basis of the total number of assertions. The Funder Registry includes identifiers for about 35,000 funding organizations, and of these, nearly 33,000 are used at least once in Crossref metadata. More than 8 million content items registered with Crossref include Funder IDs,<sup id="fnref:2"><a href="#fn:2" class="footnote-ref" role="doc-noteref">2</a></sup> and since many of those items cite more than one funder, there are over 13 million total funding assertions in Crossref metadata.<sup id="fnref:3"><a href="#fn:3" class="footnote-ref" role="doc-noteref">3</a></sup></p>
<p>To analyze the overlap between ROR and the Funder Registry, we have built and made publicly available the <a href="https://rorfunderoverlap.streamlit.app/">ROR / Funder Registry Overlap tool</a>. This tool shows that <strong>after ROR&rsquo;s reconciliation efforts to date, 94.1% of the existing Funder ID assertions in Crossref metadata can be matched to ROR IDs.</strong> We expect this percentage to increase as the reconciliation work continues.</p>
<figure class="blog-figure"><img src="/img/blog/funder-overlap/crossref-funder-overlap.png"
         alt="Two pie charts, with the one on the left showing that 54.6% of organizations in the Open Funder Registry are available in ROR and the one on the right showing that 94.1% of Funder ID uses in Crossref records have a corresponding ROR ID"/><figcaption>
            <p>Crossref - Aggregate ROR/Funder Registry OverlapTool view from the <a href="https://rorfunderoverlap.streamlit.app">ROR / Funder Registry Overlap tool</a>.</p>
        </figcaption>
</figure>

<p>The pie chart on the left shows that of the 35,000+ funders in the Funder Registry, more than 19,000, or a small majority of 54.6%, are represented in ROR, usually on a one-to-one basis. However, about 2,000 Funder IDs in the Funder Registry are never used in Crossref metadata, and many of the remaining 33,000 are used only once. Individual funders who issue many awards can account for a high number of funding assertions, and ROR has therefore focused its Open Funder Registry reconciliation work on these large funders. For example, the United States National Science Foundation currently accounts for over 332,000 funding assertions by itself, and with its many associated laboratories and agencies, it accounts for over 412,000 funding assertions in works registered with Crossref.<sup id="fnref:4"><a href="#fn:4" class="footnote-ref" role="doc-noteref">4</a></sup> The pie chart on the right shows that <strong>of the more than 13 million uses of Funder IDs in Crossref metadata, nearly 12.5 million have a corresponding ROR ID.</strong></p>
<p>The total overlap is even higher for DataCite, with <strong>96.7% of funding assertions in DataCite having corresponding ROR IDs</strong>, as shown below.</p>
<figure class="blog-figure"><img src="/img/blog/funder-overlap/datacite-funder-overlap.png"
         alt="Two pie charts, with the one on the left showing that 54.6% of organizations in the Open Funder Registry are available in ROR and the one on the right showing that 96.7% of Funder ID uses in DataCite records have a corresponding ROR ID"/><figcaption>
            <p>DataCite - Aggregate ROR/Funder Registry OverlapTool view from the <a href="https://rorfunderoverlap.streamlit.app">ROR / Funder Registry Overlap tool</a>.</p>
        </figcaption>
</figure>

<p>Just as in the previous view, the pie chart on the left shows that a majority of Funder IDs have an equivalent ROR ID, while the pie chart on the right shows that the overwhelming majority of DataCite funding assertions can be matched to a ROR ID, with <strong>available ROR IDs for 941,049 of the 973,072 funding assertions in DataCite records.</strong></p>
<p>In addition, while the preceding chart shows the number of funder assertions in DataCite records that <em>could</em> be matched to a ROR ID, the data on the actual number of funder assertions in DataCite records that <em>currently use</em> a ROR ID is also impressive. About 1.5 million DataCite records cite a funder in the <code>fundingReferences</code> element, and over 1.3 million of those also include connection metadata in the form of an identifier for that funder. As of this writing, the largest proportion of those 1.3 million funder identifiers are Funder IDs at 883,098 uses. ROR, however, is the next most commonly used identifier at 357,568 uses.</p>
<div class='centered'><figure><img src="/img/blog/funder-overlap/datacite-funder-ids-chart.png"
         alt="ROR DOI stats, available on the ROR website. See also the discussion of these statistics during the September ROR Community Call."/><figcaption>
            <p>ROR DOI stats, available <a href="https://ror.org/about/impact">on the ROR website</a>. See also the discussion of these statistics <a href="https://ror.org/events/2023-09-28-ror-community-call">during the September ROR Community Call</a>.</p>
        </figcaption>
</figure>
</div>
<p><strong>Clearly, then, ROR is ready to take on the work that the Open Funder Registry is currently doing with regard to funding data in Crossref and DataCite records.</strong></p>
<p>ROR&rsquo;s curation team plans to continue adding entities listed in the Open Funder Registry to ROR based on priority and usage. You can help us determine what needs to be done by analyzing your own funder data with the <a href="https://rorfunderoverlap.streamlit.app/">ROR / Funder Registry Overlap tool</a> that we&rsquo;ve made available to help you in this transition.</p>


<h2 id="analyzing-your-own-funder-data">Analyzing your own funder data 
</h2>
<p>If you are a funder or if you work with Funder IDs, you can use <a href="https://rorfunderoverlap.streamlit.app/">the ROR / Funder Registry Overlap tool</a> to look up any funder in the Open Funder Registry and find its equivalent ROR record. If the information in the ROR record needs a correction, or if there is no matching ROR record, you can request that ROR make the change from within the tool.</p>
<figure><img src="/img/blog/funder-overlap/funder-mapping-lookup.gif"
         alt="GIF showing how to look up the Arcadia Fund in the ROR / Funder Registry Overlap tool"/><figcaption>
            <p>How to look up a funder such as the Arcadia Fund in the <a href="https://rorfunderoverlap.streamlit.app">ROR / Funder Registry Overlap tool</a>.</p>
        </figcaption>
</figure>

<p>You can also look up how many of a particular Crossref member&rsquo;s funding assertions can be mapped to ROR IDs by choosing &ldquo;Crossref - Overlap by Member&rdquo; from the views on the left side of the screen.</p>
<figure><img src="/img/blog/funder-overlap/crossref-member-lookup.gif"
         alt="GIF showing how to look up the Crossref member Royal Society of Chemistry in the ROR / Funder Registry Overlap tool"/><figcaption>
            <p>How to look up the funding assertions of a Crossref member such as the Royal Society of Chemistry in the <a href="https://rorfunderoverlap.streamlit.app">ROR / Funder Registry Overlap tool</a>.</p>
        </figcaption>
</figure>

<p>For instance, Crossref member the <a href="https://rsc.org">Royal Society of Chemistry (RSC)</a> has as of this writing used Funder IDs over 478,000 times in Crossref metadata, and nearly 93% of these assertions can be matched to a ROR ID. Of the 9,000+ funders that RSC publications currently acknowledge in Crossref metadata, nearly 73% have corresponding ROR IDs.</p>
<p>Remember, too, that the project of reconciling ROR to the Funder Registry has to date focused on making sure that frequently-asserted funders are accurately and fully represented in ROR rather than on trying to create a new ROR record for every single Funder Registry record. The work to reconcile the Funder Registry to ROR is ongoing, and over time we fully expect even more Funder IDs to have matching ROR IDs.</p>
<p>We hope you find the ROR / Funder Registry Overlap tool useful in assessing funding data. We have made the <a href="https://github.com/ror-community/ror_funder_registry_overlap">source code available on GitHub</a> for those who wish to examine or clone it.</p>


<h2 id="get-ready-and-get-in-touch">Get ready and get in touch 
</h2>
<p><strong>If you&rsquo;re already a user of the Open Funder Registry, you should start planning now to switch to ROR</strong> – we&rsquo;re happy to work with you to make your transition to ROR easier. If you haven&rsquo;t yet adapted your systems to capture funding data in a structured way that uses persistent identifiers, we&rsquo;re also happy to work with you to help you integrate ROR. We all need clean, consistent, open, and interoperable information about all kinds of research organizations, including funders.</p>
<p>In the coming months, Crossref will be publishing at least one blog post detailing the technical work they are doing to build comprehensive support for ROR into Crossref’s services, and Crossref and ROR will continue to consult with key users of the Open Funder Registry to learn what they want and need.</p>
<p>Meanwhile, if you have questions or comments about ROR and the Open Funder Registry or if you&rsquo;d like to contribute to our ongoing work to improve information about funders in ROR, <strong>get in touch</strong>! You can <a href="https://calendly.com/ror-chat">book a meeting</a> with ROR Technical Community Manager Amanda French or write to her at <a href="mailto:support@ror.org">support@ror.org</a>. We look forward to hearing from you as all we prepare for these exciting changes together.</p>
<hr>
<div class='callout grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Contact <a href="mailto:support@ror.org">support@ror.org</a> with any and all questions.</span></div>
	</div>
</div>
<div class="footnotes" role="doc-endnotes">
<hr>
<ol>
<li id="fn:1">
<p>Please note that all figures, percentages, and statistics in this blog post are current as of October 12, 2023 but might change when updates are made to the Open Funder Registry, ROR, and funding assertions in Crossref and DataCite records. Similarly, the <a href="https://rorfunderoverlap.streamlit.app/">ROR / Funder Registry Overlap tool</a> might produce different results when new data is ingested. [Note added 11/16/2023.]&#160;<a href="#fnref:1" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:2">
<p><a href="http://api.crossref.org/works?filter=has-funder-doi:t">Crossref API works with Funder IDs</a>&#160;<a href="#fnref:2" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:3">
<p>The number of Funder ID assertions in Crossref is derived from a sum of the results for the query <a href="https://api.crossref.org/funders/%7Bfunder_id%7D%7D/works">https://api.crossref.org/funders/{funder_id}}/works</a> for every Funder ID, e.g., <a href="https://api.crossref.org/funders/100000001/works">https://api.crossref.org/funders/100000001/works</a>&#160;<a href="#fnref:3" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:4">
<p><a href="http://api.crossref.org/funders/100000001">Crossref API Funder Registry &ldquo;work-count&rdquo; and &ldquo;descendant-work-count&rdquo; for the U.S. National Science Foundation</a>&#160;<a href="#fnref:4" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
</ol>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is ready to take on the important work that the Open Funder Registry has been doing: identifying research funders in a clean, consistent, comprehensive, and interoperable way. This post compares both registries and provides data showing that most of the Open Funder Registry records that see the most use already have equivalent records in ROR.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Help Us Test v2 of the ROR API!</title><id>https://doi.org/10.71938/5f1f-cq44</id><link rel="alternate" href="https://ror.org/blog/2023-09-14-beta-test/"/><published>2023-09-14T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Liz Krznarich</name><uri>https://orcid.org/0000-0001-6622-4910</uri></author><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>After nearly a year of planning and community input, we are thrilled to release a beta version of ROR&rsquo;s first major schema and API update, which is open to the public for testing through <strong>October 16, 2023.</strong> Please visit our <a href="https://ror.readme.io/docs/ror-schema-api-v2-beta">v2 beta test documentation</a> for detailed information on what&rsquo;s new and how to participate in the beta test.</span></div>
	</div>
</div>
<p>Throughout 2022 and into 2023, ROR <a href="https://ror.readme.io/docs/feedback-docs">gathered community feedback</a> about how to handle schema and API versioning and about what should be included in a new version of ROR&rsquo;s metadata schema. Today, <strong>we are thrilled to release a beta version of ROR&rsquo;s first major schema and API update.</strong> Version 2 of the ROR API and metadata schema is open to the public for testing through <strong>October 16, 2023.</strong> As a community-driven initiative, ROR relies on input from its stakeholders to ensure that new and updated features serve the needs of its wide array of users and use cases.</p>
<p>We plan to incorporate changes from this beta test this fall and to make API v2 available in production in early 2024. The current version, version 1, will remain the default version until at least early 2025.</p>
<p>A full account of the changes and detailed instructions for participating in the beta test are <a href="https://ror.readme.io/docs/ror-schema-api-v2-beta">available in our v2 beta test documentation</a>, but we provide some highlights below.</p>


<h2 id="whats-new-in-v2-of-the-ror-metadata-schema">What&rsquo;s new in v2 of the ROR metadata schema 
</h2>
<ul>
<li>
<p><strong>Name information</strong> previously in <code>name</code>, <code>acronyms</code>, <code>aliases</code>, and <code>labels</code> fields is now contained in a single parent field <code>names</code> with subfields <code>lang</code>, <code>value</code> and <code>types</code>.</p>
</li>
<li>
<p><strong>Location information</strong> previously in <code>addresses</code> field is now in <code>locations</code> field with subfields <code>geonames_id</code> and <code>geoneames_details</code>. Many fields containing very granular information derived from Geonames have been removed, as this information is available directly from <a href="https://www.geonames.org/">Geonames</a>.</p>
</li>
<li>
<p><strong>Website/domain information</strong> previously in <code>links</code> and <code>wikipedia_url</code> have been combined into a single parent field <code>links</code> with subfields <code>type</code> and <code>value</code>. The <code>ip_addresses</code> field has been removed, and a new  <code>domains</code> field has been added.</p>
</li>
<li>
<p><strong>External identifiers information</strong> has been restructured within the existing <code>external_ids</code> field. Each item in external_ids now has subfields <code>type</code>, <code>all</code> and <code>preferred</code>.</p>
</li>
<li>
<p><strong>Administrative information</strong> was not included previously. A new parent field <code>admin</code> has been added, which contains subfields <code>created</code> and <code>last_modified</code>. Each of those subfields contains additional subfields <code>date</code> and <code>schema_version</code>.</p>
</li>
<li>
<p><strong>Controlled lists</strong> previously had variations in casing. For example, values in the <code>types</code> and <code>relationships.type</code> fields began with an uppercase character, while values in <code>status</code> were lowercase and external ID types contained a variety of casings. In v2, allowed values in controlled lists are consistently lowercase, with the exception of country codes derived from <a href="https://www.iso.org/iso-3166-country-codes.html">ISO-8166</a>, which are uppercase per the standard.</p>
</li>
</ul>


<h2 id="whats-new-in-v2-of-the-ror-api">What&rsquo;s new in v2 of the ROR API 
</h2>
<p>Version 2 of the ROR REST API includes all the same search and retrieval functionality as the <a href="https://ror.readme.io/docs/rest-api">current version (v1) of the ROR REST API</a>, but with ROR records in responses formatted according to the v2 schema. For all ROR API requests, the API version is specified in the path portion of the request, as in the following example:</p>
<p><code>curl https://api.dev.ror.org/v2/organizations</code></p>


<h2 id="sample-v2-ror-data">Sample v2 ROR data 
</h2>
<p>A sample data dump containing both v1 and v2 files is available in the locations below:</p>
<ul>
<li>Zenodo sandbox: <a href="https://sandbox.zenodo.org/record/1241263">https://sandbox.zenodo.org/record/1241263</a></li>
<li>GitHub: <a href="https://github.com/ror-community/ror-data-test/blob/main/v1.32-2023-09-14-ror-data.zip">https://github.com/ror-community/ror-data-test/blob/main/v1.32-2023-09-14-ror-data.zip</a></li>
</ul>
<p><strong>These files are for review and testing purposes only and are not official ROR release files.</strong> Note too that there are several new fields/subfields in v2, and the dataset used in the beta has not been fully updated with values in all new fields/subfields. In particular:</p>
<ul>
<li>Created/last modified dates HAVE been added to all records in the new <code>admin</code> field, using actual dates from GRID and ROR data releases.</li>
<li>Domains HAVE NOT been added. The new <code>domains</code> field is currently an empty list for all records.</li>
<li>Language codes for items in the new <code>names</code> field are only included for names inherited from the <code>labels</code> field in the current schema. Language codes HAVE NOT been added for names inherited from the <code>name</code> and <code>aliases</code> fields in the current schema.</li>
</ul>
<p>We plan to add data to empty fields in v2 over the coming months, carefully curating the records to ensure accuracy.</p>


<h2 id="how-can-i-provide-feedback-on-the-v2-beta">How can I provide feedback on the v2 beta? 
</h2>
<ol>
<li>
<p>Read the <a href="https://ror.readme.io/docs/ror-schema-api-v2-beta">complete beta documentation and testing instructions</a>.</p>
</li>
<li>
<p>Check the <a href="https://github.com/orgs/ror-community/projects/12/views/1">ROR schema &amp; API v2 beta project board</a> to see if the same or a closely related issue has already been reported. If so, please add comments to that issue.</p>
</li>
<li>
<p>If you have new feedback that’s not related to another issue on the ROR schema &amp; API v2 beta project board, submit a GitHub issue using the <a href="https://github.com/ror-community/ror-roadmap/issues/new?assignees=lizkrznarich&amp;labels=api+v2+beta&amp;projects=&amp;template=ror-schema---api-v2-beta-feedback.md&amp;title=%5BV2+BETA%5D">ROR schema &amp; API v2 beta feedback template</a>.</p>
</li>
</ol>
<p>All issues will be reviewed by ROR staff, and we may contact you using your GitHub handle to request additional information. If you are not a GitHub user or prefer to submit your feedback privately, please email <a href="mailto:support@ror.org">support@ror.org</a> . <strong>Please provide your feedback by October 16, 2023.</strong></p>
<div class='callout mustard'>
	<div class="row">
		<div class="col-md-12"><span><strong><a href="https://ror.readme.io/docs/ror-schema-api-v2-beta">ROR schema and API v2 beta - complete documentation and instructions</a></strong></span></div>
	</div>
</div>
<p>We are grateful for your help! Following the close of the beta test, we will review all reports and work to incorporate the improvements before we release v2 into production.</p>
<p>Thanks, as always, to everyone who works to make ROR the best it can be.</p>
<div class='callout grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Questions? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">After nearly a year of planning and community input, we are thrilled to release a beta version of ROR's first major schema and API update, which is open to the public for testing through October 16, 2023.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: OpenAlex, ROR, and Machine Learning Models</title><id>https://doi.org/10.71938/5q2g-gp90</id><link rel="alternate" href="https://ror.org/blog/2023-09-13-openalex-case-study/"/><published>2023-09-13T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Justin Barrett, Lead Machine Learning Engineer for OpenAlex at OurResearch, talks with ROR Technical Community Manager Amanda French and ROR Curation Lead Adam Buttrick about using ROR both as an identifier for institutions in OpenAlex and as a dataset for training machine learning models that enrich OpenAlex metadata.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;When we started to make the second version of the institution parsing model, I got a lot deeper into ROR, and that&rsquo;s when I started to use a lot more of the data, like the aliases and the acronyms and the relationships. ROR became a lot more central to our model, because I was using it so much, and then also because we made the decision to not include non-ROR institutions in our database. We switched to completely ROR institutions.&rdquo;</p>
<p>&ldquo;ROR is playing a huge part in the author disambiguation as well, because we&rsquo;re using institutions to determine if authors are the same. Unfortunately, you can&rsquo;t only use institutions to decide that. You can&rsquo;t only use co-authors, you can&rsquo;t only use topics, so we&rsquo;re using a mixture of all of those things. We&rsquo;re getting that institution from the author&rsquo;s affiliation string and parsing it, and that&rsquo;s going directly into our author disambiguation model to try and match that institution. So ROR, again, playing a vital part in our system.&rdquo;</p>
<p>&ldquo;The artificial strings have made it a lot more robust of a model, and we wouldn&rsquo;t have been able to do that without the ROR data, because we would have just had to use existing affiliation strings.&rdquo;</p>
<p>&ndash; Justin Barrett, Machine Learning Engineer, OurResearch</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thanks so much for talking to Adam and me, Justin. Can you tell us your name, title and organization?</p>


<h3 id="hahahugoshortcode-s2-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>My name is Justin Barrett. I am the lead Machine Learning Engineer for <a href="https://openalex.org">OpenAlex</a>, which is a part of the <a href="https://ourresearch.org">OurResearch</a> group. OpenAlex has been around since 2021, and I joined pretty early on, so I&rsquo;ve been working for them for a little under two years now.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about OurResearch, and tell us about OpenAlex.</p>


<h3 id="hahahugoshortcode-s4-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>The overall goal of OurResearch is to increase open access, open data, and open science for everyone. OurResearch has several different projects to promote that. There&rsquo;s <a href="https://unpaywall.org">Unpaywall</a>, which tries to get Open Access articles to end users. There&rsquo;s <a href="https://profiles.impactstory.org">Impact Story</a>, which I believe is the first project that our CEO Jason Priem created with Heather Piwowar, who used to work at OurResearch and was a co-founder. There are a lot of projects, and all of them try to make open science a reality for more and more people. And that&rsquo;s basically what OpenAlex is continuing to do. We have established a large database of research that is completely open for everyone. If they want to, they can download a snapshot of our entire database and use that any way that they want to &ndash; to create recommendations, to create their own website, to create their own analytics. It&rsquo;s really just empowering people to do that.</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. And can you tell us a little bit more about the origin of OpenAlex?</p>


<h3 id="hahahugoshortcode-s6-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Microsoft built and maintained the <a href="https://www.microsoft.com/en-us/research/project/microsoft-academic-graph/">Microsoft Academic Graph</a>, MAG, and then decided to sunset that whole system. And at that point, Jason and others in OurResearch decided that it was a good time to try and take over what MAG was doing, because they were serving a pretty important function for a lot of people. They knew that there were a lot of people that were going to be missing out with MAG not updating their database anymore. OpenAlex was created to jump on that opportunity in order to continue to provide researchers, academics, with access to the data in MAG coming in through <a href="https://www.crossref.org">Crossref</a>, coming in through <a href="https://pubmed.ncbi.nlm.nih.gov/">PubMed</a>, coming in through a lot of different data sources. OpenAlex is just trying to fill that gap.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And were you party to any of the discussions with Microsoft about that?</p>


<h3 id="hahahugoshortcode-s8-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>I was not. I came on after the decision was made to carry on MAG&rsquo;s work. Jason contacted me because there were models that MAG already had within their system that Jason needed to have replicated at the very least and then eventually improved upon. The first project that I worked on was creating a <a href="https://github.com/ourresearch/openalex-concept-tagging">concept tagger</a> for papers, a topic tagger based on the title of the paper and on the abstract if we had it. That project was the first where we were trying to bridge the gap between just getting the data in and enriching it with some of the features that MAG was enriching all of their data with.</p>
<p>Concepts was where it started, and institution parsing was next. There have since been a couple of different iterations for both of those models. Then it was author disambiguation, and that&rsquo;s actually the current project that I&rsquo;m working on, to completely revamp that system so that it&rsquo;s improved. We&rsquo;re hoping to release that this week or next week. [Read the <a href="https://groups.google.com/g/openalex-users/c/jzlh1Mp_s-g">release announcement for OpenAlex Authors</a>.]</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Can I ask you about your background and why Jason came to you when he needed somebody to work on these models?</p>


<h3 id="hahahugoshortcode-s10-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>That&rsquo;s a good question. I guess it comes down to just luck. He hired me through <a href="https://www.upwork.com/">Upwork</a>, which is a pretty popular platform for hiring contractors for short-term projects. I went to school for aerospace engineering, and I got my master&rsquo;s in mechanical engineering. I have a lot of coding experience from my years in engineering, and during my master&rsquo;s I got into Python a lot more. When I went through my master&rsquo;s, I figured out that I didn&rsquo;t want to do mechanical engineering anymore, and I did a lot of self-learning through <a href="https://www.coursera.org/">Coursera</a> and <a href="https://www.udemy.com/">Udemy</a> and basically made that jump over to data science.</p>
<p>I pretty quickly started specializing in <a href="https://en.wikipedia.org/wiki/Deep_learning">deep learning</a> applications. <a href="https://en.wikipedia.org/wiki/Natural_language_processing">Natural language processing, NLP,</a> and deep learning are what I like to do and what I&rsquo;m pretty competent in. Jason found me when I was about a couple of years into Upwork. I also had a job at Advance Auto Parts as a data scientist, so I had been doing a lot of AI and machine learning work. And I guess when I applied for the job on Upwork, Jason liked my experience. Then we met, and it all went from there.</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>When and where did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s12-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>When I was creating my first institution parsing model here at OpenAlex. That was my first time hearing about it, and so I wasn&rsquo;t aware of how much data was available in ROR. My CEO, Jason Priem, told me that ROR would be a good source of information for institutions.</p>
<p>At that time, for our first model, we inherited all of our institutions from MAG, and so we had non-ROR institutions included in there. I wasn&rsquo;t as eager to go to ROR completely, because I had enough institutions where I thought, &ldquo;All right, I need to not depend on ROR, because a lot these institutions are not included in ROR.&rdquo; And so I used ROR for gathering additional information about the ROR institutions, like a city and a state &ndash; basically, I used it to supplement all the data that I couldn&rsquo;t find within MAG. City, state, and country were the main things that I used for the first version.</p>
<p><strong>When we started to make the second version of the institution parsing model, I got a lot deeper into ROR, and that&rsquo;s when I started to use a lot more of the data, like the aliases and the acronyms and the relationships. ROR became a lot more central to our model, because I was using it so much, and then also because we made the decision to not include non-ROR institutions in our database. We switched to completely ROR institutions.</strong> That is something that we could possibly change in the future, if we maybe generate a list of institutions that we want to also include in our database, but we made that decision to kind of streamline all of our data, because it&rsquo;s a lot easier to keep track. But that&rsquo;s basically how I started hearing about ROR and started using that data.</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Were there institutional identifiers in MAG that you inherited?</p>


<h3 id="hahahugoshortcode-s14-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>MAG had an institution identifier that was an &ldquo;I&rdquo; followed by whatever number MAG had assigned to that institution, so those were the IDs that I used. We had a mapping from those institution IDs to a ROR ID if there was one available. Those were the institution IDs that I used at first, and now that we&rsquo;re using ROR only, there&rsquo;s basically a one-to-one mapping of ROR IDs to OpenAlex&rsquo;s Institution IDs.</p>


<h3 id="hahahugoshortcode-s15-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>Can you talk a little bit more about this streamlining? Why did you pivot to narrowing the <a href="https://docs.openalex.org/api-entities/institutions">OpenAlex institution model</a> to use only ROR IDs?</p>


<h3 id="hahahugoshortcode-s16-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>We did a little bit of analysis at first to see &ldquo;Okay, what&rsquo;s the distribution in OpenAlex as far as the number of papers for ROR institutions versus non-ROR institutions?&rdquo; Through that analysis, we found that about 97% of institution assignments were ROR institutions.</p>
<p>In addition to that, a lot of our non-ROR institutions were just garbage. They weren&rsquo;t linking to any <a href="https://www.wikidata.org/wiki/Wikidata:Main_Page">Wikidata</a> page, they didn&rsquo;t have any information about location, and some of them were duplicates of ROR entities. The more we looked into it, the more we wanted to just start with ROR, and then maybe in the future, we could start adding institutions on top of ROR.</p>
<p>So we just made that decision, and it made my job a lot easier when it came to assigning institutions. It cleaned up the training data, it allowed me to try out different things with the training data, and it just made things a lot easier.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Have we ever talked with you about adding OpenAlex institutions to ROR that don&rsquo;t currently have a ROR ID?</p>


<h3 id="hahahugoshortcode-s18-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I don&rsquo;t know if we talked to Justin about it, but I have done it, because I used to look at OpenAlex a lot and say &ldquo;What isn&rsquo;t in here? What looks valid?&rdquo; My conclusion in doing that was very similar to Justin&rsquo;s, that there&rsquo;s a bunch of things that are essentially duplicates of existing organizations, and there were some defunct organizations, like organizations from the Soviet Union that don&rsquo;t exist anymore, that kind of stuff.</p>


<h3 id="hahahugoshortcode-s19-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>There was also a separate process that happened when we converted to ROR-only where we looked at some of those top institutions that were considered non-ROR, the ones that had the most papers associated with them. We made sure that they were not supposed to link to an actual ROR ID.</p>
<p>We ended up merging a lot of our institution IDs to make sure that we weren&rsquo;t dropping too many, and we found a large percentage of them that should have linked to a ROR and didn&rsquo;t or that had a separate ROR for that entity. That was another member of our team who took care of that process. At the end of the day, we currently only have ROR institutions that are getting tagged for our affiliation strings.</p>


<h3 id="hahahugoshortcode-s20-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>Just to give the people who will be reading this interview some perspective, what&rsquo;s the scale of works in OpenAlex that are being indexed with ROR IDs?</p>


<h3 id="hahahugoshortcode-s21-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>I think OpenAlex has around 243 million works as of right now. Of course, not all of those have institution strings or affiliation strings that come with them. The last time I checked, about 80% of the new data we have coming in has an affiliation string. When you go back to some of the old data in MAG, papers from the 1960s and 1970s, a lot of those are missing institutional strings, but for our more current data, I would say about 80% at least have an affiliation string with them, so they are getting matched to a ROR ID.</p>


<h3 id="hahahugoshortcode-s22-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>That&rsquo;s great. But it also kind of highlights the need for publisher partners to contribute those, to impose that little bit of order onto the chaos of it and not let the metadata wither in the same way that the historic data has done.</p>


<h3 id="hahahugoshortcode-s23-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Yeah, it would be nice if it came in with a ROR ID from the publisher.</p>


<h3 id="hahahugoshortcode-s24-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>We&rsquo;re working on that!</p>


<h3 id="hahahugoshortcode-s25-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I&rsquo;d like to hear a bit more about how the matching that you do works and how you get new ROR data.</p>


<h3 id="hahahugoshortcode-s26-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>We get the ROR data a couple of different ways. When I was creating this model, I used the most recent <a href="https://ror.readme.io/docs/data-dump">data dump</a> to get an updated snapshot of all the ROR data, and then that&rsquo;s what I used to perform all the string matching. Currently, that snapshot is what the model is using in order to match those strings.</p>
<p>In the future, we are going to be doing that slightly differently, because we will be implementing some processes to use the <a href="https://ror.readme.io/docs/rest-api">ROR API</a>. I believe we&rsquo;re using the ROR API to update some institution data in our database with ROR data, but that is not propagating through our whole system as of right now. I didn&rsquo;t work on that project, that was one of our data engineers. But I believe at some point in the future, we will ping the ROR API once a day to check to see if any of the data has changed for specific institutions, and if there&rsquo;s a change, we will take that data and put it into our system.</p>
<p>When I get done with the author disambiguation project, institutions are something that I&rsquo;m about to start looking at again, and because we are now populating our own database with ROR data, that gives me the opportunity to constantly update the strings that it&rsquo;s matching on. The process of matching those strings will be a little more streamlined to current data, as opposed to right now where it&rsquo;s just a snapshot from the past, which probably works most of the time, but which doesn&rsquo;t give me new data that you guys are providing, and which doesn&rsquo;t correct any data that might be wrong from that snapshot. I&rsquo;m looking forward to getting back into institutions and making it more of a sustainable pipeline as opposed to what it is right now, where it&rsquo;s kind of a point in time model.</p>


<h3 id="hahahugoshortcode-s27-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I have a related question. Did you work on the <a href="https://groups.google.com/g/openalex-users/c/NTTXy0315vg/m/Hg9XSgb5AQAJ">new funder model</a> that has been integrated? I saw that the OpenAlex <a href="https://docs.openalex.org/api-entities/funders">funder model</a> also integrated ROR, so I wondered if the ROR data was being pulled in in the same way.</p>


<h3 id="hahahugoshortcode-s28-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>I did not work on that, so I could not speak to the funders. But I know that was a big project that we were pretty excited about releasing.</p>


<h3 id="hahahugoshortcode-s29-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>It was very forward thinking to include ROR ID mappings, because over the long term, <a href="https://www.crossref.org/blog/open-funder-registry-to-transition-into-research-organization-registry-ror/">the Funder Registry and ROR are intended to merge</a>. It&rsquo;s a long ways out, but you&rsquo;ll be ahead of the game, because you already have a Funder ID to ROR ID mapping in the Funder object, so you can just privilege one ID or the other.</p>


<h3 id="hahahugoshortcode-s30-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>I&rsquo;m sure it won&rsquo;t be that easy at the time, but we&rsquo;ll have a head start. I wish it was always as easy as a one-to-one mapping from one thing to another.</p>
<p>We have the alpha version of our <a href="https://alpha.openalex.org/">user interface</a> up right now, which is very rough, but it gives you kind of a preview of what we will probably have within the next month or two. It&rsquo;ll be really nice: you&rsquo;ll be able to look and see funders and institutions and all that information pretty easily. Before, you could look at that information, but it wasn&rsquo;t really user friendly, so we&rsquo;re pretty excited about the UI that&rsquo;s coming.</p>


<h3 id="hahahugoshortcode-s31-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>Do you know if you&rsquo;ll leverage things like <a href="https://ror.readme.io/docs/relationships">ROR hierarchies and relationships</a> in the UI such that the works of child organizations can be associated with parent organizations? Some people have done that, others have not, so I wondered what OpenAlex&rsquo;s approach would be.</p>


<h3 id="hahahugoshortcode-s32-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>At first, no. At first, it&rsquo;ll probably just be a mapping to each institution&rsquo;s own ROR ID in the alpha version of the UI. Once we start getting into subsequent versions, I think that we&rsquo;ll start looking to link all of the data a little bit better, so yes, at that point we&rsquo;d probably add ROR institutions with the parents and children and all those relationships. However, we have recently started leveraging some of the ROR relationships and hierarchies in our API. It is a separate <a href="https://docs.openalex.org/api-entities/institutions/institution-object#lineage">feature called ‘lineage’</a> which links the children of an institution with the parent only if a parent was tagged using our institution parser. Eventually this feature will make it into our UI but for now it is only accessible through our API. Concepts also have a defined hierarchy that we would be most likely looking to leverage at some point in the future. So yes, there are a couple of things that we&rsquo;ll be looking to improve in our UI, but getting a better version out there to start is our top goal.</p>


<h3 id="hahahugoshortcode-s33-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I think everyone&rsquo;s really excited for it. I know from the <a href="https://groups.google.com/g/openalex-users">OpenAlex user group</a> that people are super engaged with everything you&rsquo;re doing with the UI.</p>


<h3 id="hahahugoshortcode-s34-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>You&rsquo;ve already touched on this, but what are some of the major things on the OpenAlex roadmap that are coming up?</p>


<h3 id="hahahugoshortcode-s35-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>As far as our roadmap, author disambiguation is at the top of my list, and it has been for the last four or five months. Originally, we had a very patchwork model that we built when we first came away from MAG just so that we wouldn&rsquo;t be minting new author IDs for every author that came into our system. We needed to improve on that, which we did with the v2 model, which is what we have now. But there were still a lot of improvements that we could have made.</p>
<p>This version 3 model should improve our author disambiguation system a lot. Right now we get a lot of feedback like, &ldquo;Hey, my name is split into 20 different profiles, and my ORCID isn&rsquo;t in my top profile, can we fix this?&rdquo; And we say, &ldquo;Yes, we would love to fix that.&rdquo; I would say that the system we currently have isn&rsquo;t good with merging clusters or taking them away. We have to do it very manually, which is okay at the moment, because there aren&rsquo;t a ton of users who are asking us for this. We&rsquo;re getting a lower number of requests than we can deal with. But eventually, if we want to scale up, and we have thousands of users or hundreds of thousands of users asking us to change their author profiles, we want to make sure that the system we&rsquo;re building now can handle that.</p>
<p>And then we also want to make sure that we have a system in place for them to make those changes themselves. One of the big things that&rsquo;s going to be coming out eventually is that we&rsquo;re going to let authors request changes to their own profile. If they see that a work assigned to them doesn&rsquo;t belong to them, they&rsquo;ll be able to submit a request to get that work removed, and then our system will either try to assign it to the next most likely author or will create a new author for that work.</p>
<p>We&rsquo;ll also allow authors to merge two profiles, and we&rsquo;ll allow them to add works to their profile, and we&rsquo;ll have some process for authors to claim a profile on OpenAlex. It should be a pretty cool improvement, but to do it we need a good author disambiguation model and good author clusters to begin with. That&rsquo;s what our current goal is.</p>
<p>We have a lot of different projects going on. One of our team members is working on increasing the number of statistics, the &ldquo;science of science&rdquo; metrics that we have within our system to make filtering easier through the API. We want to give people more ways to filter our data for metrics like &ldquo;number of works within the last five years&rdquo; or &ldquo;percentage of citations within the last five years.&rdquo; Right now there just aren&rsquo;t as many metrics to filter on.</p>
<p>We also have general system architecture projects. We&rsquo;re trying to make it so that our data is updated much more quickly so that when we eventually get to a place where users are sending us requests for data to be changed, those results will propagate through our system much more quickly than they do right now. One of our engineers has been working on that for probably the past week or two, so I&rsquo;d say that&rsquo;s another big project.</p>


<h3 id="hahahugoshortcode-s36-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Do you have any sense of who the users of the OpenAlex API are, roughly?</p>


<h3 id="hahahugoshortcode-s37-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>We have some idea because of the requests that we&rsquo;re getting &ndash; the problem reports, or bug reports. We have a group of people that submit tickets for any number of reasons, and there are definitely a lot of graduate students that are asking us, &ldquo;Hey, can you increase the limits for the API for this email address so that I can pull up more data?&rdquo; Just based on the number of grad students that message us about increasing their API limits, I would think that those are one of our largest groups of users for the API. I can&rsquo;t imagine many other groups that are more prominent for API usage.</p>


<h3 id="hahahugoshortcode-s38-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What are they doing with the OpenAlex API? What are they using it for?</p>


<h3 id="hahahugoshortcode-s39-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>That is a good question. When people submit support tickets for us, a lot of the time they include the reason why they need the API increase. We don&rsquo;t need that reason; all we need is an email from you requesting more data from the API, and usually we just give the user a higher limit. But a lot of people will tell us about the research they&rsquo;re doing. They want to analyze trends. Some of them want to analyze trends because they want to figure out where they should research, what topic they should explore, where they should submit a paper.</p>
<p>There are a lot of interesting support tickets that come in that tell us why people are using our API, but we don&rsquo;t really analyze API usage as far as what people are searching for in the API.</p>


<h3 id="hahahugoshortcode-s40-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>Could you tell us a little bit more about the author disambiguation that you&rsquo;re doing? Your new author model is more tightly coupled with ORCID, is that correct?</p>


<h3 id="hahahugoshortcode-s41-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Yes. For our <a href="https://groups.google.com/g/openalex-users/c/xEJi9IwjYa8">new model</a>, we are treating ORCID as the source of truth for author data. That comes with some drawbacks, because ORCID data isn&rsquo;t always reliable, so there are issues with that as well. But our hope is that over time those issues will get resolved, and our data will be resolved with it.</p>
<p>We have a lot of ORCID data in our system. We realized that we had gaps in our data where Crossref wasn&rsquo;t giving us an ORCID, possibly. Something was happening where you&rsquo;d look in the ORCID public data dump, and there&rsquo;d be a DOI that had an ORCID attached, and we&rsquo;d look in our system and we would not have that ORCID. We&rsquo;ve reconciled a lot of those differences.</p>
<p>ORCID is the basis of how our clustering starts. The ORCID clusters are the clusters we start with, and then we have some algorithms that are matching works to those clusters. If they don&rsquo;t work, they don&rsquo;t match with those ORCID clusters, then they are creating new clusters.</p>
<p>So yes, ORCID is at the center of what we do, at least for author disambiguation. It was able to give us a good training dataset. That&rsquo;s the hardest thing about author disambiguation, is finding good training data. Which brings us right back to ROR, because that was also giving us good training data for training our model, because sometimes we couldn&rsquo;t depend on the MAG data to give us everything we needed. It&rsquo;s been very helpful.</p>


<h3 id="hahahugoshortcode-s42-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I think that OpenAlex is going to become kind of a more interesting source of truth, or maybe a source for building services on top of. I don&rsquo;t know if you&rsquo;ve heard that ROR is now the primary institutional identifier behind ORCID, because <a href="https://info.orcid.org/orcid-support-for-ringgold-organization-ids-ending/">Ringgold has opted not to renew their agreement with ORCID around data sharing for organizational data</a>. And because of the association between ROR IDs and authors&rsquo; works, and now ORCID IDs by way of your new author model, I think that as a result of that people can make a lot more connections that probably weren&rsquo;t even exposed in either the Crossref API or the ORCID API. It&rsquo;s very cool.</p>


<h3 id="hahahugoshortcode-s43-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p><strong>ROR is playing a huge part in the author disambiguation as well, because we&rsquo;re using institutions to determine if authors are the same. Unfortunately, you can&rsquo;t only use institutions to decide that. You can&rsquo;t only use co-authors, you can&rsquo;t only use topics, so we&rsquo;re using a mixture of all of those things. We&rsquo;re getting that institution from the author&rsquo;s affiliation string and parsing it, and that&rsquo;s going directly into our author disambiguation model to try and match that institution. So ROR, again, playing a vital part in our system.</strong></p>


<h3 id="hahahugoshortcode-s44-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Awesome. Does the OpenAlex data model support multiple affiliations for authors? I would assume it does.</p>


<h3 id="hahahugoshortcode-s45-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Yes. We did not used to. Our first version of the institution parser was pretty good, but it could not handle strings that had multiple institutions in it. And so we worked a lot with <a href="https://www.cwts.nl/">CWTS</a>. They help create the <a href="https://www.leidenranking.com/">Leiden ranking</a>.</p>
<p>They took samples of our data, and they came back with suggestions for ways that we could improve it. We ended up incorporating a lot of those suggestions into our v2 model, and one of the main ones was to fix the problem that we could not handle multiple institutions in a single string.</p>
<p>The v2 model that we created was highly dependent on ROR. The models in the first version of the institution parser used some ROR data, but they didn&rsquo;t really integrate ROR into our system. For v2, the way that we figured out to get multiple institutions in a string is to take the top 20 most likely institutions that the model says might match and then to use some custom string matching on those top 20 in order to see which ones would match. That has been very successful.</p>
<p>There are issues when it comes to longer strings that have more than three or four institutions in them. As you can imagine, the more institutions contained within a single string, the harder it is for the model to point you towards all of them accurately. There&rsquo;s definitely degradation when there&rsquo;s that many institutions within a string, but if you&rsquo;re looking at two or three institutions in a string, sometimes four or five, it can normally get all of them. It&rsquo;s pretty good.</p>


<h3 id="hahahugoshortcode-s46-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Cool. Now I want to give you a chance to make some feature requests for ROR. What should we do that would make your life easier?</p>


<h3 id="hahahugoshortcode-s47-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>I know that the relationships between ROR institutions are very tricky sometimes. I see a lot of weird relationships, and I don&rsquo;t really understand how they work, and maybe that is actually how they are. I think that the next step for us is to use that relationship data. I would hope that when I start using it, it&rsquo;s ready to go, but I don&rsquo;t know if that&rsquo;ll be the case. We are working right now to look at French institutions. I don&rsquo;t know if you are familiar with how complicated that is.</p>


<h3 id="hahahugoshortcode-s48-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I&rsquo;m working on a project with the <a href="https://www.enseignementsup-recherche.gouv.fr/fr">Ministry of Higher Education and Research</a> in France, and it can be challenging to reconcile local conceptions of relationship hierarchies with ROR’s data, as you say, and also what&rsquo;s an organization and what&rsquo;s not an organization. We&rsquo;re doing a lot of work on that. [Release <a href="https://github.com/ror-community/ror-updates/releases/tag/v1.30">v1.30</a> of the ROR Registry includes a great many updates to French institutions as the result of this intensive curation work.]</p>


<h3 id="hahahugoshortcode-s49-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>It was something that I saw when I was training the model. You see &ldquo;CNRS&rdquo; everywhere, and you see a bunch of different acronyms. I just assumed, &ldquo;Okay, I see that acronym, and I know that the words in that acronym are this, and so, sure, we match that.&rdquo; But in reality, it&rsquo;s not that simple. And of course, we do not have those relationships correct with our current institution model. And so that&rsquo;s something that we are definitely about to tackle.</p>
<p>It&rsquo;s one of the reasons I&rsquo;m going to be looking into institutions again is to try and figure out how to improve the French part of our affiliation matching, even though I know that it&rsquo;s very confusing. There are people that have solutions to that. There&rsquo;s a company, <a href="https://www.sirisacademic.com/">SIRIS Academic</a>, that has been developing some models to specifically deal with French institutions. We might be relying more on the open source community to come up with a solution to that, because I don&rsquo;t speak French, and I couldn&rsquo;t tell you the ins and outs of the French academic system or the research organizations, so it&rsquo;ll be helpful to get other people&rsquo;s points of view.</p>


<h3 id="hahahugoshortcode-s50-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>You should keep us in the loop about that work. To give you a perspective on what we&rsquo;re doing on the curation side, the Ministry of Higher Education and Research in France maintains their own registry of research organizations and research structures.</p>


<h3 id="hahahugoshortcode-s51-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Is that <a href="https://hal.science/">HAL</a>?</p>


<h3 id="hahahugoshortcode-s52-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>No, it&rsquo;s separate from HAL. It&rsquo;s called <a href="https://www.data.gouv.fr/en/datasets/repertoire-national-des-structures-de-recherche-rnsr/">RNSR</a>, the <em>Répertoire national des structures de recherche</em>. It enumerates all the French research structures which exist at various levels of solidity.</p>
<p>We&rsquo;re working on a project to include various French research units, as well as their relationships to their parent organizations. We&rsquo;re starting out by improving our coverage for what are called <a href="https://en.wikipedia.org/wiki/Unit%C3%A9_mixte_de_recherche">UMRs, <em>Unités mixte de recherche</em></a>, or &ldquo;mixed research units.&rdquo; We&rsquo;re adding a bunch of them, making sure that they&rsquo;re better disambiguated,  and that they have the correct parents, based on the RNSR data.</p>
<p>In general, we&rsquo;re doing a lot of work around improving French research organizations, and are being graciously <a href="https://ror.org/blog/2023-07-24-community-supported-infra#supporting-ror-through-global-coalitions">supported by the French government to do that</a> through the <a href="https://www.ouvrirlascience.fr/home/">French Committee for Open Science</a>. Hopefully, that will accrue benefit to OpenAlex. But I&rsquo;d also be interested to know what you find as a result of doing the analysis, because I&rsquo;m sure there are things that we&rsquo;ve missed.</p>


<h3 id="hahahugoshortcode-s53-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>You will hear something probably within the next month or two. I think that one of the other things is that there are sometimes universities that have many institutes underneath them, and I think sometimes not all of those institutes are in ROR, and not all of those institutes are linked to the correct universities. That&rsquo;s another thing that I have seen that could be improved. I mean, the ROR data&rsquo;s very useful to me, so I can&rsquo;t really think of anything that I would need to have it improved. It has been very useful to me so far.</p>


<h3 id="hahahugoshortcode-s54-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. Good to know.</p>


<h3 id="hahahugoshortcode-s55-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>It is really interesting to hear from someone who else is dealing with it from an aggregate perspective, though, because we&rsquo;re so used to talking to integrators who are dealing with it on a more one to one basis, or a more regional basis.</p>


<h3 id="hahahugoshortcode-s56-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Yes, I&rsquo;m dealing with them all.</p>


<h3 id="hahahugoshortcode-s57-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What else would you like to say about ROR, or about version three of the institution matching that you&rsquo;re planning, or about anything at all?</p>


<h3 id="hahahugoshortcode-s58-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>I can say that one of the things that was most useful for us was that with all the information you can get in the ROR data, we were able to artificially create affiliation strings. We get raw affiliation strings from a bunch of different sources, but not all of the ROR institutions have affiliations in our system, and not all of them have a lot of affiliation. Some institutions have three or four affiliated works, while other institutions have thousands.</p>
<p>We built a process for creating artificial affiliation strings using certain patterns that you normally see in an affiliation string, and that was able to increase our coverage of ROR institutions. Maybe we had those strings in our system, but we weren&rsquo;t connecting them to a ROR institution because MAG never connected to them. Or maybe there&rsquo;s an institution that one day will give us papers that we need to parse, but they haven&rsquo;t given us any yet. This process allows us to get rid of a <a href="https://en.wikipedia.org/wiki/Cold_start_(recommender_systems)">cold start problem</a> where you don&rsquo;t have any data about an institution, so when it comes into your system it does not get tagged because our model has never seen it before.</p>

<div class="blog-video">
  <iframe src="https://www.youtube.com/embed/Tx5y7lX030U" allowfullscreen title="YouTube Video"></iframe>
</div>

<p><em>Beginning at 03:34, watch Justin&rsquo;s presentation at the 2023 ROR Annual Meeting about OpenAlex&rsquo;s institutional parsing and how they used ROR to generate artificial author affiliation strings to use as training data.</em></p>
<p><strong>The artificial strings have made it a lot more robust of a model, and we wouldn&rsquo;t have been able to do that without the ROR data, because we would have just had to use existing affiliation strings.</strong> If we wanted to add more data, we would have had to find each individual ROR institution that didn&rsquo;t have any data in our system and do manual string searching in order to get all of that training data, which would be very time-consuming. That&rsquo;s definitely something that we could potentially do one day, but we were looking for something that was a lot quicker, because the timeline was pretty short for the v2 institution parsing model.</p>
<p>So yes: as I said, for v3, I think I&rsquo;ll grab an updated snapshot of the ROR data so that I&rsquo;m working with the latest and greatest, and I&rsquo;ll probably look into using our database, because we&rsquo;re grabbing data from ROR daily. I&rsquo;ll probably look into having those strings added to a dictionary that I keep up to with strings to match to ROR. It&rsquo;ll be fun to continue to develop the institutions model, because it&rsquo;s already pretty good at what it does, so just finding areas where it&rsquo;s not doing well and probably using ROR data to make it better will be a fun little problem.</p>


<h3 id="hahahugoshortcode-s60-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I just want to say that your fake affiliation thing was a really brilliant piece of work. When I was reading through <a href="https://github.com/ourresearch/openalex-institution-parsing">your code</a> and I saw that, I thought, &ldquo;Oh, this is genius.&rdquo;</p>


<h3 id="hahahugoshortcode-s61-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>I truly can&rsquo;t take all the credit for it. I was in a meeting with Jason Priem, and he mentioned something about using the ROR data. He asked if we could use the ROR data to train on, and I said no, because we had non-ROR institutions we were looking to match. But then I realized we could do it, because affiliation strings have a standard syntax. It was probably a joint idea &ndash; it wasn&rsquo;t just me. But it is a very good idea. Artificial data creation for non-structured data is almost impossible, but we&rsquo;re dealing with semi-structured data, which lends itself better to artificially creating something. So yes, that worked out pretty well.</p>


<h3 id="hahahugoshortcode-s62-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I did actually write <a href="https://github.com/ror-community/affiliation-matching-experimental/tree/main/fake_affiliations">a script</a> to recreate what Justin did, to take the <a href="https://ror.readme.io/docs/data-dump">ROR data dump</a> and convert it into a set of fake affiliation strings. I don&rsquo;t know if it exactly follows the patterns that OpenAlex used, but others can use it to do the same thing, if they want. If they&rsquo;re building their own models, they want to supplement, they want to train, they can just pull the ROR data, run it through the script and do that.</p>
<p>I also want to say that it&rsquo;s fantastic that <a href="https://github.com/ourresearch">OpenAlex does all this work open source</a> so that other people can see it and read it, because I was able to just immediately go through and read all of the code for the institution model, see how it was working, understand it, and read the <a href="https://docs.google.com/document/d/1ppbKRVtyneWc7Hjpo8TOm57YLGx1C2Oo/edit#heading=h.5w2tb5fcg77r">white paper</a>. And that&rsquo;s really fantastic, how much is being given back.</p>
<p>As we&rsquo;re looking at improving our <a href="https://ror.readme.io/docs/api-affiliation">affiliation matching service</a>, we have that benchmark, we have that thought process, so we don&rsquo;t have a cold start problem for improving our service, because it&rsquo;s all just out there in the open. There are so many other matching services for institutional identifiers that are closed and proprietary, and you don&rsquo;t have any idea how they&rsquo;re doing any of the matching or its level of correctness or its scores or anything like that. So what you&rsquo;re doing, that&rsquo;s great.</p>


<h3 id="hahahugoshortcode-s63-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>A lot of people aren&rsquo;t used to how open we are with our data. People have asked, &ldquo;Is there any way you could show me the code for this?&rdquo; and I&rsquo;ve said, &ldquo;Oh, it&rsquo;s all in this repository right here. And here&rsquo;s a paper that explains how to do it. And here&rsquo;s the artifacts, if you want to grab these artifacts, you can just download them to your account.&rdquo; That&rsquo;s the one thing that Jason has said from day one. We are completely open. We want everyone to have our data. We want everyone to have everything so that there&rsquo;s no question of us being an open data, open science company. That is that&rsquo;s our number one goal. That&rsquo;s been the number one goal since OurResearch, and OpenAlex will continue to have that goal until we exist no more. But hopefully that will be not for a long time.</p>


<h3 id="hahahugoshortcode-s64-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>It&rsquo;s great. And I think it accrues benefit to ROR, just as much as ROR accrues benefit to OpenAlex, that exchange of the open code, the open data, everything. Speaking of which, as part of our affiliation matching improvement, we put <a href="https://github.com/ror-community/affiliation-matching-experimental">new training datasets in a repository</a>. It&rsquo;s a large set of affiliation strings that were manually paired to ROR IDs at the <a href="https://www.aps.org/">American Physical Society</a>. They&rsquo;re all manually assigned, so it&rsquo;s good data, and it&rsquo;s about 15,000 pairs.</p>


<h3 id="hahahugoshortcode-s65-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Oh wow. That&rsquo;s a lot.</p>


<h3 id="hahahugoshortcode-s66-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>It&rsquo;s good for test scores. We also pulled all of the publisher-submitted items out of the Crossref API, which you can also use as a good benchmark. Now, of course, we don&rsquo;t know necessarily how those were assigned, because a publisher could use some matching service that we don&rsquo;t know about to assign their ROR IDs, but that&rsquo;s another good one. But the APS data in particular is very good.</p>


<h3 id="hahahugoshortcode-s67-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>That is good, because there aren&rsquo;t many good benchmarks, especially ones that match directly to ROR and go straight from a raw string, so it&rsquo;s good to get a lot of those. I know that the French company that we&rsquo;re working with right now, SIRIS Academic, they have also created some datasets and some benchmarks specifically for French institutions, and they have an easy dataset and a hard dataset, so that&rsquo;s something that we&rsquo;re going to be looking at.</p>


<h3 id="hahahugoshortcode-s68-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for your time, Justin.</p>


<h3 id="hahahugoshortcode-s69-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>I am probably the most excited about this interview, of all the readers!</p>


<h3 id="hahahugoshortcode-s70-hbhb-justin-barrett"><figure class="round-figure"><img src="/img/blog/openalex/justin_barrett.png"
         alt="Avatar of Justin Barrett"/>
</figure>
 Justin Barrett 
</h3>
<p>Thank you very much.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Justin Barrett, Lead Machine Learning Engineer for OpenAlex at OurResearch, talks with ROR Technical Community Manager Amanda French and ROR Curation Lead Adam Buttrick about using ROR in OpenAlex both as an identifier for institutions and as a dataset for training machine learning models.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Now Hiring: Metadata Curation Support Contractor</title><id>https://doi.org/10.71938/cdn3-jg69</id><link rel="alternate" href="https://ror.org/blog/2023-09-11-hiring-curation-contractor/"/><published>2023-09-11T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is seeking a short-term curation support contractor as registry curation activities continue to expand. Please consider submitting an application and help spread the word to your networks!</p>


<h1 id="about-ror-and-registry-curation">About ROR and registry curation 
</h1>
<p>The Research Organization Registry (ROR) is a community-led registry of open, sustainable, usable, and unique identifiers for every research organization in the world.</p>
<p>ROR is a <a href="/about#governance">cross-organization collaborative led by California Digital Library, Crossref, and DataCite</a>. Each organization provides input on decisions and overall strategy that support the growth and sustainability of ROR by engaging in outreach to funders, potential adopters, and other stakeholders in their respective networks.</p>
<p>The ROR registry includes IDs and metadata for more than 105,000 research organizations and counting. ROR receives regular input from its global community of users and stakeholders about records that should be added to or updated in the registry. ROR also seeks and incorporates input from the broader community about policies and processes for maintaining registry data. We review and process this feedback through a centralized process and according to metadata policies established and maintained by ROR&rsquo;s <a href="/registry#curation-advisory-board">curation advisory board</a> and <a href="/about/team/#core-team">metadata curation lead</a>. All requests are publicly visible in an open <a href="https://github.com/ror-community/ror-updates#readme">Github repository</a> so that the process is as transparent as possible. New registry <a href="https://github.com/ror-community/ror-updates/releases">releases</a> are published on a rolling basis, approximately once a month.</p>
<p>Since <a href="/blog/2022-03-17-first-independent-release">officially launching its new curation model</a> in early 2022, ROR&rsquo;s volume of curation activities has been increasing steadily. In order to keep up with this growth, we are looking to scale resourcing on the ROR team accordingly, beginning with hiring short-term contract support to help maintain the pipeline of requests. Contract support will better equip ROR to be as responsive as possible to requests, to work with community members interested in submitting unique types of requests such as bulk lists of organizations, and to pursue special strategic projects such as the <a href="/blog/2023-09-07-open-funder-registry-transition-ror-cross-post">reconciliation of Crossref&rsquo;s Open Funder Registry with ROR</a>. This short-term support will also allow us to assess bigger-picture and longer-term needs with regard to curation staffing.</p>
<p>The project description and application instructions follow below.</p>


<h2 id="curation-contractor-general-responsibilities">Curation Contractor: General responsibilities 
</h2>
<ul>
<li>Triage new curation requests using Github-based and command-line tools</li>
<li>Evaluate accuracy and validate requests by researching information in ROR, on organizational websites, and other sources such as Google Scholar</li>
<li>Review bulk requests as assigned by the metadata curation lead</li>
<li>Provide recommendations to curation lead and curation advisory board about how requests should be processed</li>
<li>Provide input to curation lead and curation advisory board to help improve the curation process and registry data quality</li>
<li>Provide support to curation lead and curation advisory board if troubleshooting is needed</li>
</ul>


<h2 id="contract-deliverables">Contract deliverables 
</h2>


<h3 id="curation-backlog-management-and-processing">Curation backlog management and processing 
</h3>
<p>Carry out Github- and command-line-based workflow to triage new curation requests and keep backlog small. Evaluate metadata accuracy and complete requests by researching information in ROR, on organizational websites, and other sources such as Google Scholar. Provide recommendations to curation lead and curation advisory board about how requests should be processed</p>


<h3 id="project-support">Project support 
</h3>
<p>Review bulk requests as identified and assigned by the metadata curation lead. Provide support to curation lead and curation advisory board if troubleshooting is needed</p>


<h3 id="data-analysismetadata-qa">Data analysis/metadata Q&amp;A 
</h3>
<p>Identify opportunities to improve ROR metadata quality and address metadata quality issues at scale.</p>


<h2 id="skills-and-qualifications">Skills and qualifications 
</h2>
<ul>
<li>Familiarity with Github issues and projects</li>
<li>Familiarity with command line operations</li>
<li>Ability to use basic functions of the ROR API (we can provide training)</li>
<li>Ability to perform intelligent searches on the ROR website (multiple queries etc) and related authority sources (e.g. Wikidata, ISNI, ORCID)</li>
<li>Excel/spreadsheet data analysis</li>
<li>Attention to detail</li>
<li>Professional proficiency in English and at least one other language</li>
<li>Ability to search and analyze information about research institutions and scholarly publications</li>
<li>Knowledge of research/publishing landscape and persistent identifiers for scholarly communications desirable but not required</li>
</ul>


<h2 id="position-details">Position details 
</h2>
<p>This is a part-time contracted position (up to 20 hours a week) for approximately three months. This position will be in contract with Crossref but will be accountable to the three organizations that operate ROR (Crossref, California Digital Library, and DataCite).</p>
<p>All work will be performed remotely. The contractor will have no set work hours.</p>


<h2 id="to-apply">To apply 
</h2>
<p>To apply, send a resume, statement of interest, and rate schedule to <a href="mailto:jobs@ror.org">jobs@ror.org</a>. Proposals will be reviewed until the position is filled. Review of proposals will begin September 15.</p>


<h2 id="equal-opportunity-statement">Equal opportunity statement 
</h2>
<p>ROR, and the collaborating organizations behind it, California Digital Library, Crossref, and DataCite, are committed to non-discrimination and making opportunities available to anyone without regard to race, color, religion, sex, pregnancy or a condition related to pregnancy, sexual orientation, gender identity or expression, national origin, ancestry, age, physical or mental disability, genetic information, veteran status, uniform service member status, or any other protected class under applicable law.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is seeking a short-term curation support contractor as registry curation activities continue to expand. Please consider submitting an application and help spread the word to your networks!</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Open Funder Registry to Transition to ROR</title><id>https://doi.org/10.71938/6snb-ya06</id><link rel="alternate" href="https://ror.org/blog/2023-09-07-open-funder-registry-transition-ror/"/><link rel="related" href="https://doi.org/10.64000/v3429-p7810"/><published>2023-09-07T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><author><name>Ginny Hendricks</name><uri>https://orcid.org/0000-0002-0353-2702</uri></author><author><name>Rachael Lammey</name><uri>https://orcid.org/0000-0001-5800-1434</uri></author><author><name>Fabienne Michaud</name><uri>https://orcid.org/0000-0003-4389-7463</uri></author><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Crossref has announced a long-term plan to deprecate the Open Funder Registry and merge it with ROR in order to make workflows more efficient for all concerned. ROR and Crossref are working closely together on this important initiative, and we&rsquo;re happy to answer any questions that users of the Funder Registry may have. Read the full text of Crossref&rsquo;s announcement below, and feel free to write <a href="mailto:info@ror.org">info@ror.org</a>, <a href="https://calendly.com/ror-chat">book a meeting with Amanda French</a>, or attend a <a href="/events">ROR event</a> if you&rsquo;d like to learn more.</span></div>
	</div>
</div>


<h2 id="open-funder-registry-to-transition-to-the-research-organization-registry-ror">Open Funder Registry to transition to the Research Organization Registry (ROR) 
</h2>
<p>Today, we are announcing a long-term plan to deprecate the <a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry</a>. For some time, we have understood that there is significant overlap between the Funder Registry and the <a href="https://ror.org">Research Organization Registry (ROR)</a>, and funders and publishers have been asking us whether they should use Funder IDs or ROR IDs to identify funders. It has therefore become clear that <strong>merging the two registries will make workflows more efficient and less confusing for all concerned.</strong> Crossref and ROR are therefore working together to ensure that Crossref members and funders can use ROR to simplify persistent identifier integrations, to register better metadata, and to help connect research outputs to research funders.</p>
<p>Just yesterday, we published <a href="https://www.crossref.org/blog/open-funding-metadata-community-workshop-report/">a summary of a recent workshop between funders and publishers on funding metadata workflows</a> that we convened with the Dutch Research Council (NWO) and Sesame Open Science. As the report notes, &ldquo;open funding metadata is arguably the next big thing&rdquo; [in Open Science]. That being the case, we think this is the ideal time to strengthen our support of open funding metadata by beginning this transition to ROR.</p>


<h3 id="comparing-the-features-of-ror-and-the-funder-registry">Comparing the features of ROR and the Funder Registry 
</h3>
<p>Let&rsquo;s look at some of the major similarities and differences between the two registries, including their history, features, scope, and usage, since there are important nuances and distinctions that are helpful to understand.</p>


<h5 id="overview">Overview 
</h5>
<table>
<thead>
<tr>
<th>ROR</th>
<th>Funder Registry</th>
</tr>
</thead>
<tbody>
<tr>
<td>Launched in 2019</td>
<td>Launched in 2013</td>
</tr>
<tr>
<td>Primary use case is contributor affiliation</td>
<td>Primary use case is funding acknowledgement</td>
</tr>
<tr>
<td>105k+ records</td>
<td>35k+ records</td>
</tr>
<tr>
<td>CC0 data</td>
<td>CC0 data</td>
</tr>
<tr>
<td>REST API</td>
<td>REST API</td>
</tr>
<tr>
<td>Free to use</td>
<td>Free to use</td>
</tr>
<tr>
<td>Entire registry downloadable as JSON and CSV</td>
<td>Entire registry downloadable as RDF; funder names and IDs downloadable as CSV</td>
</tr>
<tr>
<td>Records contain mappings to other IDs</td>
<td>Records do not contain mappings to other IDs</td>
</tr>
<tr>
<td>Organization relationships and hierarchy</td>
<td>Organization relationships and hierarchy</td>
</tr>
<tr>
<td>8 organization types</td>
<td>2 funder types, 8 funder subtypes</td>
</tr>
<tr>
<td>Open source code and multiple open-source tools available</td>
<td>Open source code</td>
</tr>
<tr>
<td>Web-based registry search</td>
<td>Web-based search for works in Crossref associated with each Funder ID</td>
</tr>
<tr>
<td>Web-based landing pages for each ROR record</td>
<td>JSON landing pages for each Funder Registry record</td>
</tr>
<tr>
<td>Updated monthly</td>
<td>Updated monthly</td>
</tr>
<tr>
<td>Public curation process</td>
<td>Private curation process</td>
</tr>
<tr>
<td>Anyone can request changes and additions</td>
<td>Anyone can request changes and additions</td>
</tr>
<tr>
<td>Stable financial support</td>
<td>Stable financial support</td>
</tr>
<tr>
<td>Beginning to be supported in funding and publishing workflows</td>
<td>Somewhat well supported in most funding and publishing workflows</td>
</tr>
<tr>
<td>Currently used by 260+ Crossref members <sup id="fnref:1"><a href="#fn:1" class="footnote-ref" role="doc-noteref">1</a></sup></td>
<td>Currently used by 2100+ Crossref members <sup id="fnref:2"><a href="#fn:2" class="footnote-ref" role="doc-noteref">2</a></sup></td>
</tr>
</tbody>
</table>


<h5 id="history">History 
</h5>
<p>The <a href="https://www.crossref.org/services/funder-registry/">Open Funder Registry</a> was <a href="https://www.crossref.org/news/2013-05-28-crossrefs-fundref-launches-publishers-and-funders-track-scholarly-output/">launched as FundRef over a decade ago</a> to enable the community to <strong>cite research financing</strong> and assert it within the scholarly record, acknowledging the organizations granting their support. Elsevier generously donated the seed data for the Funder Registry and has managed its curation for the last ten years, while we have maintained the technical operations and promoted community adoption of the Funder Registry.</p>
<p>The <a href="https://ror.org/">Research Organization Registry (ROR)</a> was <a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/">introduced in 2019</a> by the California Digital Library, DataCite, and Crossref to enable the community to <strong>cite contributor affiliations</strong> and assert them within the scholarly record, acknowledging the organizations that housed or performed the research. Digital Science generously donated the seed data for the Research Organization Registry from its Global Research Identifier Database (GRID) initiative, and Crossref, DataCite, and the California Digital Library have contributed labor and resources to turn ROR into a mature, independent, freely available offering.</p>


<h5 id="scope">Scope 
</h5>
<p>One key difference between the registries is that <strong>ROR has always included funding organizations, and ROR records have always included mappings to Funder IDs where available,</strong> while the reverse is not true: the Funder Registry includes only funding organizations, not other kinds of organizations, and Funder Registry records do not currently include mappings to ROR IDs or other identifiers. It therefore makes sense to expand ROR&rsquo;s initial contributor affiliation use case to include the function of identifying research financing.</p>


<h5 id="usage">Usage 
</h5>
<p>More Crossref members use Funder IDs than use ROR IDs, to be sure. You can see from the table above that the number of Crossref members using Funder IDs in Crossref records is higher by almost a factor of 10 than the number of Crossref members using ROR IDs in Crossref records. But note too that <strong>the current <em>rate</em> of adoption is far higher for ROR than it is for the Funder Registry.</strong> Since <a href="https://www.crossref.org/blog/a-ror-some-update-to-our-api/">January of 2022</a>, we&rsquo;ve seen a gratifying number of publishers and service providers beginning to use ROR identifiers for contributor affiliations in Crossref. In the last year, the number of Crossref members depositing ROR IDs has increased by 356%, while the number depositing Funder IDs has increased only by 12%. As evidenced by its ballooning API traffic, too, with more than 20 million requests last month,<sup id="fnref:3"><a href="#fn:3" class="footnote-ref" role="doc-noteref">3</a></sup> ROR is clearly being used by many scholarly research systems for many purposes. <strong>The more systems that use an identifier, the more valuable that identifier becomes as a vehicle for exchanging information.</strong></p>
<p>Even though ROR&rsquo;s primary use case has been to identify contributor affiliations, ROR is in fact already being used by funders. Nineteen funding organizations are depositing ROR IDs in their grant records with Crossref to denote principal investigator affiliations,<sup id="fnref:4"><a href="#fn:4" class="footnote-ref" role="doc-noteref">4</a></sup> and, following a meeting of the <a href="https://www.crossref.org/working-groups/funders/">Crossref Funder Advisory Group</a> last month, all eighty funder members are primed to start using ROR IDs to identify themselves in grant records. DataCite has allowed ROR IDs as a funding identifier since 2019<sup id="fnref:5"><a href="#fn:5" class="footnote-ref" role="doc-noteref">5</a></sup>, and while there are currently over 877,000 DataCite records that use Funder IDs to identify funders,<sup id="fnref:6"><a href="#fn:6" class="footnote-ref" role="doc-noteref">6</a></sup> there are also over 161,000 DataCite records that use ROR IDs to identify funders.<sup id="fnref:7"><a href="#fn:7" class="footnote-ref" role="doc-noteref">7</a></sup></p>


<h5 id="tools-and-services">Tools and services 
</h5>
<p>Both the Funder Registry and ROR offer open data and open source code, but we think that ROR&rsquo;s suite of free and open source utilities (some of which were developed by Crossref staff) gives it a competitive advantage. We know that publishers and their service providers have ongoing challenges in collecting and matching funding information from authors and in validating Funder IDs. With ROR’s extensive toolkit, <strong>publishers and their technology providers who adopt ROR will be in a much better position to improve the accuracy of funding acknowledgements in metadata, which can in turn enable the development of reliable analytics, tools, and services for funders, regulators, research facilities, and the public</strong>.</p>
<p>Crossref has built tools based on OpenRefine for both the Funder Registry and ROR: the <a href="https://www.crossref.org/labs/fundref-reconciliation-service/">Open Funder Registry Reconciliation Service</a> and the <a href="https://ror.readme.io/docs/openrefine-reconciler">ROR Reconciler</a> are both useful ways to clean messy data. ROR, however, also offers a much-used <a href="https://ror.readme.io/docs/api-affiliation">API endpoint that helps match organization names to ROR IDs</a>, and several third parties have also developed and shared <a href="https://ror.readme.io/docs/matching#match-organization-names-to-ror-ids-using-third-party-tools">open source matching tools and services for ROR</a>. Crossref and ROR are also collaborating on new strategies for affiliation matching that will be able to match funding references.</p>


<h5 id="community-engagement-models">Community engagement models 
</h5>
<p>The Funder Registry has been curated for over a decade through time and expertise generously donated by Elsevier. ROR offers more transparency and community involvement; it is <a href="https://ror.org/about/#governance">openly governed</a> by Crossref, DataCite, and the California Digital Library and is advised by a global network of community stakeholders through its <a href="https://ror.org/community/#steering-group">Steering Group and</a> Community Advisory Group. ROR is <a href="https://github.com/ror-community/ror-updates/issues">openly curated</a> and is aided by a global <a href="https://ror.org/registry/#curation-advisory-board">Curation Advisory Board</a> of volunteers.</p>


<h5 id="summary">Summary 
</h5>
<p>For all of the above reasons, then, we believe that in the long term ROR will serve the community better as an identifier for funders. In a future post, we&rsquo;ll do an even deeper dive into comparing the Funder Registry and ROR, comparing the metadata and data in each registry and giving statistics on funder assertions in our metadata.</p>


<h3 id="what-will-this-mean-for-you">What will this mean for you? 
</h3>
<p>The many organizations whose tools, services, and workflows have been architected to use Funder Registry IDs will find this transition a challenge, and we don&rsquo;t want to make light of that issue. Over the last ten years, we have encouraged the community to adopt Funder IDs, and the community has demonstrably recognized the benefits of doing so. Publishers have put a great deal of time, thought, and effort into collecting funder data and including it in Crossref metadata, and they have built internal reports and workflows around the Funder Registry. <strong>Both Crossref and ROR are committed to making the transition from the Funder Registry to the Research Organization Registry as simple as possible for those who have adopted the Funder Registry.</strong></p>
<p>If you are not already using the Funder Registry and are planning to begin standardizing funding data, we recommend that you use ROR to identify funders. If you are currently using the Funder Registry in your systems and workflows, don&rsquo;t worry! <strong>In the short term, and even in the medium term, Funder IDs aren&rsquo;t going away.</strong> Eventually, however, the Funder Registry will cease to be updated, so any new funders will only be registrable in Crossref metadata with ROR IDs. Legacy Funder IDs and their mapping to ROR IDs will be maintained, so if Crossref members submit a legacy Funder ID, it will get mapped to a ROR ID automatically. Note, too, that Crossref is committed to maintaining the current funder API endpoints until ROR IDs become the predominant identifier for newly registered content.</p>
<p>In short, if you are already using Funder IDs, you can and should continue to do so. However, we do recommend that you begin looking at what it will take to integrate ROR into your systems and workflows for identifying funders. Think of it as warming up before a workout: it&rsquo;s time to start swinging your arms and stretching your hamstrings.</p>
<p>We face challenges in this transition, too. Of these, we think the largest will be (1) completing the reconciliation work involved in mapping Funder IDs to ROR IDs, and (2) overhauling Crossref&rsquo;s schemas, APIs, and deposit tools to support ROR IDs in all the ways we currently support Funder IDs. We&rsquo;ll discuss both of these challenges in future blog posts, but it&rsquo;s worth saying that <strong>any challenges pale in comparison to the benefit of enabling the whole community to use a single open identifier in multiple places in the scholarly record.</strong></p>


<h3 id="tell-us-what-you-need">Tell us what you need! 
</h3>
<p>We want to hear from you. You can use our <a href="https://community.crossref.org/">Community Forum</a> talk to us about the Crossref Funder Registry, and you can join the ROR Slack to talk to the ROR team and community. You can also contact Crossref via our <a href="https://support.crossref.org/hc/en-us/requests/new?ticket_form_id=360001642691">request form</a> or email ROR at <a href="mailto:info@ror.org">info@ror.org</a>, and you can attend online <a href="/events/">Crossref events</a> and <a href="https://ror.org/events">ROR events</a> to get updates from us and ask us your questions.</p>
<p>One of the major messages we&rsquo;re already hearing from funders and publishers is expressed in <a href="https://www.crossref.org/blog/open-funding-metadata-community-workshop-report/">yesterday&rsquo;s post on open funding metadata</a>: &ldquo;While many concluded that there was still a long way to go to solve the many technical challenges related to funding metadata, attendees were unanimous on its importance.&rdquo; We look forward to beginning this important work together.</p>
<div class='callout grey'>
	<div class="row">
		<div class="col-md-2" style="text-align: center;"><i class='fa-solid fa-envelope'></i></div><div class="col-md-10"><span>Contact <a href="mailto:info@ror.org">info@ror.org</a> with any and all questions.</span></div>
	</div>
</div>
<div class="footnotes" role="doc-endnotes">
<hr>
<ol>
<li id="fn:1">
<p><a href="https://api.crossref.org/works?filter=has-ror-id:t&amp;facet=publisher-name:*">Crossref API works with ROR IDs faceted by publisher name</a>&#160;<a href="#fnref:1" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:2">
<p><a href="https://api.crossref.org/works?filter=has-funder-doi:t&amp;facet=publisher-name:*">Crossref API works with Funder IDs faceted by publisher name</a>&#160;<a href="#fnref:2" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:3">
<p><a href="https://p.datadoghq.eu/sb/db1aec04-0c1a-11ec-860a-da7ad0900005-7d7c572812608235cca3359ee5ec591a?from_ts=1690924139911&amp;to_ts=1693516139911&amp;live=true">ROR API Public API Usage Insights</a>&#160;<a href="#fnref:3" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:4">
<p><a href="http://api.crossref.org/works?filter=has-ror-id:t,type-name:Grant&amp;facet=publisher-name:*">Crossref API works of type &ldquo;Grant&rdquo; with ROR IDs faceted by publisher name</a>&#160;<a href="#fnref:4" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:5">
<p><a href="https://schema.datacite.org/meta/kernel-4.3/">DataCite Metadata Schema 4.3 release notes, August 2019</a>&#160;<a href="#fnref:5" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:6">
<p><a href="https://api.datacite.org/dois?query=fundingReferences.funderIdentifierType:%22Crossref%20Funder%20ID%22&amp;page%5Bsize%5D=0">DataCite API Funder ID in funding reference</a>&#160;<a href="#fnref:6" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
<li id="fn:7">
<p><a href="https://api.datacite.org/dois?query=fundingReferences.funderIdentifierType:ROR&amp;page%5Bsize%5D=0">DataCite API ROR ID in funding reference</a>&#160;<a href="#fnref:7" class="footnote-backref" role="doc-backlink">&#x21a9;&#xfe0e;</a></p>
</li>
</ol>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Crossref has announced a long-term plan to deprecate the Open Funder Registry and merge it with ROR in order to make workflows more efficient for all concerned. ROR and Crossref are working closely together on this important initiative.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: Why ResearchEquals Integrated ROR and Live Streamed It</title><id>https://doi.org/10.71938/yrbb-e418</id><link rel="alternate" href="https://ror.org/blog/2023-08-22-researchequals-case-study/"/><published>2023-08-22T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Chris Hartgerink, the founder of <a href="https://libscie.org">Liberate Science</a>, discusses why and how they integrated ROR into the modular publishing platform <a href="https://www.researchequals.com">ResearchEquals</a> for author affiliations in user profiles and Crossref DOIs and explains why they live streamed all eight hours of the work.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;Integrating ROR just makes it very easy to match all these authors to the same institution. And it also lets you get all the metadata related to an institution so that you can display institutions visually and add more information, like when the institution was founded and where it&rsquo;s located, all this information that you wouldn&rsquo;t get from just a raw string.&rdquo;</p>
<p>&ldquo;So when people add an affiliation, they all use ROR. I have not heard any feedback that people can&rsquo;t find their institution. Sometimes it might be under a slightly different name than they expect, especially if there&rsquo;s a bilingual name or if it&rsquo;s known under different names.&rdquo;</p>
<p>&ldquo;A nice thing about ROR is that it is living, breathing – with the continual releases, there&rsquo;s new organizations being added, making it more worthwhile without us doing anything. As the data get richer and the usage gets richer, I think that&rsquo;s really where the added value is.&rdquo;</p>
<p>&ndash; Chris Hartgerink, Founder, Liberate Science</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Hello, and thank you for participating in this case study interview. Can you tell us your name, title and organization?</p>


<h3 id="hahahugoshortcode-s2-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Hi, everyone, I&rsquo;m Chris Hartgerink from <a href="https://libscie.org">Liberate Science</a>. I&rsquo;m the CEO and founder there, and before I was a practicing researcher in meta-research.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And tell us about Liberate Science. When was it founded? What does it do? What makes it special?</p>


<h3 id="hahahugoshortcode-s4-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Well, Liberate Science was founded back in 2019 when I left academia. I wanted to keep working on improving research, but not within the academic institution, because I tried that, and even though it might work for some people, it definitely didn&rsquo;t work for me. I always felt like I was working towards not having a loss instead of towards wins. What&rsquo;s different about Liberate Science really is that it tries to stay as independent as possible so that we can grow serious alternatives to all aspects of doing research. Right now our focus is on a serious alternative to the big publishers and an alternative to the article format. And so we built <a href="https://www.researchequals.com">ResearchEquals</a>, which was the first modular publishing platform. If you like that, it&rsquo;s quite special.</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. What are these alternative formats in scholarly publishing?</p>


<h3 id="hahahugoshortcode-s6-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Alternative formats are really nice, because they vary in both size and content. Of course, the book is the largest entity of publishing, and the article is a smaller one. One alternative format is the micropublication, which is a small article, but it is still an article. Then there&rsquo;s modular publishing, which can be as big as a preprint, but it can be as small as a conference abstract. And then there&rsquo;s also nanopublications, which are the smallest entity I know, where it&rsquo;s just the statement that a certain variable positively predicts another variable, and that is then one publication.</p>
<p>Alternative formats are not necessarily paper-based. The nice thing about them is that they&rsquo;re creating diversity in the publishing space. We&rsquo;re used to reading text, but why not also have podcasts be part of the scholarly record? And videos, and study materials, and blogs? We&rsquo;re really seeing much more diversity in the outputs and that increased diversity in what gets published will be good for the assessment of researchers as well.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Fascinating. So when and where did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s8-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>I heard about ROR for the first time when it was still in development. In following along over time, it just made sense to me: ROR is to organizations what ORCID is to individual researchers. Of course we want to know where people come from, what their affiliation is, and have that be entirely unambiguous. And so I followed along on GitHub a bit and saw some of the discussion happening there. I really like this community-based curation model. As with any open source work, it always takes a bit of time. But I would say I&rsquo;ve known about ROR since 2019.</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We calculate ROR&rsquo;s official launch date as January 2019, which means that ROR&rsquo;s fifth anniversary is coming up in January 2024.</p>


<h3 id="hahahugoshortcode-s10-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Anything planned?</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, there will be events! We&rsquo;re beginning to plan. Any suggestions? What would you like to see happen at ROR&rsquo;s fifth anniversary?</p>


<h3 id="hahahugoshortcode-s12-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>I would be very interested in some creative celebrations, something that really marks the moment, because I think you and the team have achieved a lot in those five years. And I think that&rsquo;s worth literally celebrating. So maybe a virtual party?</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I love it. Sounds amazing.</p>


<h3 id="hahahugoshortcode-s14-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>There are all these artists and drag shows and performances on Zoom. I don&rsquo;t know whether they still do it.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>At Crossref, we once had an origami lesson on Zoom at an all-staff meeting, which was wonderful. Although that was maybe more meditative than celebratory. And speaking of creativity, I so admired your <a href="https://www.youtube.com/watch?v=D7dTfFLlUUI&amp;list=PLagyLN5PZI3zVbJNYCNFmL1cJif_31ReV">live streaming of your integration of ROR into ResearchEquals</a>! I thought that was something that more people should emulate. And it was so useful for us to see. Can you tell us about that, and about why you decided to integrate ROR into ResearchEquals?</p>


<h3 id="hahahugoshortcode-s16-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>ResearchEquals is a modular publishing platform, so we have a bit of a different situation from regular manuscript processing. With us, every researcher or author can make their own profile. And it&rsquo;s not that they create an account and then go through a whole submission process and only then somebody else says, &ldquo;Okay, now it gets published.&rdquo; On ResearchEquals, you as the author or authors get to say, &ldquo;Okay, now this becomes public.&rdquo; It&rsquo;s a bit more like a repository in that sense.</p>
<p>One of the things that came up was &ldquo;What metadata do people want to add to their profile?&rdquo; We started out with things like author names, their pronouns, their website, and a bio, but one of the things that was coming up was &ldquo;How can we discover other people?&rdquo; Authors also started saying, &ldquo;We have a certain affiliation, and we want to be able to add that.&rdquo; So then I started looking into that, and I remembered ROR! I was already inclined to integrate ROR anyway.</p>
<p>This way, we have the option in the future to also say, &ldquo;Well, I&rsquo;m not interested in specific authors, but I&rsquo;m interested in specific organizations.&rdquo; <strong>Integrating ROR just makes it very easy to match all these authors to the same institution. And it also lets you get all the metadata related to an institution so that you can display institutions visually and add more information, like when the institution was founded and where it&rsquo;s located, all this information that you wouldn&rsquo;t get from just a raw string.</strong></p>
<p>Plus, there&rsquo;s the risk, as you also know, that I might fill out my affiliation ever so slightly differently than someone else would. ROR was also a cost-effective way to implement affiliations. So all in all, there were many pluses in the pro column for ROR.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And why did you decide to live stream it?</p>


<h3 id="hahahugoshortcode-s18-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>The live streaming was actually very fun. I was thinking about integrating ROR, and then I thought, &ldquo;Okay, but I have so many other things on my plate. How do I actually plan this properly?&rdquo; It was a challenge to myself. It was four weeks, four two-hour slots, where I said, &ldquo;Well, I&rsquo;m just going to plan those two-hour slots for every week, and then I&rsquo;m going to get it as far as I can.&rdquo; Even if nobody watched, the live streaming would help me to plan and to have this experience of <a href="https://en.wikipedia.org/wiki/Rubber_duck_debugging">rubber duck programming</a>, where you try to explain to your rubber ducky what you&rsquo;re actually doing. It ended up being incredibly helpful. In the live stream itself I got some feedback that helped me along the way and made me look at things differently.</p>

<div class="blog-video">
  <iframe src="https://www.youtube.com/embed/videoseries?list=PLagyLN5PZI3zVbJNYCNFmL1cJif_31ReV" allowfullscreen title="Integrating ROR into ResearchEquals"></iframe>
</div>

<p>And so yeah, the idea was for self planning, but also to take the open source aspects of ResearchEquals to the next level. Or maybe not the next level, because other people have done this, but I thought this would be a fun exercise to try. And I was surprised how many people actually ended up watching the stream. I mean, it wasn&rsquo;t thousands, of course, but I think <a href="https://www.youtube.com/live/vfXjvn2ccUI?feature=share">one of the videos actually got around a hundred views</a> over time. Not very long views, but still.</p>


<h3 id="hahahugoshortcode-s20-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right! I thought they were fascinating, of course. I learned a lot from them. And there were moments during the live streams where I kept wanting to help you &ndash; and then not doing it, to be honest! Because I thought, &ldquo;No, I want to see if our documentation is good enough so that Chris can figure it out, and if they can&rsquo;t, then I&rsquo;m going to go and change it.&rdquo; So I did that several times, actually, after watching you struggle a little. I apologize for not helping you when I could have, just because it was so useful to see how our documentation could be improved.</p>


<h3 id="hahahugoshortcode-s21-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Well, you&rsquo;re off the hook.</p>


<h3 id="hahahugoshortcode-s22-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you. And how has that integration been? Do you find organizations that are not in ROR that people are entering?</p>


<h3 id="hahahugoshortcode-s23-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>We ended up implementing it in such a way that people can only enter organizations from ROR. We include the ROR ID in our database and not the raw strings, because otherwise we&rsquo;d have this issue still, which is the exact one we&rsquo;re trying to solve, of needing to disambiguate the organizations. <strong>So when people add an affiliation, they all use ROR. I have not heard any feedback that people can&rsquo;t find their institution. Sometimes it might be under a slightly different name than they expect, especially if there&rsquo;s a bilingual name or if it&rsquo;s known under different names.</strong></p>


<h3 id="hahahugoshortcode-s24-hbhb"><figure><img src="/img/blog/researchequals/researchequals-ror.gif"
         alt="ROR in ResearchEquals interface is used to add author affiliations"/>
</figure>
 
</h3>
<p>One of the things we have heard is a limitation, because we only had eight hours to implement, is that at this point we only allow one affiliation at any given time. People have said, you know, &ldquo;I have multiple affiliations, I would like to add them,&rdquo; so we&rsquo;ll be adding that at some point. It&rsquo;s always a matter of resources. The other thing that we&rsquo;re going to be adding are dates for each affiliation so people can also keep track of their past affiliations in their profile. That way it&rsquo;s easy to see how many people were at these organizations over time, so if you&rsquo;re not interested in the person <em>per se</em>, you can track how many people are using ResearchEquals at the organizational level. Of course, that doesn&rsquo;t go into metadata, but that&rsquo;s something we want to offer to institutions.</p>


<h3 id="hahahugoshortcode-s25-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. And are you using the ROR API or the ROR data dump?</p>


<h3 id="hahahugoshortcode-s26-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>For ingestion, we use the ROR API, simply to make it as up to date as possible. We also store organizational information whenever people add it in, so it&rsquo;s more efficient. I think that&rsquo;s been very helpful. Of course we try to keep the queries to a minimum, so it&rsquo;s only when people are searching that ROR gets asked &ldquo;Hey, does this exist?&rdquo; And then we store that information in our own database, and we may periodically double-check whether everything is still up to date.</p>


<h3 id="hahahugoshortcode-s27-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And do you send those ROR IDs out to any other systems, like DOI registration agencies such as Crossref? Or do they just stay in your system?</p>


<h3 id="hahahugoshortcode-s28-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Whenever I have an affiliation in my account, if I publish a module on ResearchEquals, it immediately gets added to my DOI metadata at that time. So if I switched my affiliation, job-wise, and I switched it in ResearchEquals, then at that point all the new DOIs would also have the new affiliation in there. All the old ones, of course, would have the old one.</p>
<p>What we don&rsquo;t yet do is do the backlog. If you&rsquo;ve authored with ResearchEquals before, pre-ROR integration, you don&rsquo;t have this info yet. If somebody is reading this and they want that updated, then we can always regenerate the metadata for the DOI.</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Gotcha. What challenges did you run into as you were integrating ROR into your system in four two-hour blocks?</p>


<h3 id="hahahugoshortcode-s30-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>I think the main thing was simply learning some of the quirks around how to interface with ROR directly. I think that&rsquo;s true for any API integration. I write my documentation differently from the way somebody else does, so you have to sort of learn the language a bit to know where to find things. The main thing that was confusing or was a difficulty was <a href="https://ror.readme.io/docs/identifier">this idea that the URL is the ID</a>. I thought it was going to be just the unique string part.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s32-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>That was a bit of a mindset shift. But beyond that, I think it was fairly straightforward. And most of the time, I was actually busy trying to fit all the design elements in the right place and make sure that when people click here and there that it works.</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>The interface design is a big issue, because there&rsquo;s lots of different ways to implement ROR in terms of UI.</p>


<h3 id="hahahugoshortcode-s34-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Yeah. And actually I think after the first two sessions, so after four hours, the integration was pretty much complete. And the remaining four hours were spent simply, you know, moving stuff one pixel to the right to make sure that it displays properly.</p>


<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I had actually forgotten this until you mentioned it, but one of the things that I changed after watching your live stream was the part of our documentation about <a href="https://ror.readme.io/docs/identifier">the preferred form of the ROR ID</a>. There&rsquo;s actually <a href="https://github.com/ORCID/ORCID-Source/issues/6520">a lot of discussion on the ORCID GitHub</a> about why our preferred form is the entire URL, and one of the changes I made was to add that reasoning into the documentation. Essentially, our sense is that when the URL is used as the ID in metadata, it&rsquo;s more likely to continue to resolve. And that&rsquo;s the key thing that you want from a persistent identifier.</p>
<p>I also put in the documentation that while we think the entire URL is the best form for the ROR identifier, the ROR API does also recognize just the unique string. A ROR ID is <code>https://ror.org</code> plus nine characters, and if you just use that nine-character string, that will work in the API.</p>
<figure><img src="/img/blog/researchequals/libscie-ror-id-forms.png"
         alt="Both the full URL and the 9-character string forms of the ROR ID are valid in requests to the ROR API"/><figcaption>
            <p>Both the full URL <a href="https://ror.org/0342dzm54">https://ror.org/0342dzm54</a> and the 9-character string <code>0342dzm54</code> forms of the ROR ID are valid in requests to the ROR API. The form <code>ror.org/0342dzm54</code> is also valid.</p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s37-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>I mean, I think it makes sense, because then ultimately, in the code, that means whenever we want to link out, we don&rsquo;t need to add all of that. Yeah, I think in that sense, it&rsquo;s very worthwhile.</p>
<p>In terms of difficulties, I think for me, it&rsquo;s also always that there&rsquo;s just so many systems to integrate with, and they all have their own quirks. If you have a big team, you could just say &ldquo;You&rsquo;re the Integration Specialist for this, and you for this.&rdquo; But we&rsquo;re a very small team, and actually I&rsquo;m the one responsible for these integrations, so I need to remember the quirks of ROR, remember the quirks of ORCID, remember the quirks of Crossref and whatever other integrations we have. That&rsquo;s always a challenge to keep space for all that in my head.</p>
<p>But, you know, if the documentation is clear, that improvement is also going to be helpful to me in the future, because I Google everything. Somebody once told me that the documentation is a reflection of the state of the community. Because if feedback doesn&rsquo;t get taken up into the documentation, then that&rsquo;s an indication that they&rsquo;re not really listening.</p>


<h3 id="hahahugoshortcode-s38-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, you&rsquo;re so right. I think I got up at 5am or something to watch your livestream, and the minute it ended, I thought, &ldquo;Okay, this documentation is changing.&rdquo; I absolutely believe that, that there really needs to be a virtuous circle of getting feedback from community and immediately putting that into the documentation. Whenever we get a support request, I think, &ldquo;If one person has this question, and they&rsquo;ve overcome all the barriers to finding out how to ask it, has gone to the trouble of starting an email or posting in the Slack or whatever, then clearly other people have that question, too.&rdquo; Any time one person has an issue, it&rsquo;s almost certainly the case that there are fifteen other people who have that issue who just left the page and never communicated with you about it. So yes, I completely agree about that. And again, thank you so much for all of the free user testing you gave us.</p>


<h3 id="hahahugoshortcode-s39-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>My pleasure. I was doing it anyway. So yes, that&rsquo;s good. Another benefit of live streaming.</p>


<h3 id="hahahugoshortcode-s40-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yep. So what do you hope ROR does in the future? Do you have any feature requests or suggestions for future direction?</p>


<h3 id="hahahugoshortcode-s41-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Well, I think for me, one of the things that I find very undervalued and that I&rsquo;ve also come to enjoy a lot is maintenance. So in that sense, I don&rsquo;t have any shiny new feature requests except for stability and reliability. I think that very often when something is maintained, a lot of people feel like it stagnates if there is not anything new. But I very much like the idea of ROR being a very reliable, stable service where I don&rsquo;t have to worry about the integration breaking anytime soon, or if it were to break, that there&rsquo;s sufficient time to handle it. So in that way, my biggest feature request is stability.</p>
<p><strong>A nice thing about ROR is that it is living, breathing – with the continual releases, there&rsquo;s new organizations being added, making it more worthwhile without us doing anything. As the data get richer and the usage gets richer, I think that&rsquo;s really where the added value is.</strong> I&rsquo;ve seen other infrastructure providers try to create too many projects, which sometimes is a disservice to the core infrastructure. So I think that would be the primary thing. If anything, I guess the ease with which people can submit for community curation or participate in assessing requests is something to continuously evaluate.</p>


<h3 id="hahahugoshortcode-s42-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yep. Good feedback. I think one of the issues is that we&rsquo;re specifically looking for people from Africa and Asia at the moment. I think if we recruited only from the US and Europe, we would get lots and lots of curators, but so much of what we need from our community curators right now is regional expertise. We need to recruit from Africa and Asia so that we can understand the organizations in that area. But yes, I agree, maybe we should work on clearer guidelines about that.</p>


<h3 id="hahahugoshortcode-s43-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>And if there&rsquo;s anything we can do, if we can add a link out to say, &ldquo;Hey, are you missing an institution? Go here to add it.&rdquo; Or, you know, if people are interested to curate, that people can see that, always happy to discuss that. But I can imagine that having only European and North American universities, departments, etcetera, included is going to be really what&rsquo;s going to be setting ROR apart.</p>


<h3 id="hahahugoshortcode-s44-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What else would you like to say about ROR? Or about ResearchEquals? Or about Liberate Science?</p>


<h3 id="hahahugoshortcode-s45-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>I think my main thing is I want to have a big festival when you turn five, because it&rsquo;s a momentous time. You turn five first, and then Liberate Science turns five later next year. So that&rsquo;s gonna be fun. And to whomever is reading this, if you&rsquo;re interested in modular publishing, <a href="https://www.researchequals.com/">check out ResearchEquals</a>. If it&rsquo;s too intimidating, <a href="https://ti.to/libscie">we also do cohort trainings</a> where we get a group of people together to just learn over several weeks, in predetermined hour-long slots, where people can also then you know, take the space and just say, okay, that&rsquo;s when I&rsquo;m going to learn this. And never outside of it.</p>


<h3 id="hahahugoshortcode-s46-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>&ldquo;Time-boxing&rdquo; is the phrase that I&rsquo;ve heard for that. But for me, time-boxing only works when there&rsquo;s an external constraint.</p>


<h3 id="hahahugoshortcode-s47-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>That&rsquo;s the nice thing with live streaming.</p>


<h3 id="hahahugoshortcode-s48-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Exactly! But that also takes a lot of courage. So maybe some time I will emulate your courage in doing that. Anyway, thank you so much for speaking with us!</p>


<h3 id="hahahugoshortcode-s49-hbhb-chris-hartgerink"><figure class="round-figure"><img src="/img/blog/researchequals/chris_hartgerink.png"
         alt="Avatar of Chris Hartgerink"/>
</figure>
 Chris Hartgerink 
</h3>
<p>Enjoy the rest of your day, and then have a nice rest of your week.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Chris Hartgerink, the founder of Liberate Science, discusses why and how they integrated ROR into the modular publishing platform ResearchEquals for author affiliations in user profiles and Crossref DOIs and explains why they live streamed all eight hours of the work.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR as Community-supported Infrastructure</title><id>https://doi.org/10.71938/60z7-yd83</id><link rel="alternate" href="https://ror.org/blog/2023-07-24-community-supported-infra/"/><published>2023-07-24T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>As we have crossed the mid-point of 2023, we&rsquo;re taking a moment to reflect on what has already been a very busy year for ROR!</p>
<ul>
<li>
<p>ROR usage continues to skyrocket, with the ROR API seeing more than 23 million requests every month.</p>
</li>
<li>
<p><a href="/community/#whos-using-ror">ROR integrations</a> are continuing to grow, and we&rsquo;re showcasing specific implementations in a new <a href="/tags/case-studies/">case study series</a> on the ROR blog.</p>
</li>
<li>
<p>The <a href="/about//impact">number of ROR IDs available in Crossref and DataCite</a> is steadily increasing, which also feeds downstream sources like <a href="https://openalex.org/">OpenAlex</a> and <a href="https://commons.datacite.org/">DataCite Commons</a>.</p>
</li>
<li>
<p>ROR IDs are being recommended in a <a href="https://www.rd-alliance.org/groups/national-pid-strategies-wg">growing set of national PID strategies</a> and in ongoing discussions and recommendations related to the <a href="https://www.whitehouse.gov/wp-content/uploads/2022/08/08-2022-OSTP-Public-Access-Memo.pdf">August 2022 OSTP memo</a>, including in feedback submitted in response to United States federal agencies&rsquo; public access policies (see, for example, the <a href="https://www.arl.org/news/arl-comments-on-nih-plan-to-enhance-public-access-to-results-of-nih-supported-research/">Association for Research Libraries&rsquo; response to the NIH draft policy</a>).</p>
</li>
</ul>
<p>In addition, we&rsquo;ve <a href="https://github.com/ror-community/ror-updates/releases">published 10 new data releases</a> since the beginning of the year in response to community feedback about registry additions and updates, and as part of ongoing metadata quality checks on existing registry records. ROR now includes more than 105,000 IDs and associated metadata records, and new registry updates are available on a rolling basis, approximately 1-2 times per month.</p>
<p>All of this progress is happening - and will continue to happen - because organizations are supporting ROR&rsquo;s growth, guiding its development, and investing in its long-term sustainability.</p>


<h3 id="supporting-ror-as-a-community">Supporting ROR, as a community 
</h3>
<p><a href="/blog/2022-10-10-strengthening-sustainability/">As we&rsquo;ve described previously</a>, ROR&rsquo;s sustainability model is based on a shared resourcing commitment by ROR&rsquo;s three operating organizations - California Digital Library, Crossref, and DataCite - to fund ROR&rsquo;s core operating expenses over the long term. This commitment ensures a baseline level of funding for ROR, enables us to make the service freely available for everyone, and minimizes dependence on unpredictable or time-limited sources of funds.</p>
<p>To supplement this baseline resourcing, ROR continues to receive outside investment by stakeholder organizations. These additional funds allow ROR to accelerate and scale our work and activities, and to invest in strategic time-limited projects. Community funding also showcases the importance of investing in core open infrastructure that all can benefit from regardless of their ability to pay for it.</p>


<h3 id="core-infrastructure-for-other-core-infrastructure">Core infrastructure for other core infrastructure  
</h3>
<p>The latest example of community investment in ROR comes from ORCID. Last week, <a href="https://info.orcid.org/orcid-increases-financial-support-for-ror/">ORCID announced</a> that it would be increasing its financial contribution to ROR to support ROR&rsquo;s continued growth. ORCID is a long-time supporter of ROR, having been involved in the early planning stages that defined the initial vision for ROR, and active in community activities ever since then through our community advisory group and steering group. ROR is now the core organization identifier supported in ORCID and makes it possible for ORCID to normalize affiliation information against an open and interoperable identifier without being reliant on proprietary data. As ORCID&rsquo;s Executive Director Chris Shillum explains, ROR plays a key role in the organization&rsquo;s overall strategy: &ldquo;Having a reliable way to identify organizations helps ORCID users and members in multiple ways, as a growing number of use cases clearly demonstrate. ORCID commends the commitment made by CDL, Crossref and DataCite in 2022 to jointly fund ROR&rsquo;s core operating costs. As a fellow scholarly infrastructure organization, we at ORCID feel it is also our responsibility to contribute to ROR&rsquo;s sustainability on behalf of our community.&rdquo;</p>


<h3 id="supporting-ror-through-global-coalitions">Supporting ROR through global coalitions 
</h3>
<p>ROR has also been receiving investments from organizations, library consortia, and national research offices contributing to sustainable open science infrastructure since being <a href="blog/2022-11-22-scoss-selects-ror/">selected by SCOSS</a> in November 2022. This selection identifies ROR as essential open infrastructure, and as part of <a href="https://scoss.org/4thpledgingroundannouncment/">SCOSS&rsquo;s fundraising campaign</a>, organizations are encouraged to invest in noncommercial scholarly infrastructure initiatives to advance open knowledge. Nine months into the SCOSS campaign, we wanted to acknowledge the organizations that have stepped up to support ROR and highlight some of their reasons for investing in the open infrastructure that ROR provides and enables.</p>
<ul>
<li>
<p><a href="https://www.nwo.nl/">Dutch Research Council</a> (NWO) (Netherlands)</p>
</li>
<li>
<p><a href="https://consortium.ch/">Consortium of Swiss Academic Libraries</a> (CSAL) (Switzerland)</p>
</li>
<li>
<p><a href="https://www.crkn-rcdr.ca/">Canadian Research Knowledge Network</a> (CRKN) (Canada)</p>
</li>
<li>
<p><a href="https://www.caul.edu.au/">Council of Australian University Librarians</a> (CAUL) (Australia)</p>
</li>
<li>
<p><a href="https://www.kb.dk/">Royal Danish Library</a> (Denmark)</p>
</li>
<li>
<p><a href="https://www.slub-dresden.de/">SLUB Dresden</a> (Germany)</p>
</li>
<li>
<p><a href="https://www.ouvrirlascience.fr/">French National Funds for Open Science</a> (FNSO) (France)</p>
</li>
</ul>
<p>For these organizations, ROR represents an opportunity to advance strategic goals around identification and tracking of research. &ldquo;Persistent identifiers are key to a robust scholarly infrastructure that makes the research process more efficient, clear, and meaningful,&rdquo; said Clare Appavoo, CRKN Executive Director. &ldquo;CRKN is a proud supporter of persistent identifiers through the ORCID Canada and DataCite Canada consortia, and we&rsquo;re pleased that ROR is a recipient of our member contributions to SCOSS in this funding round.&rdquo;</p>
<p>Open identifiers are especially important in the changing landscape of research dissemination. Angus Cook from the Council of Australian University Librarians described the importance of supporting infrastructure that supports open access:</p>
<p>&ldquo;As consortiums and libraries expand services through publishing activities, in addition to providing access to content, supporting infrastructure that supports open access becomes increasingly important. One aspect of agreement management that becomes more critical as consortiums transition their agreements to open access, is the correct identification of participating institutions. Correct identification of institutions is a vital part of the workflow in ensuring authors are affiliated with the correct entity. As part of CAUL&rsquo;s support for OA agreements and supporting infrastructure, we are keen to see open and accessible identifiers, such as ROR, being adequately resourced. Identifiers, like many standards, should be openly accessible and freely available.&rdquo;</p>


<h3 id="why-cdl-crossref-and-datacite-are-supporting-ror">Why CDL, Crossref, and DataCite are supporting ROR 
</h3>
<p>ROR is also a key component of each of its operating organization&rsquo;s strategic objectives. ROR was identified by Crossref members as a top priority back in 2019, and ROR IDs play an important role in Crossref&rsquo;s vision of the <a href="https://www.crossref.org/documentation/research-nexus/">research nexus</a> and its aims to make scholarly information easier to find, assess, and connect. For DataCite&rsquo;s <a href="https://datacite.org/documents/DataCite_strategic_plan_2022_2025.pdf">strategic focus</a> on identifying and connecting knowledge, ROR IDs are key to establishing connections in DOI metadata to enable meaningful insights about how research is conducted and used. For CDL, investing in open metadata for affiliations aligns with a <a href="https://osc.universityofcalifornia.edu/2022/08/pathways-to-oa-open-infrastructure/">multi-faceted approach</a> to make UC research - including the underlying metadata - more openly available. For example, identifying UC research to be made openly available requires accurate identification of UC-affiliated authors.</p>


<h3 id="a-commitment-to-open">A commitment to open 
</h3>
<p>As an early and continued <a href="https://ror.org/blog/2020-12-16-aligning-ror-with-posi/">signatory to the Principles of Open Scholarly Infrastructure (POSI)</a>, ROR is committed to following best practices to ensure its long-term openness, availability, and sustainability. ROR&rsquo;s operations are specifically structured to maximize the openness and availability of its data and protect against risks, vulnerabilities, and unpredictable circumstances that can befell other initiatives that don&rsquo;t have similar safeguards in place. This is an especially crucial time to understand the importance of what it means to be open, and we are proud to be in the company of other POSI-aligned infrastructures committed to following similar practices.</p>
<p>We are grateful to all of the organizations that have been supporting ROR in one way or another over the years, and in the years to come.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">As we have crossed the mid-point of 2023, we&amp;rsquo;re taking a moment to reflect on what has already been a very busy year for ROR!</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: How Europe PMC Uses ROR</title><id>https://doi.org/10.71938/w0js-9z14</id><link rel="alternate" href="https://ror.org/blog/2023-06-30-europe-pmc-case-study/"/><published>2023-06-30T00:00:00+00:00</published><updated>2026-03-10T08:36:41-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Michael Parkin, data scientist at EMBL-EBI who helps maintain Europe PMC, explains why and how ROR  helps with assessing funding impact and tracking researcher collaborations.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;As I mentioned, we can disseminate these ROR IDs, because we&rsquo;ve added them to our own API. When you look at a grant record programmatically, you&rsquo;ll see the ROR ID as part of that API response. And they are added to the Wellcome grant DOI metadata as well, so that&rsquo;s another nice way of making this available to others. I&rsquo;m a data scientist, so I love that. The more data the better. It&rsquo;s great to have that kind of information so that people like me can come along afterwards and do some cool stuff that you wouldn&rsquo;t be able to do otherwise.&rdquo;</p>
<p>&ldquo;For now, on the ROR community calls, it always cheers me up to see that bump in ROR IDs in Crossref metadata: &lsquo;Oh, that&rsquo;s the Wellcome grant DOIs we registered.&rsquo; Although I enjoy that, I really hope that in a couple of years that&rsquo;s a tiny little blip because publishers have put so many ROR IDs in Crossref journal metadata that you can&rsquo;t even see that on the graph. That would be really nice.&rdquo;</p>
<p>&ldquo;The way that you&rsquo;re always inviting the community to provide feedback on things like schema changes, that&rsquo;s excellent, partly because it lets people know what&rsquo;s coming up, and lets you object if you have a strong opinion about that: &lsquo;Oh no, that doesn&rsquo;t work for us, have you considered that this might mess up this kind of system.&rsquo; I just think it&rsquo;s a really nice way of doing things. But it is very hard to do and do well, and I think ROR does it very well.&rdquo;</p>
<p>&ndash; Michael Parkin, Data Scientist, Content, EMBL-EBI</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Hello! If you could, please start by telling us your name, title, and organization.</p>


<h3 id="hahahugoshortcode-s2-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>My name is Michael Parkin. I&rsquo;m a data scientist at <a href="https://www.ebi.ac.uk/">EMBL-EBI</a>, the European Molecular Biology Laboratory’s European Bioinformatics Institute. I work on the <a href="https://europepmc.org/">Europe PMC</a> team.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Can you tell us about the relationship between EMBL-EBI and Europe PMC?</p>


<h3 id="hahahugoshortcode-s4-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>There&rsquo;s basically three levels. The parent organization is EMBL, the European Molecular Biology Laboratory. It&rsquo;s headquartered in Heidelberg, Germany, and consists of six sites around Europe. There&rsquo;s a big research center in Heidelberg, a synchrotron facility in Hamburg, a research institute outside of Rome, another one in Grenoble, and one in Barcelona. The sixth site is EMBL’s European Bioinformatics Institute (EMBL-EBI), based in the UK. All of these sites combine to make EMBL, which is a large, intergovernmental, not-for-profit organization, and Europe’s life sciences laboratory.</p>
<p>A lot of the EMBL organization is doing primary research and wet-lab work. Here in the UK, we are mostly on the data side of things. EMBL-EBI is split into two parts. Half of the organization runs data services and the other half is doing research, so some of my colleagues are bioinformaticians. I work in the services side, where we have many teams that look after life science databases, which are freely-accessible to anyone with an internet connection. For example, you run a research study that generates a lot of genomic data, and you want to put that data somewhere. EMBL-EBI will have a resource for that and an associated team that looks after it. Most of the teams manage these big biodata type resources.</p>
<p>My team is a little bit unusual because we are the literature team; literature is our data point. Our team maintains Europe PMC, which is a life science literature database. So there are a lot of similarities to the other kinds of big data that are stored at EMBL-EBI, but we&rsquo;re a bit different because we&rsquo;re focusing on the literature outputs rather than the data outputs that are generated by research. We are funded by EMBL itself, as well as 37 other funding organizations, mostly in the UK, but also across Europe and more globally.</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>The volume of scholarly literature today makes it essentially big data.</p>


<h3 id="hahahugoshortcode-s6-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>Yeah, it&rsquo;s gotten increasingly bigger over time. The biggest data source at Europe PMC is <a href="https://pubmed.ncbi.nlm.nih.gov/">PubMed</a>. We have a faithful copy of everything that&rsquo;s in PubMed. That&rsquo;s approaching 40 million abstracts, in the mid-30s at the moment. And really the genesis of Europe PMC was around collaboration and data sharing with the NCBI, the National Center for Biotechnology Information, which hosts PubMed and <a href="https://www.ncbi.nlm.nih.gov/pmc/">PubMed Central</a>, a database for full-text copies of journal articles. I think there&rsquo;s over 7 million of those. So EMBL-EBI is really well positioned. We have a lot of computer power and data storage, either on site here outside of Cambridge or within the UK. And the volume is significant. So it does fit well within the scope of these databases, and one of the nice things, of course, is they&rsquo;re all public access. It&rsquo;s just kind of really a nice thing. The idea is that it&rsquo;s for the research community. And so at the core, this is a public good.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. When and where did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s8-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I started at EMBL-EBI in 2016, and a couple of years into my time here I was getting interested in PIDs, and there was the opportunity to go to the PIDapalooza Conference. I went to the one in Girona in 2018, and I remember there was an open session, and I think it was someone from Crossref who gave a talk to say &ldquo;watch this space&rdquo; in terms of organizational identifiers. I don&rsquo;t think the name &ldquo;ROR&rdquo; had been chosen at that point. There were actually many presentations at PIDapalooza 2018 about organization identifiers, whether from a librarian’s perspective or publisher’s. But I think there was just a few minutes’ long talk to say that Crossref and ORCID were working on organization identifiers.</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>You are in fact correct! I was not there, but having studied the <a href="/about/history">history of ROR</a>, I know a great deal of the technical work that did become ROR shortly thereafter was going on then. ROR&rsquo;s official launch date is January 1st, 2019, and it was the Crossref technical team that really did a lot of that work to stand that up. So I&rsquo;m perfectly certain that in 2018 they were working on it, and I do think the ROR name was chosen in late 2018 or something like that.</p>


<h3 id="hahahugoshortcode-s10-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I feel like I would have remembered some lions!</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>You&rsquo;re talking about the pre-lion period!</p>


<h3 id="hahahugoshortcode-s12-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>There were a lot of talks at the conference about the use of organizational IDs, and the idea of something coming from organizations like Crossref and ORCID, who we know a lot about and often collaborate with, was a really cool thing. I was kind of interested to see where it went, so I guess that that&rsquo;s where I first heard about what became ROR. Perhaps that was the first announcement to the public.</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Well, there were working groups from 2016 to 2018, really about three years of discussions and working groups and plans to try to stand it up, and then the official launch in 2019. And now here we are! Who were the primary advocates of implementing ROR at Europe PMC?</p>


<h3 id="hahahugoshortcode-s14-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>There are two main angles to this. As I mentioned, Europe PMC is funded by funders of life science research, and funders typically have a strong desire to track the outputs of their funding. They want to see the impact their funding has. And one aspect of that is to look at journal publications. That&rsquo;s often the way scientists communicate their results, and those results are an outcome of the funding that they were awarded by a particular grant. Part of the puzzle of that is, &ldquo;Well, which institutions does this funding go to?&rdquo; That&rsquo;s a key thing to keep track of, and it gets very, very difficult to track when affiliations in publications are captured just as strings of text; it&rsquo;s very messy. So there&rsquo;s a desire from our funders that Europe PMC has some way of disambiguating institution names as a piece of the impact assessment puzzle. That&rsquo;s one of the reasons we want to be able to do closer inspection of institutions via IDs.</p>
<p>The other side is that EMBL, the organization that we&rsquo;re a part of, produces a lot of its own research, and EMBL wants to know where EMBL-associated researchers are publishing, and especially, who are they collaborating with? That&rsquo;s a really key thing for us. It&rsquo;s, &ldquo;Where are the networks between EMBL researchers and other institutions around the world?&rdquo; And again, one way that gets measured is by looking at journal articles, finding which ones have been published by groups in our organization and the other institutions they worked with. Again, really hard to do when these affiliations are just strings of text. So there&rsquo;s also a desire from our organizational side, to be able to have better information about which institutions are which, and making networks and cross-connections between the two. We do a lot of that with ORCID, and we want to do the same with ROR and institutions. Those are the two key drivers, one side for the funders for research impact assessment, and one from our own organization, knowing who&rsquo;s doing the research and where the collaborations are. Those have been the main use cases for us.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. I&rsquo;m going to ask a follow-up question, because I have heard from others that they are particularly interested in this collaboration question, finding out who their researchers are collaborating with. Why is that an important thing that people want to know? What are you going to do with that information once you can find it out?</p>


<h3 id="hahahugoshortcode-s16-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I don&rsquo;t know too much about the details, because our team generates the data for others at EMBL to digest, and EMBL uses this information in a number of ways. But essentially, it showcases the collaborative nature of our work to our funders. EMBL helps to create links and connections between different research areas in the life sciences, so being able to show the number, scale and scope of our collaborations is one way of demonstrating our impact. I think that&rsquo;s where EMBL sees this collaboration information as being valuable.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s helpful, actually. It&rsquo;s something that I&rsquo;ve heard some people who are implementing ROR say, that that&rsquo;s part of why they&rsquo;ve implemented ROR, is to track research collaborations. Universities want to track their own institution&rsquo;s output, which is surprisingly difficult, but they also want to know who their researchers are collaborating with. So I think it&rsquo;s similar to what you&rsquo;re talking about: Are our researchers in this American institution collaborating only with European researchers? Or are they, you know, really taking a global view and collaborating as much as they could be or should be with researchers in Asia or researchers in South America. My sense is that that&rsquo;s why they want to know that.</p>


<h3 id="hahahugoshortcode-s18-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I expect these collaborations are usually not something individual researchers worry about too much. But at the higher strategic level this becomes an important question to address. Maybe eventually it does feed down to the researchers, and it will steer the collaborations: &ldquo;Well, you know, we&rsquo;re not really doing a lot of work with, say, South America, maybe some project around there or some collaborative network would be good.&rdquo; Publications being just one thing – it helps you see whether that collaboration is fruitful or not.</p>


<h3 id="hahahugoshortcode-s19-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Describe for us what you&rsquo;ve done with ROR at Europe PMC. I think you&rsquo;ve done several things, actually.</p>


<h3 id="hahahugoshortcode-s20-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I&rsquo;ll talk more about the more recent one. But the initial work we did, which is already <a href="/community/#adopters">listed on the ROR website</a>, was part of a European Commission-funded project called <a href="https://www.project-freya.eu/mission.html">FREYA</a>, which was a successor to the <a href="https://cordis.europa.eu/project/id/654039">THOR project</a>. They’re committed to Norse mythology! The FREYA project was all about PIDs and PID interconnectivity, and as part of that we were looking at publications. It&rsquo;s important to mention that in Europe PMC we have our publications world, which is most of what we do, and we also have the grants world, which is exclusively for the Europe PMC funders. I&rsquo;ll just make that distinction now, because I&rsquo;ll talk more about the grants side of things.</p>
<p>But the initial work we did with ROR was on the publication side of things, and it was exactly this idea that our organization, EMBL, wanted to have publication data about EMBL sites, how many papers they were publishing and which organizations they were collaborating with. We had a fantastic intern on the team at the time who built <a href="https://gitlab.ebi.ac.uk/literature-services/public-projects/ROR-proto-EMBL">a machine learning model that would look at the affiliations in publications</a> and pick out ones that had an EMBL site affiliation on them. As I mentioned, there&rsquo;s a few of them around Europe, the sites <a href="https://ror.org/01yr73893">outside of Rome</a>, in <a href="https://ror.org/01zjc6908">Grenoble</a>, in <a href="https://ror.org/03mstc592">Heidelberg</a>, in <a href="https://ror.org/050589e39">Hamburg</a>, and luckily there&rsquo;s a ROR ID for each of those, which is fortunate, because I can assign papers to individual institution locations.</p>


<h3 id="hahahugoshortcode-s21-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And if there weren&rsquo;t a ROR ID, you could request one.</p>


<h3 id="hahahugoshortcode-s22-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>That was one of our initial questions, and I think it&rsquo;s <a href="/about/faqs/#does-ror-support-relationships-and-hierarchies">a question you get a lot</a>, &ldquo;What is the hierarchical structure?&rdquo; Because you can go down to the department level of a university, say, and get really granular. For us in EMBL, each site has its own ROR ID, which is very beneficial, but they&rsquo;re also written fairly similarly. They all have the words &ldquo;European Molecular Biology Laboratory&rdquo; in the name, and then you&rsquo;re looking for basically which city it&rsquo;s in. So that was the specific project: building this <a href="https://gitlab.ebi.ac.uk/literature-services/public-projects/ROR-proto-EMBL">machine learning model that can look at affiliations in articles and pull out the ones that are from our wider organization</a>, and then we can track things like collaborations with other countries. That was a key part of what the strategic team here wanted to do. That wasn&rsquo;t my work at all, and I don’t take any credit.</p>
<p>The project I&rsquo;ve been working on more recently was in our grants world. Moving away from the publications, where there are forty-something million records, to the grants world, where it&rsquo;s much smaller: there are about 70 to 80 thousand grants. It&rsquo;s a lot more manageable. And so what we&rsquo;ve been doing is attaching ROR IDs to the institutions that were funded by grants from the Europe PMC funders. Our grant data looks not too dissimilar from a journal article, really. It&rsquo;s got a title. It&rsquo;s got an abstract. It&rsquo;s got a person associated with it, but instead of being an author, we call them PIs, principal investigators, and they are attached to an affiliation of some type. Usually it&rsquo;s the university, but it can be companies, or it can be charities. I recently saw one that was a cathedral. It&rsquo;s a real spread of places these PIs can be at.</p>


<h3 id="hahahugoshortcode-s23-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes! I happened to see the <a href="https://ror.org/01m3vxn57">ROR ID for York Minster</a> the other day, which is the enormous cathedral in York.</p>


<h3 id="hahahugoshortcode-s24-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>That&rsquo;s a lovely cathedral, actually.</p>
<figure><img src="/img/blog/europepmc/York_Minster_Cathedral.jpg"
         alt="York Minster, North Yorkshire - from the City Walls, 2013, JackPeasePhotography, CC BY 2.0, via Wikimedia Commons"/><figcaption>
            <p>York Minster, North Yorkshire - from the City Walls, 2013, JackPeasePhotography, CC BY 2.0, via Wikimedia Commons.</p>
        </figcaption>
</figure>

<p>So there&rsquo;s a real spread of organizations, and the data comes to us through spreadsheets. Some of our funders are relatively small organizations, perhaps with a small number of staff, and anything overly technical is not going to be fit for purpose. We have a spreadsheet where they can fill out all the grant details, and one of the columns of the spreadsheet is the organization that the PI is at, and over the last decade and a half, there&rsquo;s been a lot of variation in the way that those affiliations get listed. We&rsquo;ve had the University of Cambridge spelled in about eight different ways, with those including lowercase, uppercase, commas in different places. On our system, if you type &ldquo;Cambridge,&rdquo; into the auto-complete menu, you get six or seven variations of &ldquo;Cambridge University,&rdquo; so &ldquo;the University of Cambridge,&rdquo; &ldquo;Cambridge, comma, University.&rdquo; It&rsquo;s terrible.</p>
<p>So what we were looking for is a way to normalize those. This is probably a very classic use case of ROR here. We&rsquo;ve wanted to do this for a very long time, actually. This is not a new problem. It gets increasingly worse as time goes by and funders find new ways to write different names for universities and such. We wanted to combine these all, and attaching these affiliations to a ROR ID is a really, really neat way of doing it. On the one hand we get a PID, which is wonderful, something we can share with others and disseminate. But also you have a very well curated registry, which happens to provide a suitable single affiliation name. I think it&rsquo;s called the primary name in ROR, but we call it internally the “official” name. And, as you know, the ROR registry gets updated. I imagine if some university decides &ldquo;Actually, I prefer if you use the Spanish spelling of our name,&rdquo; that would be updated, and we&rsquo;d incorporate that into our system.</p>
<figure><img src="/img/blog/europepmc/europe-pmc-grant-finder-ror.png"
         alt="Screenshot of ROR affiliation picker in Europe PMC grant form."/><figcaption>
            <p>Selecting an affiliation based on ROR integration in Europe PMC Grant Finder.</p>
        </figcaption>
</figure>

<p>What we&rsquo;ve been working on is taking our very, very long list of institution names in Europe PMC grant data, many of which are copies of the same institute, just spelled differently, and matching them to ROR IDs. We use the ROR API to do that. We have our machine learning model for publications, but the grant data is a little different. It&rsquo;s a little bit more straightforward, actually, because these institution text strings tend to just be the institution name, whereas in publications you get postcodes, countries, cities&hellip; Sometimes you get two affiliations merged together in the same bit of text. Luckily for us, with Europe PMC grant data, it&rsquo;s usually just something as simple as &ldquo;the University of Cambridge&rdquo;.</p>
<p>So we basically took all institution names, ran them through the ROR API, and had a threshold for matching. The ROR API, beautifully, <a href="https://ror.readme.io/docs/api-affiliation">gives you a score</a>. It tells you how it matched, what kind of similarity was used. We came up with a threshold where &ldquo;Anything below this, we&rsquo;re just going to leave this.&rdquo; We never had the intention that we&rsquo;d get 100% coverage, because although you do have York Minster, for example, in the ROR registry, we didn&rsquo;t expect everything to be there. I was actually very surprised at how many companies appeared in the ROR registry. I wasn&rsquo;t expecting that, but there are many companies that do research and receive external funding.</p>


<h3 id="hahahugoshortcode-s27-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I was surprised by that as well when I first came to ROR. I was like, &ldquo;Wow, there are a lot of companies,&rdquo; you know, and I&rsquo;d see one and say, &ldquo;I&rsquo;ve never heard of this company,&rdquo; and then I&rsquo;d Google them and discover &ldquo;Oh, yes, they do actually do quite a bit of research.&rdquo; Especially pharmaceutical research, biomedical research. There are lots of small companies, as well as large companies.</p>


<h3 id="hahahugoshortcode-s28-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>We knew there&rsquo;d be some things that we would never expect to be in ROR, so we never went in with an idea that we need to make sure that every single institution we have has a ROR ID associated with it. That just seemed wildly optimistic. And so as we went through this process, we did check everything manually. That was kind of inevitable, that we&rsquo;d need to do some sort of manual step.</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And what kind of volume are we talking about here?</p>


<h3 id="hahahugoshortcode-s30-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I think there&rsquo;s about 4,000 ROR IDs that we added, so that&rsquo;s quite a bit. A reasonable amount of checking. I did it over several days to not burn my brain out. But we prioritized institutions that came up a lot in our grants data, and I think I&rsquo;m right that the top institution was the University of Oxford. That was the most frequently occurring institution I think that was linked to several hundred grants, something like four or five hundred grants just from the University of Oxford, so actually a reasonable percentage of the total. We went from the top down, looking at how many grants are associated with each institution and prioritizing assigning a ROR ID to the top ones, because there are an awful lot of institutions that just appear once. We tried to get as much coverage as we could for as little time investment as possible. But over time we will revisit this kind of exercise. This isn&rsquo;t an automated process. It was a kind of one-off task. We&rsquo;ll revisit it, and try to bolster our ROR coverage.</p>
<p>But what we&rsquo;re hopeful for, and I&rsquo;m even more hopeful on the publication side of things, is to get funders to provide us with the ROR IDs. One funder does, which is wonderful, actually, that they go to the trouble on their end of figuring out the ROR IDs for these institutions that they&rsquo;re funding, and then they pass those to us. But we don&rsquo;t make that a requirement, because as I mentioned, some of these funders are small, and that&rsquo;s a large burden. We really want their grant data, so we don&rsquo;t require the ROR ID. In fact, we don&rsquo;t mandate a lot of grant metadata, to try and make it as simple as possible.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We recommend that people don&rsquo;t require ROR IDs actually, because sometimes there are organizations which are entirely legitimate for your use case, but are out of scope for ROR, or there are organizations that are in scope for ROR, but happen not to be in it yet. And while we do have a quite robust update mechanism right now, it still can be weeks before something gets in which is not always congruent with people&rsquo;s timelines for what they need to do.</p>


<h3 id="hahahugoshortcode-s32-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>We ask for ORCIDs as well. They are relatively well used, I think. At least one of our funders mandates ORCIDs for their funded researchers.</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It makes more sense to me to mandate ORCIDs, because it&rsquo;s really not that difficult for an individual to just go sign up for one, whereas because ROR is curated, you&rsquo;re reliant on somebody else to create that for you.</p>


<h3 id="hahahugoshortcode-s34-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>Yes. It would be nice if all the funders were providing this data to us at the point the grant enters our system. Then we&rsquo;d never have to do this process. We&rsquo;d just take those ROR IDs and add them to the system. But that&rsquo;s not practical, so this was kind of a jump start. We went from around 10% of institutions in our dataset having a ROR ID, to about 80%. That’s a nice boost.</p>
<p><strong>And as I mentioned, we can disseminate these ROR IDs, because we&rsquo;ve added them to our own API. When you look at a grant record programmatically, you&rsquo;ll see the ROR ID as part of that API response. And as you&rsquo;ve mentioned in your talks, they go into the Wellcome grant DOI metadata as well, so that&rsquo;s another nice way of making this available to others. I&rsquo;m a data scientist, so I love that. The more data the better. It&rsquo;s great to have that kind of information so that people like me can come along afterwards and do some cool stuff that you wouldn&rsquo;t be able to do otherwise, or that would certainly be a lot more arduous when you have to sift through eight different variations of &ldquo;the University of Cambridge&rdquo; and somehow figure out that those are all the same.</strong></p>


<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And do that for 4,000 institutions.</p>


<h3 id="hahahugoshortcode-s36-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>It&rsquo;s a punishing task for everyone to do that themselves. It makes much more sense for an organization to take responsibility for doing that, and then you can share those results with everyone. So that&rsquo;s the grants integration. That&rsquo;s quite different from the publications one, but the latter is ultimately where we want to be, because that&rsquo;s the main data at Europe PMC – journal articles and preprints and other associated literature. I&rsquo;m just really hoping that although maybe publishers don&rsquo;t benefit hugely from putting ROR IDs in their journal workflows, it is extremely useful for everyone else who comes afterwards, including our team, but also for funders, for instance, it would be immensely useful. I know there are a few publishers already doing this. I think eLife and Hindawi at least.</p>


<h3 id="hahahugoshortcode-s37-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yep.</p>


<h3 id="hahahugoshortcode-s38-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>Europe PMC gets a lot of full-text XML files from PubMed Central. I&rsquo;d love to see ROR IDs captured in those full-text documents, because then we can reliably start pulling them out. Both of these sorts of initiatives I&rsquo;ve described are one-off tasks in which we were manually adding a ROR ID to something that already existed. I&rsquo;d really love to see these pipelines automatically pulling these PIDs in. But it relies on the publisher or the funder to do the work. Because it is work, adding these ROR IDs to your system.</p>


<h3 id="hahahugoshortcode-s39-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Absolutely. And I think there&rsquo;s a lot of momentum in that regard. I think if we talk a year from now, there will be a much greater percentage of those publications that have ROR in their data. We know of a lot of service providers who are adding ROR and publishers who are adding ROR. And we would love to see PubMed&rsquo;s systems officially support ROR.</p>


<h3 id="hahahugoshortcode-s40-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>PubMed has supported ORCIDs for quite a few years now. <strong>For now, on the ROR community calls, it always cheers me up to see that bump in ROR IDs in Crossref metadata: &ldquo;Oh, that&rsquo;s the Wellcome grant DOIs we registered.&rdquo; Although I enjoy that, I really hope that in a couple of years that&rsquo;s a tiny little blip because publishers have put so many ROR IDs in Crossref journal metadata that you can&rsquo;t even see that on the graph. That would be really nice. I&rsquo;ll be a little bit sad, because once upon a time, you know, Europe PMC had a reasonable contribution to adding ROR IDs to Crossref. But I really do hope in time that you can&rsquo;t even see it.</strong></p>


<h3 id="hahahugoshortcode-s41-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, in early August 2022, Europe PMC made a very large spike in the DOIs with ROR IDs in Crossref.</p>
<div class='centered'><figure><img src="/img/blog/europepmc/crossref-ror-ids-june-2023.png"
         alt="Number of Crossref records with ROR IDs over time."/><figcaption>
            <p>Number of Crossref records with ROR IDs over time. See the full <a href="https://docs.google.com/spreadsheets/d/1RKc4HM2K3mXvlMuQt-pR4xXgQiOC2B8rMuaddBqEaMg/edit?usp=sharing">spreadsheet of ROR DOI statistics</a> for API queries and additional data.</p>
        </figcaption>
</figure>
</div>


<h3 id="hahahugoshortcode-s43-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>Yes, and it&rsquo;s basically as a direct result of this kind of matching process that we did, so for now it&rsquo;s substantial. I really hope that publishers will embrace this and start putting ROR IDs in Crossref. And again, then people like me can do some really cool stuff.</p>


<h3 id="hahahugoshortcode-s44-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What do you hope ROR does in the future? What problems should we fix? What opportunities should we pursue?</p>


<h3 id="hahahugoshortcode-s45-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>Phew. For the problem it fixes, as it is now it&rsquo;s a very good system.</p>


<h3 id="hahahugoshortcode-s46-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Bug fixes? Feature requests? Suggestions?</p>


<h3 id="hahahugoshortcode-s47-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I mean, the only issues that I had, have kind of been resolved already. They just arrived a little too late for us. The way we get ROR information into our system is to use those data dumps in Zenodo. What we do is we download the JSON file and we parse it out. We&rsquo;re actually only interested in a very small bit of information: all we want is the ROR ID and what the ROR registry considers the primary name. There&rsquo;s a lot of stuff in ROR that&rsquo;s really interesting. In the future I&rsquo;d love to do more with the data that&rsquo;s in there. But for now our use case is fairly limited. We would ideally not have to work with the full JSON dump, but now I think there&rsquo;s a new CSV copy which we&rsquo;ll probably switch to because we just need to take two columns. With JSON, we use this command line tool called &ldquo;jq,&rdquo; and it&rsquo;s a little bit more involved. But unfortunately the CSV file didn&rsquo;t exist when we were starting all of this.</p>


<h3 id="hahahugoshortcode-s48-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We just <a href="https://ror.readme.io/changelog/2023-03-16-csv-data-dump">released that in March</a>.</p>


<h3 id="hahahugoshortcode-s49-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>So that was not a problem, but that was a minor inconvenience that you&rsquo;ve now sorted. And then the other thing would be around improvements to the <a href="https://ror.readme.io/docs/api-affiliation">matching API</a>, but I think that has also <a href="https://ror.readme.io/changelog/2022-10-24-affiliation-matching-improvements-api-only">been done</a>. At the time the documentation was not so comprehensive. Now I think it is. It&rsquo;s very clear <a href="https://ror.readme.io/docs/matching">what to do and the output you&rsquo;re going to get</a>. At the time I was doing this a year and a half ago it was a bit less clear, but that&rsquo;s been improved, too. So in terms of the user experience, it&rsquo;s very good.</p>
<p>Longer term&hellip; I mean, ORCID is interesting because you have the PID, but you also have the ORCID record, and I can claim various things to it. The ROR registry does not have the claiming aspect. I wondered if that was a desirable thing or not, because in my head, ORCID and ROR solve the same problem for a different type of thing. But the ORCID organisation has all this flexibility around claiming various bits and pieces to your ORCID record. That proves to be really useful when you want to make connections between things, and we use ORCID as a source of figuring out which of EMBL researchers have published, what grants do they hold, that kind of thing. We actually use their ORCID record to create staff profiles, so you get the little headshot and the list of research publications is just pulled out of the ORCID record for that person. That doesn&rsquo;t exist for the institution level. The institution doesn&rsquo;t have the ability to add details to that ROR record. I suspect that is just not in the genetics of ROR. But I wondered if that&rsquo;s something that there&rsquo;s interest in. There&rsquo;s a whole world of complexity, though.</p>


<h3 id="hahahugoshortcode-s50-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It&rsquo;s certainly not something that&rsquo;s on our roadmap now. But what&rsquo;s funny is that we do get a lot of people sort of assuming that that&rsquo;s the way ROR works, that we are some kind of membership organization or that as an organization you need to register with us in order to update your record.</p>


<h3 id="hahahugoshortcode-s51-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>That you need an account, otherwise it won&rsquo;t appear.</p>


<h3 id="hahahugoshortcode-s52-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right, and none of that is true. Anyone can submit a form and request a change to any record. I&rsquo;ve certainly done that with organizations I <em>used</em> to be affiliated with, and I said, &ldquo;Oh, you know, this information should include X, Y, and Z.&rdquo; But we do actually give a bit of preference to people who are from a particular organization. It&rsquo;s fairly informal: when we turn things into GitHub issues, we tag it, if it&rsquo;s a person <em>from</em> that organization who wants to make a change to that organization record. And of course that&rsquo;s very common. It is usually people&rsquo;s own organizations that they want to update.</p>
<p>With 105,000 records and more, we would have to have a lot more manpower, I think, to manage 105,000 accounts. But it is interesting. I sometimes wonder what we&rsquo;d do if we had an edit war between requesters over information in a record. As far as I know, we&rsquo;ve never had that. We have a curation team that evaluates everything on the merits. Is this a good change? Is this accurate? And we get good advice from others, as well, and the curation team makes the ultimate decisions.</p>
<p>But sometimes I wonder what we&rsquo;d do if there were competing authorities suggesting what the official name of an organization should be, for instance. We are doing a lot of that, you know. You mentioned as an example earlier changing the primary name in the <code>name</code> element in a ROR record to the name in another language. And we&rsquo;ve been doing a lot of that, because in the data we inherited, most of those names were in English, and so we get a lot of &ldquo;Hey, we are a Spanish university, can we have our primary name in Spanish?&rdquo; And we&rsquo;re happy to do that.</p>


<h3 id="hahahugoshortcode-s53-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I&rsquo;d say in general I don&rsquo;t have any big feature requests in terms of new things to do. I&rsquo;m a bit at a loss, except to keep doing curation. There must be a lot of work behind the scenes that probably no one sees. But yes, with over 100,000 records that must be a lot of work. It&rsquo;s very important, I think, that people trust that you&rsquo;re a reliable source of information. I think that with a lot of automated systems, there&rsquo;s a lot of drift towards things not being quite right or introducing errors. Having a team of humans that can tackle these requests is a really valuable thing. But it&rsquo;s a lot of time, and it may be something to consider as the usage gets higher and higher, especially if big publishers start making use of ROR, whether that&rsquo;s going to lead to an increased amount of curation required. And how do you cope with the increase? Because you can automate processes, but then I think you lose a bit of accuracy in the process, or even just a little less of a handle on what exactly is going on.</p>


<h3 id="hahahugoshortcode-s54-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s55-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>And the other thing would be the documentation, and <strong>the way that you&rsquo;re always inviting the community to provide feedback on things like schema changes, that&rsquo;s excellent, partly because it lets people know what&rsquo;s coming up, and lets you object if you have a strong opinion about that: &ldquo;Oh no, that doesn&rsquo;t work for us, have you considered that this might mess up this kind of system.&rdquo; I just think it&rsquo;s a really nice way of doing things. But it is very hard to do and do well, and I think ROR does it very well.</strong></p>


<h3 id="hahahugoshortcode-s56-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s wonderful to hear. We rely on it, and it really is enormously helpful if people can even just skim something and bring us their expertise.</p>


<h3 id="hahahugoshortcode-s57-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>Yeah, there are a lot of experts out there who might just think of something you haven&rsquo;t thought of before.</p>


<h3 id="hahahugoshortcode-s58-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We find that with every <a href="https://ror.readme.io/docs/feedback-docs">feedback round</a>. &ldquo;Oh, we thought we had a good proposal. But oh, here are the things that we didn&rsquo;t think about.&rdquo;</p>


<h3 id="hahahugoshortcode-s59-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>And people can ignore it if they&rsquo;re not interested.</p>


<h3 id="hahahugoshortcode-s60-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It&rsquo;s interesting what you said about curation, because we are absolutely seeing that. The rate of curation requests has been going up and up and up. We really have only been doing this kind of curation for about a year and a half, actually, because before that <a href="https://ror.readme.io/docs/grid">ROR was just synced with GRID</a>. So yeah, we are right now looking at ways to maintain the pace of that curation in the face of an increased number of requests. And I am also always trying to make that a little bit more visible, to show <a href="https://github.com/ror-community/ror-updates/issues">all of the requests that we do get</a> and all of the curation work that does go on. Because it is open and transparent, but it&rsquo;s on GitHub, where not everybody goes. It&rsquo;s trying to really show people where that is and what&rsquo;s happening.</p>


<h3 id="hahahugoshortcode-s61-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>But yeah, the volume of work increasing does pose an issue. Scaling has some challenges.</p>


<h3 id="hahahugoshortcode-s62-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, and we are figuring out what to do about that.</p>


<h3 id="hahahugoshortcode-s63-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I take it that ROR is completely diverged from GRID at this point, so you&rsquo;re entirely responsible for the data, and there are no updates from GRID to feed into ROR? At this point it&rsquo;s always a request directly to ROR?</p>


<h3 id="hahahugoshortcode-s64-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, that&rsquo;s correct.</p>


<h3 id="hahahugoshortcode-s65-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>And presumably no plans to remain in sync?</p>


<h3 id="hahahugoshortcode-s66-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Well, no, none at all. GRID no longer has public, resolvable IDs. The IDs do exist, but they are using them really only internally in Digital Science products like Dimensions. There are no public updates to GRID at all anymore.</p>


<h3 id="hahahugoshortcode-s67-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>I see.</p>


<h3 id="hahahugoshortcode-s68-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Well, that&rsquo;s about it from me. Anything else you want to say?</p>


<h3 id="hahahugoshortcode-s69-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>No, I&rsquo;ll let you go.</p>


<h3 id="hahahugoshortcode-s70-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for you being such a big supporter of ROR in general, and for giving us feedback and coming to our events.</p>


<h3 id="hahahugoshortcode-s71-hbhb-michael-parkin"><figure class="round-figure"><img src="/img/blog/europepmc/michael-parkin.jpg"
         alt="Avatar of Michael Parkin"/>
</figure>
 Michael Parkin 
</h3>
<p>My pleasure. Thanks for your time.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span><p>See also the <a href="https://www.eurekalert.org/news-releases/984137">press release</a> about Europe PMC&rsquo;s incorporation of ROR into its Grant Finder. Contact <a href="mailto:support@ror.org">support@ror.org</a> with any questions.</p>
<p>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</p>
</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Michael Parkin, data scientist at EMBL-EBI who helps maintain Europe PMC, explains why and how ROR helps with assessing funding impact and tracking researcher collaborations.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: ROR in DataSalon's MasterVision and PaperStack</title><id>https://doi.org/10.71938/nwjn-4m47</id><link rel="alternate" href="https://ror.org/blog/2023-05-10-datasalon-case-study/"/><published>2023-05-10T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>In January of 2022, DataSalon <a href="https://www.datasalon.com/web/downloads/press/PR_ROR.pdf">announced a full integration of ROR</a> into its scholarly publishing analytics products <a href="https://www.datasalon.com/web/features/dashboards.htm">MasterVision</a> and <a href="https://www.datasalon.com/web/features/paperstack.htm">PaperStack</a>. DataSalon&rsquo;s Client Services Director Andy Dobson sat down with us to tell us all about how they draw on fifteen years of data experience to match organizational affiliations in publisher data to ROR IDs.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;We&rsquo;re really interested in data standardization and cleansing: that&rsquo;s central to what we do, so we&rsquo;re all very aware of the importance of having a free and open standard reference dataset for identifying organizations and for supporting some of the other datasets that get loaded into our products. Open source data is key to our ethos.&rdquo;</p>
<p>&ldquo;And if you&rsquo;re moving into OA, and you&rsquo;re not understanding a standardized picture of your customer hierarchy, and you don&rsquo;t know how many articles are coming out of the University of Oxford and how much they should be paying in article fees, then that becomes really tricky. There are potentially significant financial implications, on the customer service side, of not having a dataset that allows you to clean up and standardize all of those customer pictures within your data.&rdquo;</p>
<p>&ldquo;ROR does open up new possibilities for us, such as integrating some of the other systems that are around at the moment. We&rsquo;re currently looking at bringing OpenAlex IDs into PaperStack. That should be really straightforward, because you can use the ROR IDs to retrieve the OpenAlex IDs through their API.&rdquo;</p>
<p>&ndash; Andy Dobson, Client Services Director, DataSalon</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for agreeing to interview with us. Can you start by telling us your name, title, and organization?</p>


<h3 id="hahahugoshortcode-s2-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>I&rsquo;m Andy Dobson. I&rsquo;m the Client Services Director at <a href="https://datasalon.com">DataSalon</a>.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about DataSalon.</p>


<h3 id="hahahugoshortcode-s4-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>DataSalon was originally set up in 2006 by Nick Andrews, who is our Managing Director, and Jon Monday, our Technical Director. They had been working together managing large-scale IT projects for major publishers and retailers, and had first-hand knowledge of the technical systems and services used by publishers. DataSalon was born out of the challenges, needs, and requirements faced by the publishers they were working with.</p>
<p>Our original product is <a href="https://www.datasalon.com/web/products/mastervision_product_tour.htm">MasterVision</a>, and that&rsquo;s really core to everything that we do here. Not only was it the product that gave the business its start originally, but it also became the basis for a lot of the services we provide, such as <a href="https://www.datasalon.com/web/features/paperstack.htm">PaperStack</a>, and the data quality work we do around cleaning and standardizing data. MasterVision was developed in response to the problem of establishing a joined-up view of multiple databases without having to replace or upgrade existing systems. From its launch the system was immediately adopted by OUP and the BMJ and was acclaimed as solving a significant business challenge for a fraction of the time and cost of conventional solutions. I’m pleased to say that both OUP and the BMJ are still clients!</p>
<p>We&rsquo;re quite a small company, but we’ve got a good long-term client base at DataSalon. People like the way we work, and they continue to work with us. We have a very collaborative relationship with our clients.  We help them solve a lot of the challenges around data cleansing, integration, and analysis of their data in a way that&rsquo;s significantly faster and cheaper and more flexible than conventional approaches. And as an organization we&rsquo;re quite small, but we&rsquo;re extremely agile, so we&rsquo;re very responsive in terms of customer service, and we&rsquo;ve got an excellent understanding of the scholarly publishing industry.</p>
<p>I&rsquo;ve previously worked at Elsevier and Ingenta, and all of the staff have worked at various companies within scholarly publishing at some point. So although we&rsquo;re an IT company, we&rsquo;ve very much got one foot inside scholarly publishing.</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>So you&rsquo;re saying that you&rsquo;re small but mighty?</p>


<h3 id="hahahugoshortcode-s6-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes, small but mighty. Exactly. We like to think of ourselves as being a bit different, innovative, and extremely effective. We&rsquo;re a 4-day week employer, and we&rsquo;ve always worked, in the main, from home. Even before the COVID days, staff have generally worked from home, so we&rsquo;ve been quite forward-thinking in that sense.</p>
<p>The key to our success has been in &lsquo;growing&rsquo; each client&rsquo;s system directly from the source data, rather than creating them through the traditional time-consuming (and expensive) process of analysis - design - implementation. This same innovative principle lies behind all of the services we now offer, including PaperStack. We offer a very collaborative way of working with the data that is customized to the needs of the publisher. So right from the setup of a given build of MasterVision or PaperStack we work with our clients to develop the tools alongside the specific requirements of their data.</p>
<p>There are a lot of similarities, obviously, across scholarly publishing data. Typically it&rsquo;s subscriptions data, memberships, usage, denials, etc. &ndash; all of those things that we&rsquo;re familiar with. But there are, within that, very individual requirements related to their business models and company goals, so we work with each one individually to give them what they need.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. Can you tell us a little bit about first MasterVision and then PaperStack? What kinds of things do they provide for publishers?</p>


<h3 id="hahahugoshortcode-s8-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>MasterVision was our first product and it provides publishers with complete customer insight via a fast and user-friendly hosted service. It painlessly merges all of the valuable data from different source systems providing a &lsquo;single customer view&rsquo; of all interactions with a customer. The interface itself is a standard across all of our clients, but each one has been tailored to their individual requirements. There wasn&rsquo;t anything doing that for publisher data at the time MasterVision was created, and I still don&rsquo;t think there is anything that offers that in quite the same way.</p>
<p>We don&rsquo;t look to replace any of the source systems. Publishers will still pick and choose the most effective systems that they want for fulfilling their needs, ie. their platform hosts, their membership database, fulfillment, and peer review systems etc. The integrity of that data is kept and what we&rsquo;re doing is providing a read-only interface onto all of that data. So we&rsquo;re effectively joining up all the different instances of the given customer, whether that be an individual or institution, into a single record, and maintaining and describing that customer hierarchy for detailed analysis. It comes in really useful if you want to analyze things like &ldquo;How many authors have I got submitting to our journal from a given institution?&rdquo;</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I can see why you would be interested in ROR, then.</p>


<h3 id="hahahugoshortcode-s10-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes, exactly. So having a reference record to tie customers to is really important, and having the hierarchy of those customers offers vital context. MasterVision is predominantly a sales and marketing tool, and allows publishers to look for sales opportunities and to provide better customer service through having a deep understanding of their customers. If you&rsquo;ve got customer data in multiple datasets living in multiple systems, it&rsquo;s time-consuming to go in and query each of those separately. Whereas pulling all those together into a system, that can&rsquo;t be broken by end users, allows staff to see exactly what&rsquo;s going on on a customer basis for all aspects of the relationship. And it&rsquo;s extremely quick and simple to get answers. So rather than getting IT involved to run complex queries across multiple systems, the end user, even if they&rsquo;re non-technical, can do that themselves. You can put together quite complex queries with MasterVision: &ldquo;Who does XYZ, but doesn’t do A or B?&rdquo;</p>
<figure><img src="/img/blog/datasalon/datasalon-medical.png"
         alt="Data showing 1246 institutions with medical books as a blue circle and 1240 institutions with medical journals as a yellow circle with the overlapping area in green with 996 institutions with both medical books and medical journals"/>
</figure>



<h3 id="hahahugoshortcode-s12-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It sounds like you anticipated data science before &ldquo;data science&rdquo; was really a term.</p>


<h3 id="hahahugoshortcode-s13-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes, we&rsquo;ve been doing this for quite a long time now. We feel like we were one of the first to really promote data quality and data cleansing. And as I say, many clients stay with us for a long time.</p>
<p>Our other product, PaperStack, works on a similar basis. It&rsquo;s providing publishers with a complete reporting suite for the scholarly submissions process. It is fully integrated with major peer review systems including ScholarOne and Aries Editorial Manager, meaning there is almost zero publisher overhead to get up and running. As a standard part of the service, data from peer review systems is also automatically linked and enhanced using industry data from ROR or Ringgold, Funder Registry, ORCID, and Crossref - all helping to maximize reporting insights. This results in a clean, standardized and comprehensive cross-journal view, available at the institutional, funder, author, and article level. As far as possible PaperStack aims to provide a service with zero overhead in terms of effort on the publisher&rsquo;s part. What we&rsquo;re doing is providing a contextualized and enhanced view of their peer reviewed data, including details of where rejected articles end up.</p>
<figure><img src="/img/blog/datasalon/datasalon-denials.png"
         alt="Chart titled Rejected Article Tracking showing 271,002 total rejected articles to begin with and over time 22.6% were later published with us, 55.9% were later published elsewhere, and 21.5% not published, with subsequent names of publishers and journals where the rejected articles were published"/>
</figure>

<p>By &ldquo;contextualized,&rdquo; I mean that we&rsquo;re using ROR, Ringgold, Funder Registry data, ORCIDs, publication information from Crossref, etc., for building out a comprehensive view of not only the timeline of a given article, but also all of the other big pieces of the puzzle: What are the affiliations of the authors and the editors? What is the hierarchy in terms of organizations they&rsquo;re affiliated with now? How was that then and what does it look like now? How long did it take an article to be accepted? Where did it ultimately get published? We use all that data to create a set of standardized reports that publishers can use, as well as helping them to create additional customized reports that they might want to design to meet their business needs. So PaperStack provides extremely detailed, responsive, and visually appealing reporting of their scholarly submissions.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And do you think that&rsquo;s also primarily used by sales and marketing teams at publishers? Or does it have a different purpose?</p>


<h3 id="hahahugoshortcode-s16-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>It&rsquo;s primarily editorial staff that use PaperStack, but there are applications for sales and marketing in terms of some of the reports that could be created. Additionally you could add in any additional customer data to enhance the overall picture. Because it&rsquo;s based on MasterVision, it potentially means clients could load in usage and denials, APC data etc. to add further value to the submissions picture.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great! I think I&rsquo;ve got a sense of what they do now. Tell me, who were the primary advocates of implementing ROR at your organization?</p>


<h3 id="hahahugoshortcode-s18-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>I would say all of us, from the customer service side to the product managers, to the technical team, and Nick himself. One of our product managers managed the implementation, but the requirements from our clients meant that there was a lot of support and interest from all parts of the business, really, in getting it within the products that we use. <strong>We&rsquo;re really interested in data standardization and cleansing: that&rsquo;s central to what we do, so we&rsquo;re all very aware of the importance of having a free and open standard reference dataset for identifying organizations and for supporting some of the other datasets that get loaded into our products. Open source data is key to our ethos.</strong></p>


<h3 id="hahahugoshortcode-s19-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s wonderful. You&rsquo;ve mentioned the importance of a free and open dataset, and your products do use Ringgold as well, as you&rsquo;ve mentioned. Can you talk a little bit about what ROR does that Ringgold doesn&rsquo;t? And vice versa, if you like, too: what does Ringgold do that ROR doesn&rsquo;t?</p>


<h3 id="hahahugoshortcode-s20-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes. Just to go back slightly: as an organization, we&rsquo;ve always used an org dataset for standardizing and linking academic institutions within MasterVision and PaperStack. It&rsquo;s key for what we do. And while that&rsquo;s not completely necessary for the products themselves, it does provide an underpinning for delivering a comprehensive and reliable single customer view. In the past we&rsquo;ve always used Ringgold for those clients that license the Ringgold data. But there are a number of clients for whom the Ringgold dataset isn’t affordable, practical, or streamlined enough for their purposes. Our client base is pretty much half and half, I would say, between the publishers that used Ringgold and the people that didn&rsquo;t have anything. Back in 2014 we felt that there was a big need for us and our clients to have access to a free and open dataset. I don&rsquo;t know how familiar you are with OrgRef, but <a href="https://blog.datasalon.com/2014/10/28/introducing-orgref/">we launched OrgRef in 2014</a>.</p>


<h3 id="hahahugoshortcode-s21-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I&rsquo;m somewhat familiar with it.</p>


<h3 id="hahahugoshortcode-s22-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Back in 2014 we launched OrgRef as the first free and open dataset about academic institutions. We worked mainly to establish some useful metadata around academic institutions and a hierarchy picture to provide a way of standardizing that information and making it available to the wider community. I think when OrgRef was launched there wasn&rsquo;t any other free and open institutional ID available. We gave it away for free, and we used it within our products. We received a lot of useful input, from our clients, to develop the dataset further.</p>
<p>It was well received, but obviously the resource that we could attach to it was a lot smaller than commercial offerings, so there was a sizable gulf in terms of the volume of records that sat within the dataset. Ringgold had about 400,000 or so at the time, and we had I think about 80,000 &ndash; although those 80,000 pretty much covered all of the likely subscribers to academic content. Anything more than that tends to include orgs a lot lower down the institutional hierarchy. Not all scholarly publishers are going to be interested in seeing a secondary school in the middle of California that&rsquo;s not likely to buy a subscription. There is a lot of data within Ringgold, for example, that won’t be necessary for the needs of the smaller publishers.</p>


<h3 id="hahahugoshortcode-s23-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We&rsquo;ve definitely heard that before, and of course we appreciate it. ROR currently has about 105,000 records.</p>


<h3 id="hahahugoshortcode-s24-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>OrgRef itself was extracted from structured information on Wikipedia and other open resources. And, as I say, it aimed to cover most of the important academic and research organizations. The metadata was similar to what you would see in Ringgold &ndash; things like name, country, URL, standard IDs like <a href="https://viaf.org/">VIAF</a>. We made OrgRef free and open for people to download. It was maintained by us and updated on a monthly basis. That probably was less frequent than some publishers would like, in the sense that they might be wanting to update their customer data on a more regular basis. And so we managed that for a while, and then <a href="https://www.digital-science.com/blog/2015/10/digital-science-launches-grid-a-new-global-open-database-offering-unique-information-on-research-organisations/">Digital Science launched their GRID dataset</a>, and they had more resource available to put into that. Their metadata was slightly richer. So we assisted GRID, and they used the OrgRef data to enrich the GRID dataset.</p>


<h3 id="hahahugoshortcode-s25-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That was very generous of you.</p>


<h3 id="hahahugoshortcode-s26-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Well, it was free and open anyway! But we stepped back and let them carry on, just because we&rsquo;re quite a small company. We had other products and projects that we were more focused on. The main goal was really to have an open data set that could be used by those who couldn&rsquo;t afford Ringgold and that could be used within our products. If someone else was willing to take that on, that was fine by us.</p>


<h3 id="hahahugoshortcode-s27-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>They were very helpful in setting up ROR.</p>


<h3 id="hahahugoshortcode-s28-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes. Although it&rsquo;s great to do it as an open resource, I suspect both Digital Science and ourselves partly did it for selfish reasons as well as altruistic reasons. So I think when it gets to the point that someone else is prepared to take over, you say, &ldquo;Yeah, that&rsquo;s great, thanks.&rdquo;</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s30-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>And ROR probably sits within that Crossref space better. A lot of what we were doing was pretty much automated, although there was some manual input. As I understand it, there&rsquo;s more manual input on the Ringgold side for digging out the right record, making sure that that is the right address, etc. They&rsquo;ll resolve institutional lists that you might want to provide to them, and they do some other bits and pieces that sit outside of what is possible with automation alone. It comes down to, I suppose, publishers getting what they pay for. But having a free and open reference dataset is great for everyone.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. Can you take us through some details of what you&rsquo;ve done in your ROR integration? How did that process go? What does that look like within MasterVision in particular, but also PaperStack?</p>


<h3 id="hahahugoshortcode-s32-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes, okay. The ROR integration was very straightforward. We use ROR in both our core services, MasterVision and PaperStack – both of these join together data from multiple sources to give a complete picture of customer activity (for the former) and the editorial process (for the latter), and to do that we need to identify which records relate to the same institution or individual. Matching with ROR means we can link up all the activity by a particular customer even if the data doesn’t have any other IDs or if multiple customer IDs have mistakenly been assigned to the same institution – which happens a lot!</p>
<p>It also means we can standardize the data – we use the name and location information from ROR for the main display, so that the format is consistent across all data sources.
We do the matching not just for institutions but also for individuals, to link them to their associated institution, which means that our clients are able to analyze data available at the individual level (like article submissions or pay per view purchases), as well as at the institutional level.</p>
<p>We also use the parent/child information included in ROR to show the relationships between records, with a family-tree style hierarchy viewer. Making use of hierarchy data allows clients to achieve complex uses around activity, be it funding, article submissions, content purchasing at any level in that hierarchy, or for example, do institutions access content that their associated authors submit to.
<figure><img src="/img/blog/datasalon/datasalon-oxford.png"
         alt="Database graphic showing institutional hierarchy with the top parent organization being Southern Universities Purchasing Consortium then University of Oxford and its children with Christ Church selected"/>
</figure>
</p>


<h3 id="hahahugoshortcode-s34-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And how did you match the ROR IDs to the existing institutional data?</p>


<h3 id="hahahugoshortcode-s35-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>It was quite simple to do because ROR data is very much similar to GRID. We would just use ROR in the same way as the GRID data that we&rsquo;d previously been loading. Because ROR contained exactly the same data as GRID, including GRID IDs, it made it easy for us to test and implement. We migrated each client site, and we were able to check that the matching tools that we used were always giving the same results as ROR IDs. There&rsquo;s lots of challenges around that, particularly using real-life data. Using a reference dataset means that we can standardize information within the real data from publishers, which is pretty messy most of the time and not consistent from system to system. There&rsquo;s things like different name forms, different location information, different data structures, data entry errors, missing location information. But a lot of those things we&rsquo;d already addressed and developed within our own matching tools.</p>


<h3 id="hahahugoshortcode-s36-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Gotcha.</p>


<h3 id="hahahugoshortcode-s37-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Part of what we do is provide matching services to publishers as well, so that sits within the tools that we have. As we load data, we will match to the reference dataset where we can. Where there are IDs that we can match, we will use those. If not, we will use our own tools to auto-match against that reference dataset. So we&rsquo;re using that dataset in two ways. One is ‘as we find them’, matching customer IDs that might sit within the publisher data to the reference dataset, and the other is to match the metadata to the reference dataset.</p>


<h3 id="hahahugoshortcode-s38-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And when you say customer IDs, what do you mean?</p>


<h3 id="hahahugoshortcode-s39-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>If the fulfillment data, for instance, that we receive from a publisher contains a ROR ID, or a Ringgold ID, we&rsquo;ve immediately got a match to the full ROR or Ringgold record. So where those are used in the publisher data, we will match that to the standard records.</p>


<h3 id="hahahugoshortcode-s40-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Gotcha.</p>


<h3 id="hahahugoshortcode-s41-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>So for example, in their fulfillment data, they might have a customer specific ID. Whereas in their usage and denials data they might have a ROR ID which identifies a given org. Where we can pick out those IDs, we&rsquo;ll use them, but we&rsquo;ve got our own matching tools that we&rsquo;ve developed over the last 15+ years that allows us to do automated matching to create a link to the ROR data, where those IDs don&rsquo;t exist. The benefit is that all those different instances of &ldquo;University of Oxford&rdquo; will match to <a href="https://ror.org/052gg0110">one &ldquo;University of Oxford&rdquo; record</a>.</p>


<h3 id="hahahugoshortcode-s42-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right!</p>


<h3 id="hahahugoshortcode-s43-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>We&rsquo;re doing work on our side to match to those standard records. We use a variety of strategies like fuzzy matching; lists of alternative names to supplement the ones in ROR; lists of alternative names to ignore (so that common acronyms shared by more than one organization don’t lead to incorrect matches); lists of synonyms for typos and US/UK spellings; mappings of state and country codes to full text; inferring missing information from the country name; and matching email and web domains against the organization URLs in ROR.</p>


<h3 id="hahahugoshortcode-s44-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wow.</p>


<h3 id="hahahugoshortcode-s45-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>We are very proud of our auto-matching tools which can link &lsquo;real world&rsquo; (i.e., messy!) publisher data to the correct ROR records with an impressive degree of accuracy. The rules and exceptions in play there have been refined over many years. For example, ROR ID <a href="https://ror.org/04cvxnb49">https://ror.org/04cvxnb49</a> (Goethe University Frankfurt) brings together many different names:</p>
<ul>
<li>Goethe University</li>
<li>Goethe University, Frankfurt</li>
<li>Goethe-Universität Frankfurt am Main</li>
<li>Goethe-University</li>
<li>Johann Wolfgang Goethe University</li>
</ul>


<h3 id="hahahugoshortcode-s46-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It&rsquo;s fascinating, and you do have a a nice <a href="https://blog.datasalon.com/2022/11/11/datasalons-secret-sauce-revealed/">blog post</a> detailing some of what you&rsquo;ve you just told me &ndash; your &ldquo;secret sauce.&rdquo; We&rsquo;ve found that that matching is a thing that many people and organizations want to do, because there are a number of organizations that, as you say, had nothing, and so they have a pile of messy text strings, and we do have tools and methods that we recommend for doing that. But it&rsquo;s been remarkable to me in the last few months to discover how many different methods people are using. A lot of people are building their own systems for doing that, and they don&rsquo;t have your fifteen years of experience doing that matching with other things.</p>


<h3 id="hahahugoshortcode-s47-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>What we&rsquo;ve got in place is very well tried and tested now and works really well in real-life scenarios. There can be multiple different versions of Oxford University that might sit within the different datasets that we pull in, whether it&rsquo;s &ldquo;University of Oxford,&rdquo; &ldquo;Oxford University,&rdquo; or indeed a subunit that sits below that top level, something like the Bodleian Library, which would be considered as part of the University of Oxford.</p>


<h3 id="hahahugoshortcode-s48-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Is the matching that you do entirely automated, or is there any human checking afterward?</p>


<h3 id="hahahugoshortcode-s49-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>There is some manual checking that goes on, but it&rsquo;s predominantly automated. I would say probably 90% is automated. There is a manual checking process where we&rsquo;ll go through and make sure there&rsquo;s no obvious incorrect links there. That gets used in a couple of different ways. The first is on the fly, during the build processes for our two products, MasterVision and PaperStack. What we&rsquo;re doing is we&rsquo;re taking weekly data uploads from the publisher and then rebuilding those sites for them on a weekly basis, sometimes on a daily basis. Part of that build process will be running that matching against the latest data that we receive so that the publisher has the latest, best, single standard record for a given institutional customer based on the latest data available. In that sense, it&rsquo;s working in an automated way as part of the build process, so if there&rsquo;s half a dozen versions of &ldquo;Oxford University,&rdquo; then we would link those up during that build. Obviously, if there are IDs that we can use from Ringgold or ROR, then we can use those in the build process for creating that single standard record.</p>
<p>And then we also use the auto-matching tools that we have to provide additional services to publishers. Some might not be MasterVision or PaperStack clients. They might come to us and say, &ldquo;We&rsquo;ve got 10,000 institutional records that are just a complete mess from this particular database that we&rsquo;ve been keeping for the last ten years. Can you deduplicate it? Standardize it? Match it to ROR?&rdquo; And we can do that for them. It&rsquo;s part of the service that we can provide as well.</p>


<h3 id="hahahugoshortcode-s50-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, that&rsquo;s interesting! I didn&rsquo;t know that. As I said, we actually have been hearing a lot of desire for that from different entities.</p>


<h3 id="hahahugoshortcode-s51-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes, it&rsquo;s a pretty straightforward process. By its nature it&rsquo;s not 100% accurate, because often the data is so messy and what&rsquo;s available just isn&rsquo;t clear enough that some matching is just not possible. Oftentimes the data might have been collected over a period of years from lots of different places. There might be someone who has been to an industry show that&rsquo;s just taken some notes down and has gone, &ldquo;Well, you know, X university has got this address,&rdquo; and then it sits within the database and no one knows what to do with it or how clean it is. So, yes, we often do projects for people where they want matching services to things like ROR or Ringgold if they license that.</p>


<h3 id="hahahugoshortcode-s52-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>You&rsquo;ve said that the matching is not 100% accurate and that 90% of the matching is done automatically. Do you have any rough metrics that you know or can share about how accurate your matching is if you just do it automatically?</p>


<h3 id="hahahugoshortcode-s53-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>That&rsquo;s a good question.</p>


<h3 id="hahahugoshortcode-s54-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I know it&rsquo;s hard to tell.</p>


<h3 id="hahahugoshortcode-s55-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>It&rsquo;s tricky, because it depends on what the original data is like. Sometimes the matching might be 90%, sometimes it might be 70%; it just depends on how clean the input data is. But I would say around 80% is probably right.</p>


<h3 id="hahahugoshortcode-s56-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And then, whatever the gap is, you&rsquo;ll check that manually?</p>


<h3 id="hahahugoshortcode-s57-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Well, it depends on what the client wants us to do. If they want us to do some manual work to try and fill in some of the gaps, we can do that. However, oftentimes an automated job matching a large chunk of data that otherwise couldn’t be used is good enough. Also, in some cases, if the data is really messy and sparse, and there is really not much evidence to allow us to make a match, then even with some manual input it would be difficult to have a match that you were 100% sure was the right one.</p>


<h3 id="hahahugoshortcode-s58-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right, because you just can&rsquo;t tell what people mean. What do they mean?</p>


<h3 id="hahahugoshortcode-s59-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Well, there are two big &ldquo;York University’s” in the world. There&rsquo;s <a href="https://ror.org/05fq50484">one in Canada</a> and <a href="https://ror.org/04m01e293">one in the UK</a>. There might be <a href="https://ror.org/0190ak572">another one somewhere</a> as well. If someone&rsquo;s put &ldquo;York University&rdquo; and there&rsquo;s little additional contextual information, you wouldn&rsquo;t be 100% sure on which one they meant. So in some cases there&rsquo;s some ambiguity there even when you get down to a manual level, because you just wouldn&rsquo;t be able to tell which was which. There is an element of not having all of the information to allow you to be 100% accurate. We could literally be receiving a file from someone coming to us and saying &ldquo;We found this file lying around on an old system. It has 10,000 names, and we don&rsquo;t really know what to do with it. Can you do something?&rdquo; So the starting point can often be challenging.</p>


<h3 id="hahahugoshortcode-s60-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Got it. Let me ask you, too, how do you get the newest ROR data? Are you using the ROR API? Or do you get the ROR data dump?</p>


<h3 id="hahahugoshortcode-s61-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>We get the ROR data dump. We do use APIs for our PaperStack service when it comes to actually loading certain information. We use the ScholarOne APIs and the Editorial Manager API for collecting the data. However, we use the ROR data dump itself for loading into our services.</p>


<h3 id="hahahugoshortcode-s62-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We are doing <a href="https://github.com/ror-community/ror-updates/releases">releases</a> now at least once a month, usually twice a month. Do you have a programmatic way of getting those new updates to ROR, or are you just redownloading it <a href="https://doi.org/10.5281/zenodo.6347574">from Zenodo</a> every time?</p>


<h3 id="hahahugoshortcode-s63-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>We download it on a monthly basis, so whenever there&rsquo;s a new one that comes out, we will pick that up and then apply it to the data that we&rsquo;ve got.</p>


<h3 id="hahahugoshortcode-s64-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Perfect. Just a couple of more questions. What do you hope ROR does in the future? Do you have any bug fixes, feature suggestions, advice, requests?</p>


<h3 id="hahahugoshortcode-s65-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Our main concern previously was that there was no concept of inactive organizations, so it&rsquo;s great that <a href="https://ror.org/blog/2022-12-07-handling-org-status/">that&rsquo;s been addressed</a>. Going forward, I guess the best thing from our point of view would be adding new records just to increase the scope of the dataset.</p>
<p>It&rsquo;s also key for us to keep the <a href="https://ror.org/blog/2023-02-27-parents-children-and-other-relationships-in-ror/">organizational hierarchies</a> lean and relevant. As we talked about, having too many layers just takes away the usefulness of it and muddies the water a bit. You don&rsquo;t need to know ten layers down what&rsquo;s going on, because it&rsquo;s likely not important or not relevant. That&rsquo;s probably the main thing from our side, really, just for what we do, because that allows us to keep things like hierarchies within MasterVision and PaperStack understandable. If you&rsquo;re looking for a sales prospect, or you want to understand at what level a subscription&rsquo;s sitting at, or you want to know what authors are submitting at what institution, you don&rsquo;t need many levels. You want to know those direct relationships, and maybe one or two layers down, but you don&rsquo;t need to know that someone worked in a department of a library at the college of X university, which probably isn&rsquo;t going to be particularly relevant to the sorts of analysis you want to do as a publisher.</p>


<h3 id="hahahugoshortcode-s66-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What I&rsquo;m also wondering about is what the business case for ROR is &ndash; for you, but also for publishers or other entities. How is ROR adding value to for-profit enterprises, given that it itself is free and open?</p>


<h3 id="hahahugoshortcode-s67-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>It comes down to the standardization of your customer data and to have a unique way of identifying a given customer / organization, within the sphere of your relationship and within scholarly publishing more widely. If you can&rsquo;t associate all of your customer activity to a standardized picture of each organization, then what you&rsquo;re left with is a very messy picture that is difficult to understand, and it&rsquo;s very time-consuming to analyze and get into the guts of what your relationship with them ultimately is. If you&rsquo;ve got multiple versions of an institution that you&rsquo;re not tying together in a single record, then you don&rsquo;t know, potentially, the full value that X University is getting from the deal.</p>
<p>For example, if a customer is referenced differently within both <a href="https://www.projectcounter.org/">COUNTER data</a> and fulfillment data, and you can’t join those records together, then you’re not going to be able to understand how much use they are making of their subscriptions. It also has a big impact on the customer relationship. When it comes to things like renewing sales, if you can’t actually tie up all of the different sublevels of the hierarchy of Oxford University to the top level, you could end up trying to sell things they already buy and it becomes a very untrusting relationship.</p>


<h3 id="hahahugoshortcode-s68-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes indeed.</p>


<h3 id="hahahugoshortcode-s69-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>If you&rsquo;re making a decision as a library based on how much content is being used, then you want to make sure your usage numbers align with what the publisher is seeing. And as a publisher, if you&rsquo;re not able to present consistent numbers because you&rsquo;re not able to tie up usage of the Bodleian Library with the University of Oxford, then it becomes a very difficult conversation. It&rsquo;s like any customer relationship: one needs to feel valued and fully understood. None of us want to be mistaken for somebody else!</p>


<h3 id="hahahugoshortcode-s70-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Definitely.</p>


<h3 id="hahahugoshortcode-s71-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p><strong>And if you&rsquo;re moving into OA, and you&rsquo;re not understanding a standardized picture of your customer hierarchy, and you don&rsquo;t know how many articles are coming out of the University of Oxford and how much they should be paying in article fees, then that becomes really tricky. There are potentially significant financial implications, on the customer service side, of not having a dataset that allows you to clean up and standardize all of those customer pictures within your data.</strong> Having access to ROR and services like MasterVision allows publishers to continue to maintain and use their existing databases, and the imperfections therein, whilst being confident that they can clean and standardize data in an effective and economical way further downstream.</p>


<h3 id="hahahugoshortcode-s72-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right. We hear that a lot, that collecting beautifully structured, accurate metadata at the beginning is the ideal, but it just can&rsquo;t always happen that way. Particularly in the case of affiliations, which were collected so often just in a text field, it can be very difficult.</p>


<h3 id="hahahugoshortcode-s73-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Yes, exactly. While we would always recommend putting good data governance practices into place, that is not always possible or affordable. Having access to services that allow one to get the most out of what one has is the next best thing.</p>


<h3 id="hahahugoshortcode-s74-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yep. That&rsquo;s about all I have for you. What else do you want to say about ROR or your implementation of ROR?</p>


<h3 id="hahahugoshortcode-s75-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>I think I probably covered most of it. <strong>ROR does open up new possibilities for us, such as integrating some of the other systems that are around at the moment. We&rsquo;re currently looking at bringing <a href="https://openalex.org">OpenAlex IDs</a> into PaperStack. That should be really straightforward, because you can use the ROR IDs to retrieve the OpenAlex IDs <a href="https://docs.openalex.org/api-entities/institutions">through their API</a>.</strong> So that&rsquo;s one of the benefits of having the ROR IDs and the mappings that we&rsquo;ve got in place on our side. It&rsquo;s opening up some other possibilities for us, which is nice. But in the main, having that free and open data is beneficial to everyone!</p>


<h3 id="hahahugoshortcode-s76-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. Well, thank you so much again for doing this with us, Andy.</p>


<h3 id="hahahugoshortcode-s77-hbhb-andy-dobson"><figure class="round-figure"><img src="/img/blog/datasalon/profile_andy.png"
         alt="Avatar of Andy Dobson"/>
</figure>
 Andy Dobson 
</h3>
<p>Thank you for the opportunity.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In January of 2022, DataSalon announced a full integration of ROR into its scholarly publishing analytics products MasterVision and PaperStack. DataSalon's Client Services Director Andy Dobson sat down with us to tell us all about how they draw on fifteen years of data experience to match organizational affiliations in publisher data to ROR IDs.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">PIDs and Open Science: Building Community in Latin America | PIDs y Ciencia Abierta: Construyendo comunidad en América Latina</title><id>https://doi.org/10.71938/c9j5-hy97</id><link rel="alternate" href="https://ror.org/blog/2023-05-09-pids-open-science-latin-america/"/><published>2023-05-09T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Ana Cardoso</name><uri>https://orcid.org/0000-0002-6397-633X</uri></author><author><name>Gabriela Mejias</name><uri>https://orcid.org/0000-0002-1598-7181</uri></author><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p><em>Cross-posted by <a href="https://doi.org/10.5438/wnh3-8q65">DataCite</a> and <a href="https://info.orcid.org/pids-and-open-science-building-community-in-latin-america/">ORCID</a></em></p>
<p><em>Versión en español sigue abajo</em></p>
<p>Persistent identifiers are playing a key role in driving more robust research infrastructure and open science initiatives across Latin America. This was a primary theme at the event &ldquo;Persistent Identifiers (PIDs) and Open Science in Latin America&rdquo; (<a href="https://twitter.com/search?q=%23PIDsLATAM23&amp;src=typed_query">#PIDsLATAM23</a>) held on April 18 in Buenos Aires (Argentina) during <a href="https://csvconf.com/">csv,conf,v7</a>.</p>
<p>Organized by <a href="https://datacite.org">DataCite</a>, <a href="https://orcid.org">ORCID</a>, and ROR, the event was attended by more than 70 research stakeholders from across the Latin American region and elsewhere, representing 40 different institutions in total.</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-event-kickoff.jpg"
         alt="Event kick-off: (from left to right) [Maria Gould](https://orcid.org/0000-0002-2916-3423), [Ana Cardoso](https://orcid.org/0000-0002-8631-3838), [Gabriela Mejias](https://orcid.org/0000-0002-1598-7181)."/><figcaption>
            <p>Event kick-off: (from left to right) <a href="https://orcid.org/0000-0002-2916-3423">Maria Gould</a>, <a href="https://orcid.org/0000-0002-8631-3838">Ana Cardoso</a>, <a href="https://orcid.org/0000-0002-1598-7181">Gabriela Mejias</a>.</p>
        </figcaption>
</figure>

<p>Aimed at research administrators, library directors, and technical staff, the full-day program featured a series of in-depth presentations&mdash;all in Spanish&mdash;about how persistent identifiers are being implemented and used in national, consortial, and institutional settings to advance open science and increase the discoverability and visibility of research in Latin America. The goal was to present use cases and success stories and bring together stakeholders with shared interests and challenges in a Spanish-language setting.</p>
<p>The first set of presentations focused on large-scale initiatives that are aimed at enabling open science with persistent identifiers. Gustavo Durand from <a href="https://dataverse.org/">The Dataverse Project</a> discussed how PIDs are being implemented in the platform to make research data and other works more openly available; Abel del Carpio from <a href="https://www.gob.pe/concytec">CONCYTEC</a> in Peru discussed the organization&rsquo;s strategy with regard to leveraging PIDs on a national level; and Washington Segundo from <a href="https://www.gov.br/ibict/pt-br">IBICT</a> and <a href="https://www.lareferencia.info/es/">LA Referencia</a> discussed efforts in Brazil and across Latin America to develop open science infrastructure using PIDs.</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-panel1.jpg"
         alt="Panelists of the session *PIDs and open science* (from left to right) [Washington Segundo](https://orcid.org/0000-0003-3635-9384) (IBICT, Brasil), [Gustavo Durand](https://orcid.org/0000-0002-2188-2570) (Dataverse, USA), [Abel del Carpio](https://orcid.org/0000-0003-4469-6516) (CONCYTEC, Peru)."/><figcaption>
            <p>Panelists of the session <em>PIDs and open science</em> (from left to right) <a href="https://orcid.org/0000-0003-3635-9384">Washington Segundo</a> (IBICT, Brasil), <a href="https://orcid.org/0000-0002-2188-2570">Gustavo Durand</a> (Dataverse, USA), <a href="https://orcid.org/0000-0003-4469-6516">Abel del Carpio</a> (CONCYTEC, Peru).</p>
        </figcaption>
</figure>



<h3 id="review-the-session-1-presentations">Review the Session 1 presentations 
</h3>
<ul>
<li>
<p>Gustavo Durand, &ldquo;Dataverse y los PIDs,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860265">https://doi.org/10.5281/zenodo.7860265</a></p>
</li>
<li>
<p>Abel del Carpio, &ldquo;Plataformas de CTI y los identificadores persistentes (PIDs) en el Perú,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860287">https://doi.org/10.5281/zenodo.7860287</a></p>
</li>
<li>
<p>Washington Segundo, &ldquo;Los PIDs y la ciencia abierta: perspectiva Brasil y América Latina,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860279">https://doi.org/10.5281/zenodo.7860279</a></p>
</li>
</ul>
<p>The next series of presentations focused on global persistent identifier services for enabling more open and interoperable research infrastructure across Latin America and beyond, featuring <a href="https://datacite.org">DataCite</a> (Gabi Mejias), <a href="https://orcid.org">ORCID</a> (Ana Cardoso), and <a href="https://ror.org">ROR</a> (Maria Gould).</p>


<h3 id="review-the-session-2-presentations">Review the Session 2 presentations 
</h3>
<ul>
<li>
<p>Gabi Mejias, &ldquo;Conectando la investigación con DataCite,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860319">https://doi.org/10.5281/zenodo.7860319</a></p>
</li>
<li>
<p>Ana Cardoso, &ldquo;PIDs y ORCID,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860315">https://doi.org/10.5281/zenodo.7860315</a></p>
</li>
<li>
<p>Maria Gould, &ldquo;ROR, El Registro Abierto de Identificadores Persistentes para Organizaciones de Investigación,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860464">https://doi.org/10.5281/zenodo.7860464</a></p>
</li>
</ul>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-gabi-ana-maria.jpeg"
         alt="Panelists of the session *PIDs as open research infrastructure* (from left to right) Gabi Mejias (DataCite, Germany), Maria Gould (ROR, USA), Ana Cardoso (ORCID, Mexico)."/><figcaption>
            <p>Panelists of the session <em>PIDs as open research infrastructure</em> (from left to right) Gabi Mejias (DataCite, Germany), Maria Gould (ROR, USA), Ana Cardoso (ORCID, Mexico).</p>
        </figcaption>
</figure>

<p>The third and final series of presentations focused on specific implementations of persistent identifiers in Latin American research infrastructure. This session highlighted institutional use cases at the <a href="https://unr.edu.ar/">Universidad Nacional de Rosario</a> in Argentina (Paola Carolina Bongiovani, Paulina Freán, Analía Salazar) and <a href="https://www.uchile.cl/">Universidad de Chile</a> (Rodrigo Donoso) to implement persistent identifiers in library and repository systems, and examples of consortia-level implementations by <a href="https://www.consortia.com.co/">Consortia</a> from Colombia (Paula Saavedra) and <a href="https://www.escire.lat/">eScire</a> (Nydia Lopez and Joel Torres) from Mexico.</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-panel3.jpg"
         alt="Panelists of the session *PIDs implementation in Latin America* (from left to right) [Analía Salazar](https://orcid.org/0000-0001-8204-1591) (UNR, Argentina), [Paulina Frean](https://orcid.org/0000-0002-9710-0213) (UNR, Argentina), [Nydia López](https://orcid.org/0009-0007-3657-5817) (eScire, Mexico), [Joel Torres](https://orcid.org/0000-0001-6739-5906) (eScire, Mexico), [Paola Bongiovani](https://orcid.org/0000-0003-0049-9086) (UNR, Argentina), [Rodrigo Donoso Vegas](https://orcid.org/0000-0001-5635-4324) (Universidad de Chile, Chile), [Paula Saavedra](https://orcid.org/0000-0001-5383-3020) (Consortia, Colombia)."/><figcaption>
            <p>Panelists of the session <em>PIDs implementation in Latin America</em> (from left to right) <a href="https://orcid.org/0000-0001-8204-1591">Analía Salazar</a> (UNR, Argentina), <a href="https://orcid.org/0000-0002-9710-0213">Paulina Frean</a> (UNR, Argentina), <a href="https://orcid.org/0009-0007-3657-5817">Nydia López</a> (eScire, Mexico), <a href="https://orcid.org/0000-0001-6739-5906">Joel Torres</a> (eScire, Mexico), <a href="https://orcid.org/0000-0003-0049-9086">Paola Bongiovani</a> (UNR, Argentina), <a href="https://orcid.org/0000-0001-5635-4324">Rodrigo Donoso Vegas</a> (Universidad de Chile, Chile), <a href="https://orcid.org/0000-0001-5383-3020">Paula Saavedra</a> (Consortia, Colombia).</p>
        </figcaption>
</figure>



<h3 id="review-the-session-3-presentations">Review the Session 3 presentations 
</h3>
<ul>
<li>
<p>Paola Carolina Bongiovani, Paulina Freán, Analía Salazar, &ldquo;Implementación de PIDs en América Latina REPOSITORIO DE DATOS ACADÉMICOS RDA-UNR dataverse.unr.edu.ar,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860470">https://doi.org/10.5281/zenodo.7860470</a></p>
</li>
<li>
<p>Rodrigo Donoso Vegas, &ldquo;Implementación PIDs en América Latina. Experiencia de la Universidad de Chile,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860501">https://doi.org/10.5281/zenodo.7860501</a></p>
</li>
<li>
<p>Paula Saavedra Ochoa, &ldquo;Gestión de identificadores persistentes consorciados,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860524">https://doi.org/10.5281/zenodo.7860524</a></p>
</li>
<li>
<p>Nydia Lopez and Joel Torres, &ldquo;Consorcio eSCIRE,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860513">https://doi.org/10.5281/zenodo.7860513</a></p>
</li>
</ul>
<p>At the conclusion of the program, audience members participated in an interactive brainstorming session to share their impressions about the state of PIDs implementations in Latin America and raise questions and ideas about how to move forward to achieve greater adoption. The participants responded to three prompts:</p>
<ul>
<li>
<p><em>What would you like to see happen with PIDs in Latin America?</em></p>
</li>
<li>
<p><em>What are the challenges to PIDs adoption in the region?</em></p>
</li>
<li>
<p><em>What can we do to address these challenges?</em></p>
</li>
</ul>
<p>In the responses to the online poll, several themes stood out: the importance of shared infrastructure and of collaboration, the necessity of making PIDs infrastructure more accessible, the value of having more training and capacity-building, and the ongoing need to continue raising awareness of how to work with PIDs and of the benefits they provide.</p>
<hr>


<h2 id="hahahugoshortcode-s4-hbhb"><figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-menti1.png"/>
</figure>
 
</h2>


<h2 id="hahahugoshortcode-s5-hbhb"><figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-menti2.png"/>
</figure>
 
</h2>


<h2 id="hahahugoshortcode-s6-hbhb"><figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-menti3.png"/>
</figure>
 
</h2>
<p>Throughout the day, the sessions and participant engagement demonstrated not only a high level of interest in the topic of PIDs and open science in Latin America, but also generated new ideas and opportunities for future collaborations and follow-up actions.</p>
<p>We at DataCite, ORCID, and ROR express our gratitude for the effusive response to this event and the satellite discussions that took place on the preceding and following days, all of which speak to the leading role that Latin America is playing in developing infrastructures, policies, and practices for promoting collaboration around open knowledge, redistributing global research networks, and making it possible for <em>all</em> countries to have greater access to science and technology. It was great to meet the Latin American community in person and we hope to continue working together to build an open and robust research infrastructure!</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-group-photo.jpg"
         alt="(Happy!) participants of the PIDs and open science in Latin America event."/><figcaption>
            <p>(Happy!) participants of the PIDs and open science in Latin America event.</p>
        </figcaption>
</figure>

<hr>


<h1 id="pids-y-ciencia-abierta-construyendo-comunidad-en-américa-latina">PIDs y Ciencia Abierta: Construyendo comunidad en América Latina 
</h1>
<p><em>Publicado a la vez por <a href="https://doi.org/10.5438/s8rf-f731">DataCite</a> y <a href="https://info.orcid.org/pids-y-ciencia-abierta-construyendo-comunidad-en-america-latina/">ORCID</a></em></p>
<p>Los identificadores persistentes están jugando un papel clave en impulsar una infraestructura de investigación más sólida e iniciativas de ciencia abierta en América Latina. Este fue el tema principal del evento &ldquo;Identificadores Persistentes (PIDs) y Ciencia Abierta en América Latina&rdquo; (<a href="https://twitter.com/search?q=%23PIDsLATAM23&amp;src=typed_query">#PIDsLATAM23</a>) realizado el 18 de abril en Buenos Aires (Argentina) en el marco de la de <a href="https://csvconf.com/">csv,conf,v7</a>.</p>
<p>Organizado por <a href="https://datacite.org/">DataCite</a>, <a href="https://orcid.org">ORCID</a> y <a href="https://ror.org">ROR</a>, el evento contó con más de 70 personas vinculadas a la investigación en América Latina y otras regiones, representando más de 40 instituciones diferentes.</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-event-kickoff.jpg"
         alt="Inicio del evento: (de izquierda a derecha) [Maria Gould](https://orcid.org/0000-0002-2916-3423), [Ana Cardoso](https://orcid.org/0000-0002-8631-3838), [Gabriela Mejias](https://orcid.org/0000-0002-1598-7181)."/><figcaption>
            <p>Inicio del evento: (de izquierda a derecha) <a href="https://orcid.org/0000-0002-2916-3423">Maria Gould</a>, <a href="https://orcid.org/0000-0002-8631-3838">Ana Cardoso</a>, <a href="https://orcid.org/0000-0002-1598-7181">Gabriela Mejias</a>.</p>
        </figcaption>
</figure>

<p>Dirigido a gestores de investigación, directores de bibliotecas y personal técnico, el programa de un día completo contó con una serie de presentaciones detalladas, todas en español, sobre cómo se implementan y utilizan los identificadores persistentes en entornos nacionales, consorciados e institucionales que apoyan el acceso abierto, la ciencia e incrementan la descubribilidad y visibilidad de la investigación en América Latina.</p>
<p>El objetivo fue presentar casos de uso e historias de éxito así como reunir a personas con intereses y desafíos en común en un espacio de habla hispana.</p>
<p>El primer grupo de presentaciones se centró en iniciativas a gran escala que tienen como objetivo promover la ciencia abierta a través de identificadores persistentes (PIDs). Gustavo Durand de <a href="https://dataverse.org/">The Dataverse Project</a> expuso cómo se están implementando los PIDs en la plataforma para hacer que los datos de investigación y otros trabajos estén disponibles de manera más abierta; Abel del Carpio del <a href="https://www.gob.pe/concytec">CONCYTEC</a> habló sobre la estrategia de la organización en el aprovechamiento de los PIDs en Perú; y Washington Segundo de <a href="https://www.gov.br/ibict/pt-br">IBICT</a> y <a href="https://www.lareferencia.info/es/">LA Referencia</a> mostró los esfuerzos tanto de Brasil como de otras regiones en América Latina para desarrollar una infraestructura de ciencia abierta utilizando PIDs.</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-panel1.jpg"
         alt="Panelistas de la sesión *PIDs y ciencia abierta* (de izquierda a derecha) [Washington Segundo](https://orcid.org/0000-0003-3635-9384) (IBICT, Brasil), [Gustavo Durand](https://orcid.org/0000-0002-2188-2570) (Dataverse, USA), [Abel del Carpio](https://orcid.org/0000-0003-4469-6516) (CONCYTEC, Peru)."/><figcaption>
            <p>Panelistas de la sesión <em>PIDs y ciencia abierta</em> (de izquierda a derecha) <a href="https://orcid.org/0000-0003-3635-9384">Washington Segundo</a> (IBICT, Brasil), <a href="https://orcid.org/0000-0002-2188-2570">Gustavo Durand</a> (Dataverse, USA), <a href="https://orcid.org/0000-0003-4469-6516">Abel del Carpio</a> (CONCYTEC, Peru).</p>
        </figcaption>
</figure>



<h3 id="presentaciones-de-la-sesión-1">Presentaciones de la Sesión 1: 
</h3>
<ul>
<li>
<p>Gustavo Durand, &ldquo;Dataverse y los PIDs&rdquo;, <a href="https://doi.org/10.5281/zenodo.7860265">https://doi.org/10.5281/zenodo.7860265</a></p>
</li>
<li>
<p>Abel del Carpio, &ldquo;Plataformas de CTI y los identificadores persistentes (PIDs) en el Perú,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860287">https://doi.org/10.5281/zenodo.7860287</a></p>
</li>
<li>
<p>Washington Segundo, &ldquo;Los PIDs y la ciencia abierta: perspectiva Brasil y América Latina,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860279">https://doi.org/10.5281/zenodo.7860279</a></p>
</li>
</ul>
<p>El siguiente grupo de presentaciones se centró en los servicios de identificadores persistentes globales para construir una infraestructura de investigación más abierta e interoperable en América Latina y a nivel global, con <a href="https://datacite.org">DataCite</a> (Gabi Mejias), <a href="https://orcid.org">ORCID</a> (Ana Cardoso) y <a href="https://ror.org">ROR</a> (Maria Gould).</p>


<h3 id="presentaciones-de-la-sesión-2">Presentaciones de la Sesión 2: 
</h3>
<ul>
<li>
<p>Gabi Mejias, &ldquo;Conectando la investigación con DataCite,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860319">https://doi.org/10.5281/zenodo.7860319</a></p>
</li>
<li>
<p>Ana Cardoso, &ldquo;PIDs y ORCID,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860315">https://doi.org/10.5281/zenodo.7860315</a></p>
</li>
<li>
<p>Maria Gould, &ldquo;ROR, El Registro Abierto de Identificadores Persistentes para Organizaciones de Investigación,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860464">https://doi.org/10.5281/zenodo.7860464</a></p>
</li>
</ul>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-gabi-ana-maria.jpeg"
         alt="Panelistas de la sesión *PIDs como infraestructura abierta de investigación* (de izquierda a derecha) Gabi Mejias (DataCite, Alemania), Maria Gould (ROR, USA), Ana Cardoso (ORCID, México)."/><figcaption>
            <p>Panelistas de la sesión <em>PIDs como infraestructura abierta de investigación</em> (de izquierda a derecha) Gabi Mejias (DataCite, Alemania), Maria Gould (ROR, USA), Ana Cardoso (ORCID, México).</p>
        </figcaption>
</figure>

<p>El tercer y último grupo de presentaciones se enfocó en las implementaciones y desarrollo de identificadores persistentes en la infraestructura de investigación de América Latina. Esta sesión destacó casos de uso institucional en la <a href="https://unr.edu.ar/">Universidad Nacional de Rosario en Argentina</a> (Paola Carolina Bongiovani, Paulina Freán, Analía Salazar) y la <a href="https://www.uchile.cl/">Universidad de Chile</a> (Rodrigo Donoso) para implementar identificadores persistentes en sistemas de bibliotecas y repositorios. Y por último se destacaron algunos ejemplos de implementaciones a nivel consorcial por <a href="https://www.consortia.com.co/">Consortia</a> de Colombia (Paula Saavedra) y <a href="https://www.escire.lat/">eScire</a> (Nydia López y Joel Torres) de México.</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-panel3.jpg"
         alt="Panelistas de la sesión *Implementación de PIDs en América Latina* (de izquierda a derecha) [Analía Salazar](https://orcid.org/0000-0001-8204-1591) (UNR, Argentina), [Paulina Frean](https://orcid.org/0000-0002-9710-0213) (UNR, Argentina), [Nydia López](https://orcid.org/0009-0007-3657-5817) (eScire, Mexico), [Joel Torres](https://orcid.org/0000-0001-6739-5906) (eScire, Mexico), [Paola Bongiovani](https://orcid.org/0000-0003-0049-9086) (UNR, Argentina), [Rodrigo Donoso Vegas](https://orcid.org/0000-0001-5635-4324) (Universidad de Chile, Chile), [Paula Saavedra](https://orcid.org/0000-0001-5383-3020) (Consortia, Colombia)."/><figcaption>
            <p>Panelistas de la sesión <em>Implementación de PIDs en América Latina</em> (de izquierda a derecha) <a href="https://orcid.org/0000-0001-8204-1591">Analía Salazar</a> (UNR, Argentina), <a href="https://orcid.org/0000-0002-9710-0213">Paulina Frean</a> (UNR, Argentina), <a href="https://orcid.org/0009-0007-3657-5817">Nydia López</a> (eScire, Mexico), <a href="https://orcid.org/0000-0001-6739-5906">Joel Torres</a> (eScire, Mexico), <a href="https://orcid.org/0000-0003-0049-9086">Paola Bongiovani</a> (UNR, Argentina), <a href="https://orcid.org/0000-0001-5635-4324">Rodrigo Donoso Vegas</a> (Universidad de Chile, Chile), <a href="https://orcid.org/0000-0001-5383-3020">Paula Saavedra</a> (Consortia, Colombia).</p>
        </figcaption>
</figure>



<h3 id="presentaciones-de-la-sesión-3">Presentaciones de la Sesión 3: 
</h3>
<ul>
<li>
<p>Paola Carolina Bongiovani, Paulina Freán, Analía Salazar, &ldquo;Implementación de PIDs en América Latina REPOSITORIO DE DATOS ACADÉMICOS RDA-UNR dataverse.unr.edu.ar,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860470">https://doi.org/10.5281/zenodo.7860470</a></p>
</li>
<li>
<p>Rodrigo Donoso Vegas, &ldquo;Implementación PIDs en América Latina. Experiencia de la Universidad de Chile,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860501">https://doi.org/10.5281/zenodo.7860501</a></p>
</li>
<li>
<p>Paula Saavedra Ochoa, &ldquo;Gestión de identificadores persistentes consorciados,&rdquo; <a href="https://doi.org/10.5281/zenodo.7860524">https://doi.org/10.5281/zenodo.7860524</a></p>
</li>
<li>
<p>Nydia López y Joel Torres, &ldquo;Consorcio eSCIRE&rdquo;, <a href="https://doi.org/10.5281/zenodo.7860513">https://doi.org/10.5281/zenodo.7860513</a></p>
</li>
</ul>
<p>Ya para finalizar la agenda, los miembros de la audiencia participaron en una sesión interactiva de lluvia de ideas (brainstorming) para compartir sus impresiones sobre el estado de las implementaciones de PIDs en América Latina y plantear preguntas e ideas sobre cómo avanzar para lograr una mayor adopción. Los participantes respondieron a tres preguntas:</p>
<ul>
<li>
<p><em>¿Qué te gustaría que sucediera con los PIDs en América Latina?</em></p>
</li>
<li>
<p><em>¿Cuáles son los desafíos para la adopción de PIDs en la región?</em></p>
</li>
<li>
<p><em>¿Qué crees que podemos hacer para abordar estos desafíos?</em></p>
</li>
</ul>
<p>En las respuestas a la encuesta en línea, se resaltaron varios temas: la importancia de la infraestructura compartida y de la colaboración, la necesidad de hacer que la infraestructura de PIDs sea más accesible, el valor de tener más capacitación y desarrollo de capacidades, y la necesidad constante de continuar creando conciencia de cómo trabajar con PIDs y de los beneficios que estos brindan.</p>
<hr>


<h2 id="hahahugoshortcode-s12-hbhb"><figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-menti1.png"/>
</figure>
 
</h2>


<h2 id="hahahugoshortcode-s13-hbhb"><figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-menti2.png"/>
</figure>
 
</h2>


<h2 id="hahahugoshortcode-s14-hbhb"><figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-menti3.png"/>
</figure>
 
</h2>
<p>A lo largo del día, las sesiones y la participación de los asistentes demostraron no solo un gran nivel de interés en el tema de los PIDs y la ciencia abierta en América Latina, sino que también generaron nuevas ideas y oportunidades para futuras colaboraciones y acciones de seguimiento.</p>
<p>Desde DataCite, ORCID y ROR expresamos nuestro agradecimiento por la positiva y efusiva respuesta a este evento y las discusiones satélite que tuvieron lugar en los días anteriores y posteriores, todo lo cual habla del papel de liderazgo que está jugando América Latina en el desarrollo de infraestructuras, políticas y prácticas para promover la colaboración en torno al conocimiento abierto, redistribuir las redes globales de investigación y hacer posible que todos los países tengan un mayor acceso a la ciencia y la tecnología. ¡Fue increíble conocer a la comunidad latinoamericana en persona y esperamos seguir trabajando juntos para construir una infraestructura de investigación abierta y sólida en la región!</p>
<figure class="blog-figure"><img src="/img/blog/pidslatam/pidslatam-group-photo.jpg"
         alt="Participantes (¡Felices!) del evento PIDs y ciencia abierta en América Latina."/><figcaption>
            <p>Participantes (¡Felices!) del evento PIDs y ciencia abierta en América Latina.</p>
        </figcaption>
</figure>

]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Persistent identifiers are playing a key role in driving more robust research infrastructure and open science initiatives across Latin America.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: ROR in Scholastica Products</title><id>https://doi.org/10.71938/q4ps-gb98</id><link rel="alternate" href="https://ror.org/blog/2023-03-30-scholastica-case-study/"/><published>2023-03-30T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p><a href="https://scholasticahq.com/">Scholastica</a>, a journal publishing technology solutions provider, has incorporated ROR IDs into its Peer Review System, Production Service, and Open Access Publishing Platform. In this interview with Cory Schires, Scholastica co-founder and CTO, we discuss the two phases of ROR implementation, the pros and cons of displaying ROR IDs, what makes sense about ROR&rsquo;s level of granularity, and the importance of reliable APIs.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;I would say the benefit to the users is getting better metadata. It&rsquo;s an obvious answer, but users don&rsquo;t always know about metadata. They also might not think of metadata as a feature, but we do. At Scholastica, we care about taking steps to enrich metadata – like adding ROR IDs, for example, on behalf of our customers, so they don&rsquo;t <em>have</em> to worry about the technical aspects of metadata collection or creation and can instead focus on maximizing the discovery benefits.&rdquo;</p>
<p>&ldquo;Before ROR, I was aware of some of the commercial competitors, but they were prohibitively expensive for many smaller journals and more difficult to integrate. We work with bigger journals and presses, but we also work with many small journals, and the price was a concern for our customers. When ROR came out as something that was free and community-supported, that was super important.&rdquo;</p>
<p>&ldquo;I would say one thing I like about how y&rsquo;all have set up ROR is your approach to parent-child or nested institutions. Obviously, you can go super deep on that and get extremely detailed. And I think y&rsquo;all have helpfully resisted the urge to go deeper than makes sense, in my opinion. I mean, people don&rsquo;t always realize that: it&rsquo;s kind of counterintuitive that there can be an inverse relationship between the richness of a dataset and its utility. It can get so granular, and that&rsquo;s cool, but then it&rsquo;s really hard to make any kind of inference from the data.&rdquo;</p>
<p>&ndash; Cory Schires, Co-founder and Chief Technology Officer, Scholastica</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thanks for joining us! Please tell us your name, title, and organization.</p>


<h3 id="hahahugoshortcode-s2-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>My name is Cory Schires. I am a co-founder and CTO at <a href="https://scholasticahq.com">Scholastica</a>.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about Scholastica.</p>


<h3 id="hahahugoshortcode-s4-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>Scholastica <a href="https://scholasticahq.com/about/">was founded in 2012</a> by myself, Brian Cody, and Robert Walsh. We met at the University of Chicago. To this day, we all share a deep affection for academia. Scholastica now has more products, and we&rsquo;ve expanded over the last ten years, but our core mission remains the same. We want to empower academic organizations of any size to publish top quality journals, more effectively, efficiently, and affordably. We&rsquo;re also huge proponents of open access. Currently, all the journals published on Scholastica are open access.</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s amazing. When and where did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s6-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>I don&rsquo;t remember exactly. It was probably in the context of <a href="https://www.Crossref.org/community/ror/">Crossref</a>. Actually, I also often find out about industry developments on <a href="https://blog.scholasticahq.com/">Scholastica&rsquo;s blog</a>, run by our marketing director Danielle Padula. It&rsquo;s a good resource. Like others in the industry, I&rsquo;m busy trying to keep track of many different things, so I use our blog to stay abreast of new developments.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I was about to say! Danielle is so impressive. She actually just published <a href="https://blog.scholasticahq.com/post/ror-use-cases-for-publishers-interview/">an interview with me</a>, so we have dueling interviews. She&rsquo;s very organized and on top of things. I&rsquo;m jealous.</p>


<h3 id="hahahugoshortcode-s8-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>So am I.</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Who were the primary advocates of implementing ROR at your organization?</p>


<h3 id="hahahugoshortcode-s10-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>I would say it was the sales team and Brian Cody, our CEO. The sales team is obviously always talking to a lot of prospective customers and sometimes to current customers, and people are always asking for things. We&rsquo;re very dutiful about keeping notes on all of that and tracking what people are asking for. So while we heard it from the sales team, they&rsquo;re ultimately relaying customer requests. If enough people ask for something, we&rsquo;ll probably build it.</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>You&rsquo;re obviously very good about <a href="https://blog.scholasticahq.com/categories/new-features/">announcing new features in your products</a>. We&rsquo;ve seen you do that for ROR, and for other features as well. I&rsquo;ve been reading around in your announcements, and they are frequent and detailed and impressive.</p>


<h3 id="hahahugoshortcode-s12-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>Thanks. Again, credit to Danielle for marketing. At Scholastica, we all advocate for shipping features that are important to industry partners and improving our existing integrations to better support the flow of research. We like to stay connected with the broader publishing and research community not only to keep raising awareness of Scholastica, but also to support important initiatives like ROR, so it&rsquo;s a win-win.</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s great to hear. Even being fairly new to ROR, having been here a little over nine months, I could already tell coming in that awareness of ROR is not much of a problem, certainly not in the United States and Europe and Australia. We&rsquo;re also doing a lot of outreach to Asia and South America, to the extent that we can, and that&rsquo;s been terrific. But it has seemed to me that there is a lot of buzz about ROR. And I am glad to hear that it&rsquo;s something you feel you can brag about.</p>


<h3 id="hahahugoshortcode-s14-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>It solves a problem. And with all PIDs, the more people use them, the better they are.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That is so, so right. And while it&rsquo;s always hard to say someone is absolutely the first, as far as we can tell, Scholastica is one of the first major service providers to do such a thorough integration of ROR, where it&rsquo;s in more than one of your products. I&rsquo;ve talked to a number of people at other companies that are thinking about it or working on it, and there are some other partial implementations or ones that are for specific customers. But anyway, congrats.</p>


<h3 id="hahahugoshortcode-s16-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>That&rsquo;s good to hear, though I am a little surprised. It was on our internal roadmap for, I don&rsquo;t know, at least six months. It felt like we were saying &ldquo;We&rsquo;re going to get to this&rdquo; for quite awhile.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>The funny thing is that six months sounds fast to me! My certification has expired, but at one point I was a certified Agile product owner, and having worked on several software projects, especially in university libraries, I know that it&rsquo;s very easy for things to sit on roadmaps for months and months and months and months. So, good for you! And now can you describe in general how you&rsquo;ve implemented ROR in your systems?</p>


<h3 id="hahahugoshortcode-s18-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>Sure. It wasn&rsquo;t too difficult for us. One thing you already touched on is that we have separate products for <a href="https://scholasticahq.com/features/">peer review</a>, <a href="https://scholasticahq.com/typesetting/">production</a>, and <a href="https://scholasticahq.com/publishing-features/">publishing</a>. That means we had to put ROR support in at least two places: our manuscript submission form and the form that is used to create an article. So that was a consideration from the start. That actually kind of divided the project into two phases. First, you&rsquo;ve got to collect the data; you&rsquo;ve got to update your web forms, right? That&rsquo;s the collecting part. Second, there&rsquo;s using it &ndash; distributing it. Now that you have this data, you need to make sure that it&rsquo;s getting into your <a href="https://www.crossref.org/documentation/schema-library/markup-guide-metadata-segments/affiliations/">Crossref metadata</a>, that it&rsquo;s getting into your <a href="https://ror.readme.io/docs/jats">JATS XML</a>, that it&rsquo;s getting into <a href="https://www.portico.org/">Portico</a> and so forth. But anyway, you&rsquo;ve got to make sure that anywhere you&rsquo;re sending article data, ROR is going along for the ride.</p>
<p>And actually we did it in the reverse order. We did the second part first, which was to set up all our downstream integrations and different exports to understand and expect ROR identifiers, and we shipped that all the way to production. It wasn&rsquo;t doing anything, but it was just there waiting to be useful. And then we did the first part, which was adding it to the various web forms so that we were actually collecting it. That allowed us to release it in two separate phases. You mentioned Agile project planning: we&rsquo;re always looking for ways to break up an otherwise big project into a couple releases, if possible. So that&rsquo;s the approach we took. That can be a good tip for anyone who&rsquo;s similarly trying to keep this kind of project bite-sized.</p>


<h3 id="hahahugoshortcode-s19-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, that&rsquo;s true: there&rsquo;s the collecting of the ROR ID as one phase and the sending of the ROR ID as another phase.  Actually, I know of several integrations that have done one or the other, not both. I know of people who are adding the ROR manually to their JATS XML, so they have people who as far as I know just copy and paste it into a template and send that to Crossref &ndash; smaller organizations. I also know of some systems that collect the ROR for internal use within systems and then don&rsquo;t send it in DOIs to Crossref. The way you&rsquo;ve done it is the ideal way.</p>
<p>Tell me, in your manuscript submission and peer review system, are you collecting ROR IDs for all authors or just for the corresponding author?</p>


<h3 id="hahahugoshortcode-s20-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>The corresponding author fills out the form on behalf of all the co-authors, and when they&rsquo;re filling that out, the fields for each author are the same &ndash; or mostly the same. There&rsquo;s a little bit of difference for the corresponding author, but they&rsquo;re mostly the same. And the ROR integration part is the same. So yes, we collect it for all authors.</p>
<figure><img src="/img/blog/scholastica/ror-integration-manuscript-submission-form.png"
         alt="Scholastica Peer Review manuscript submission form"/>
</figure>



<h3 id="hahahugoshortcode-s22-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And how do you handle when authors have multiple affiliations?</p>


<h3 id="hahahugoshortcode-s23-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>Even apart from ROR you need your submission form to understand that authors may have zero or more affiliations. If your form is assuming each author will have one affiliation, that&rsquo;s not terrible, but it doesn&rsquo;t mirror what we know to be real in the world. So you really need that as a starting point. And I don&rsquo;t think it&rsquo;s particularly novel, but there&rsquo;s a standard &ldquo;Add another&rdquo; link you can click. You&rsquo;ll see that in different online tools. The approach we took feels fairly conventional in a good way, I think: you don&rsquo;t need to get too clever with something like that. We accept the institution name, and that bit is the ROR <code>name</code>, and that field is what we use to find and associate the ROR ID. And then we also have the department field, which doesn&rsquo;t really have anything to do with ROR, but that gives you a place to put the other information that you might want to put.</p>


<h3 id="hahahugoshortcode-s24-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s exactly the way we <a href="https://ror.readme.io/docs/forms">recommend to do it</a>, so that&rsquo;s great. How do you handle it when an author is affiliated with an organization that is not in ROR?</p>


<h3 id="hahahugoshortcode-s25-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>Going back to what I said before, you need to get those two fields. We call it &ldquo;Institution&rdquo; and &ldquo;Department.&rdquo; As people are typing into that &ldquo;Institution&rdquo; name field, we have a user interface element that people call a &ldquo;typeahead.&rdquo; I guess you know what that is.</p>


<h3 id="hahahugoshortcode-s26-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I had actually never seen the term &ldquo;typeahead&rdquo; before I started at ROR, but that is what we call it in all of our <a href="https://ror.readme.io/docs/forms">documentation</a> and our <a href="https://ror-community.github.io/ror-typeahead-demos/">demo</a> and our <a href="https://github.com/ror-community/ror-typeahead-demos">demo code</a>. I thought of it as a &ldquo;suggest box&rdquo; or something.</p>


<h3 id="hahahugoshortcode-s27-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>It&rsquo;s an autosuggest sort of thing, you know, like when you&rsquo;re doing a search, it suggests results. Anyway, so as you&rsquo;re typing in the name field, we&rsquo;re suggesting results in that typeahead. And if they click on one of those results, great: it will fill up the field, and we&rsquo;ll have made the association under the hood. In the interface, you could choose to expose the ROR ID or put a green check or something, but we didn&rsquo;t do that. We are just collecting it unbeknownst to the users. And to answer your question about organizations that are not in ROR, because it&rsquo;s a typeahead field, they can also just type whatever they want. They don&rsquo;t have to pick something from that list. We did think briefly about requiring ROR IDs, but we decided against that.</p>
<figure><img src="/img/blog/scholastica/ror-integration-article-form.gif"
         alt="ROR typeahead in Scholastica OA Publishing Platform form"/>
</figure>



<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We recommend against that, because while we do have a process for people to request a new ROR ID for an organization, right now it&rsquo;s just a <a href="https://curation.ror.org">Google form to request a single new organization</a>. And it does take us roughly four to six weeks to get those new organizations into ROR, because we <a href="https://ror.org/registry/#curation-model">curate those requests</a>. We have to check and see, &ldquo;Is this a real organization? Is it in scope?&rdquo; So we <a href="https://ror.readme.io/docs/forms">ask people not to require a ROR ID in forms</a>, because otherwise we&rsquo;ll get requests from people saying, you know, &ldquo;I can&rsquo;t submit my manuscript, and I have to do it by tomorrow.&rdquo; So kudos to you: exactly the right way to do it. And we&rsquo;d love to <a href="https://github.com/ror-community/ror-roadmap/issues/7">build a process for collecting some of this data from ROR users</a>, getting information on what these organizations are that people aren&rsquo;t finding in ROR, to see if we can add them. We&rsquo;d like to do that someday, but it&rsquo;ll take a while to figure out.</p>


<h3 id="hahahugoshortcode-s30-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>We&rsquo;d be happy to send you a CSV of organizations that people have entered that aren&rsquo;t in ROR.</p>


<h3 id="hahahugoshortcode-s31-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Feel free to do that &ndash; we&rsquo;d love to look at it. You can email it to me. That&rsquo;s where we are right now: we&rsquo;re on the &ldquo;email us a spreadsheet&rdquo; end of the spectrum of data collection methods, which is fairly primitive, but we&rsquo;d definitely be happy for you to do that. So, just a couple more questions. What benefits to you and your users is this ROR implementation providing?</p>


<h3 id="hahahugoshortcode-s32-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p><strong>I would say the benefit to the users is getting better metadata. It&rsquo;s an obvious answer, but users don&rsquo;t always know about metadata. They also might not think of metadata as a feature, but we do. At Scholastica, we care about taking steps to enrich metadata – like adding ROR IDs, for example, on behalf of our customers, so they don&rsquo;t <em>have</em> to worry about the technical aspects of metadata collection or creation and can instead focus on maximizing the discovery benefits.</strong></p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s34-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>We really tried to integrate ROR into every possible way that we send data, to Portico or whatever the case may be; we really tried to add it everywhere. And if we missed a place, someone will tell us eventually, and we&rsquo;ll add it there as well. But that&rsquo;s the main thing, is just the richer metadata.</p>
<p>Also, and there aren&rsquo;t any short-term plans to do this, but this does give us the ability to do things that people have asked for occasionally. For example, &ldquo;Oh, I&rsquo;d like to see a list of all my past authors, and I would like to be able to filter that by institution.&rdquo; We couldn&rsquo;t do that well without a stable identifier to know which institution is which, so I think it opens the door for some possible new features for us. It also might be interesting at some point to query the data to produce something like a pie chart showing the breakdown of the institutions authors are coming from by <a href="https://carnegieclassifications.acenet.edu/classification_descriptions/basic.php">Carnegie classification</a>, for example. I don&rsquo;t know that that would be anything other than merely interesting.</p>


<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Sure, yeah. It&rsquo;s interesting that you&rsquo;re not showing ROR IDs to the users, and that you&rsquo;ve said that whether users care about it or not, the ROR ID and that enriched metadata is a service to them, and a feature, and it might enable other features. I think ORCID originally thought that they would be a behind-the-scenes piece of technical infrastructure, and I think we thought the same at ROR, that it would be mostly implemented under the hood in the way you&rsquo;ve done it.</p>
<p>But there are systems that do <a href="https://ror.readme.io/docs/display">display ROR IDs</a> to their users, and it is kind of nice, because you can click on the link and go see some concise information about that organization. ROR is a bit different from ORCID in that ORCID records have to be maintained by individual researchers, so they do need to see it, whereas <a href="https://ror.org/about/faqs/#who-is-the-contact-person-at-my-organization-responsible-for-our-ror-id">ROR IDs are not maintained by individual institutions</a>. So I think you&rsquo;re exactly right that users don&rsquo;t really need to see the ROR ID, but they will benefit from that behind-the-scenes tidiness and enrichment of the metadata.</p>


<h3 id="hahahugoshortcode-s36-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>As I recall, we did talk about that: &ldquo;Should we show it?&rdquo; We also talked about that case where the user types in an organization that is not in ROR; we thought about putting something like a yellow warning saying, &ldquo;This isn&rsquo;t in ROR.&rdquo; We even actually started to code that, but then we pulled it out of the plan, because as we were thinking more about it, it was clear that people were going to see it and they would think something was wrong.</p>


<h3 id="hahahugoshortcode-s37-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I have seen systems where they display a green checkmark or something similar when the organization is in ROR, and when they do that they add explanatory text and sometimes a link to the ROR website, so it can work, but we&rsquo;ve had some misunderstandings from that kind of implementation.</p>


<h3 id="hahahugoshortcode-s38-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>&ldquo;These three authors have green checkmarks and this one doesn&rsquo;t: what am I doing wrong?&rdquo;</p>


<h3 id="hahahugoshortcode-s39-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yeah, it&rsquo;s like, &ldquo;Oh, no, nothing is wrong.&rdquo; It&rsquo;s a perfectly legitimate organization, or the author is an independent researcher, or, you know, there are all kinds of legitimate reasons for there not to be a ROR ID for an organization.</p>


<h3 id="hahahugoshortcode-s40-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>And we can layer that in later if there&rsquo;s some reason to, but this is where we&rsquo;ve started.</p>


<h3 id="hahahugoshortcode-s41-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What do you hope ROR does in the future? What opportunities should we pursue, or what would make it easier for you and other service providers to adopt ROR? What feature suggestions or bug fixes do you have for us?</p>


<h3 id="hahahugoshortcode-s42-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>The main thing is that I&rsquo;m just glad ROR exists. There are all kinds of reasons why you would want to know what institution an author is coming from. <strong>Before ROR, I was aware of some of the commercial competitors, but they were prohibitively expensive for many smaller journals and more difficult to integrate. We work with bigger journals and presses, but we also work with many small journals, and the price was a concern for our customers. When ROR came out as something that was free and community-supported, that was super important.</strong></p>
<p>But that doesn&rsquo;t really answer your question. <strong>I would say one thing I like about how y&rsquo;all have set up ROR is your <a href="https://ror.readme.io/docs/relationships">approach to parent-child or nested institutions</a>. Obviously, you can go super deep on that and get extremely detailed. And I think y&rsquo;all have helpfully resisted the urge to go deeper than makes sense, in my opinion. I mean, people don&rsquo;t always realize that: it&rsquo;s kind of counterintuitive that there can be an inverse relationship between the richness of a dataset and its utility. It can get so granular, and that&rsquo;s cool, but then it&rsquo;s really hard to make any kind of inference from the data</strong>. It becomes like <a href="https://en.wikipedia.org/wiki/On_Exactitude_in_Science">Borges&rsquo;s map</a>. It&rsquo;s really exact, but it&rsquo;s useless.</p>
<figure class="blog-figure"><img src="/img/blog/scholastica/map-territory.png"
         alt="What do you consider the largest map that would be really useful? - About six inches to the mile. - Only six inches! exclaimed Mein Herr. We very soon got to six yards to the mile. Then we tried a hundred yards to the mile. And then came the grandest idea of all ! We actually made a map of the country, on the scale of a mile to the mile! - Have you used it much? I enquired. - It has never been spread out, yet, said Mein Herr: the farmers objected: they said it would cover the whole country, and shut out the sunlight! So we now use the country itself, as its own map, and I assure you it does nearly as well."/><figcaption>
            <p>Carroll, Lewis, and Harry Furniss. Sylvie and Bruno Concluded. London and New York: Macmillan and Co., 1893. <a href="https://hdl.handle.net/2027/uc2.ark:/13960/t6ww7b09m?urlappend=%3Bseq=208">HathiTrust</a>. Accessed 22 Mar. 2023. Influence on the one-paragraph Jorge Luis Borges short story &lsquo;Del rigor en la ciencia&rsquo; <a href="https://en.wikipedia.org/wiki/On_Exactitude_in_Science">(On Exactitude in Science)</a>, first published 1946.</p>
        </figcaption>
</figure>

<p>When I was looking into that and thinking about that problem, I thought, &ldquo;Oh, I like the approach they&rsquo;ve taken here. I agree with this.&rdquo; It made me feel like y&rsquo;all are &hellip; well, the opinion aligns with mine, which is comforting to me.</p>


<h3 id="hahahugoshortcode-s44-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. I don&rsquo;t know if you saw our recent <a href="/blog/2023-02-27-parents-children-and-other-relationships-in-ror/">blog post about hierarchies and relationships</a> &ndash; we say some of those exact things. We found out that somehow people were under the impression that ROR didn&rsquo;t do hierarchy at all.</p>


<h3 id="hahahugoshortcode-s45-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>Yeah, that is what I thought initially. But I remember looking at the API and seeing, &ldquo;Okay, yes, there are some relationships.&rdquo;</p>


<h3 id="hahahugoshortcode-s46-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>You&rsquo;ve been so kind about ROR when I asked you to give us constructive feedback! And hopefully, we&rsquo;ll get to that. But honestly, I do think that is one piece of constructive feedback that we needed to hear, that we were mis-communicating about what ROR can do in terms of parent-child institutional relationships. People often ask us about university departments in particular, and we <a href="/about/faqs/#why-doesnt-ror-include-university-departments">don&rsquo;t do university departments</a>, but that doesn&rsquo;t mean we don&rsquo;t do organizational hierarchy at all. We absolutely do that: for both parent-child relationships and related organizations.</p>
<p>That&rsquo;s why we&rsquo;ve just put out that <a href="/blog/2023-02-27-parents-children-and-other-relationships-in-ror/">explainer blog post</a>, and we&rsquo;ve also beefed up our documentation to say, &ldquo;Yes, we do hierarchy and relationships,&rdquo; so that you don&rsquo;t have to look at the <a href="https://api.ror.org/organizations">JSON in the API</a> in order to figure it out. And we didn&rsquo;t have the relationships in <a href="https://ror.org/search">our search interface</a> for awhile: we just <a href="https://github.com/ror-community/ror-roadmap/issues/54">added that a few months ago</a>. Since a lot of people look at ROR just in that browser search interface, they didn&rsquo;t realize the relationships were there. And we really do think that the degree of granularity we have, which is not as deep as some other organizational identifiers, is a feature, not a bug.</p>


<h3 id="hahahugoshortcode-s47-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>I agree with that. Keep it simple, or at least relatively.</p>


<h3 id="hahahugoshortcode-s48-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>&ldquo;Keep It Simple, Stupid,&rdquo; right? The <a href="https://en.wikipedia.org/wiki/KISS_principle">KISS acronym for that design principle</a>.</p>


<h3 id="hahahugoshortcode-s49-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>I might have more feedback on ROR as we start to use it more internally, but one thing I would say is that you should invest in the resiliency of the API and of the organization. This is a small example, and I love Crossref, don&rsquo;t get me wrong, but we had some tests in our test suite that hit the Crossref API, and we had to disable those by default because sometimes the API goes down, and then our test suite is failing and causes problems for us. The ROR API is increasingly useful, I think, and we service providers need it to work for integrations.</p>


<h3 id="hahahugoshortcode-s50-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That is great advice. <a href="https://ror1.statuspage.io/uptime">Our API has had wonderful uptime so far</a>, and we have a little <a href="https://api.ror.org/heartbeat">API heartbeat</a>, so you can just send a little quick query to that to see if the ROR API is OK. But we definitely have been seeing increased use, and increased use, and increased use, which is great. We want that to happen, but it does mean that there are more costs. As you might know, we are <a href="https://ror.org/about/faqs/#who-runs-ror">supported by three organizations</a>, and <a href="https://datacite.org">DataCite</a> is the one that pays for our technical infrastructure.</p>
<p>And I think organizational resiliency is part of technical resiliency. Last fall, <a href="https://ror.org/blog/2022-10-10-strengthening-sustainability/">we announced that ROR is now part of the operating budgets of Crossref, DataCite, and the California Digital Library</a>. So we&rsquo;re not going to have a member model, and as long as people are still paying member fees to DataCite and Crossref and tuition to UC colleges, then ROR will be around.</p>


<h3 id="hahahugoshortcode-s51-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>One other thing about the API. As you&rsquo;re obviously aware, you can <a href="https://ror.readme.io/docs/data-dump">get the ROR data dump</a> and query that or you can hit the API, and we evaluated both approaches. Specifically, we were looking at, &ldquo;Is this API going to be fast enough and reliable enough?&rdquo; because we would prefer to hit the API than keep our own copy of the data, and in our testing, it was good. So that&rsquo;s great.</p>


<h3 id="hahahugoshortcode-s52-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And now that we&rsquo;re updating the ROR registry about twice a month, if you&rsquo;re using the data dump, you&rsquo;ll have to make sure that you have the latest version of the data. You <em>can</em> get that programmatically <a href="https://ror.readme.io/docs/data-dump#download-ror-data-dumps-programmatically-with-the-zenodo-api">by querying Zenodo</a>. But if you&rsquo;re querying the API, you don&rsquo;t have to worry about that: you&rsquo;re just getting the updated data automatically.</p>


<h3 id="hahahugoshortcode-s53-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>Sure.</p>


<h3 id="hahahugoshortcode-s54-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Is there anything else you&rsquo;d like to say before we wrap up?</p>


<h3 id="hahahugoshortcode-s55-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>One thing I would say is that this was a team effort. I didn&rsquo;t work that much on the ROR project. Brian Cody, our CEO, did most of the project planning and actually most of the spec writing, which is not usually how he spends most of his time, but he wanted to help get this one done. And several different developers worked on it. Specifically, Tatum Szymczak really carried it across the finish line. A whole bunch of people worked on it, and that&rsquo;s important.</p>


<h3 id="hahahugoshortcode-s56-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes, it is! Well, thanks for talking with me, Cory. It was nice to chat.</p>


<h3 id="hahahugoshortcode-s57-hbhb-cory-schires"><figure class="round-figure"><img src="/img/blog/scholastica/cschires.jpg"
         alt="Avatar of Cory Schires"/>
</figure>
 Cory Schires 
</h3>
<p>I appreciate your time. Thanks for involving me. Happy to be doing this for ROR.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Scholastica has incorporated ROR IDs into its Peer Review System, Production Service, and Open Access Publishing Platform. In this interview with Cory Schires, Scholastica co-founder and CTO, we discuss the two phases of ROR implementation, the pros and cons of displaying ROR IDs, what makes sense about ROR's level of granularity, and the importance of reliable APIs.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Europe PMC Announces ROR Integration into Grant Finder</title><id>https://doi.org/10.71938/vwkf-7493</id><link rel="alternate" href="https://ror.org/blog/2023-03-28-europe-pmc-grant-finder/"/><published>2023-03-28T00:00:00+00:00</published><updated>2026-03-10T08:36:41-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p><a href="https://europepmc.org/">Europe PMC</a>, one of the largest providers of high-performance search tools for life sciences literature, has <a href="http://blog.europepmc.org/2023/03/improved-affiliation-search-for-grants.html">announced</a> that its Grant Finder now incorporates Research Organization Registry (ROR) IDs to help users find active and completed grants awarded by Europe PMC funders.</p>
<p>Europe PMC provides comprehensive access to life sciences literature from trusted sources. It&rsquo;s available to anyone, anywhere for free. Users of Europe PMC can search and read 42 million publications, preprints and other documents enriched with links to supporting data, reviews, protocols, and other relevant resources. Europe PMC partners with other organisations to build robust public tools to provide access to open content and data and to advance life sciences research.</p>
<p>Integrating ROR IDs into the Europe PMC Grant Finder means that organizational name variations are now aggregated under a single name, which allows people to search for different aliases and receive the same set of results. Europe PMC used the ROR API in conjunction with a manual screening step to match 82% of associated institutions in the Europe PMC grants database (GRIST) to over 2600 ROR IDs. ROR IDs are now returned as part of the GRIST API core response along with the institution name. The ROR ID is also used to auto-suggest a matching affiliation when a user searches for grants awarded to a particular institution by typing a name in the &ldquo;Affiliation&rdquo; field of the Grant Finder. This integration is now live at <a href="http://europepmc.org/grantfinder">http://europepmc.org/grantfinder</a>.</p>


<h3 id="hahahugoshortcode-s0-hbhb"><figure><img src="/img/blog/europepmc/europe-pmc-grant-finder-ror.png"
         alt="ROR-powered affiliation chooser in Europe PMC Grant Finder"/>
</figure>
 
</h3>
<p>Europe PMC has supported ROR IDs for author affiliations since 2019, and in 2021 ROR IDs were added to a subset of publications by EMBL-affiliated authors as part of <a href="https://www.project-freya.eu">Project FREYA</a>. Additional work for Project FREYA included the development of the <a href="https://gitlab.ebi.ac.uk/literature-services/public-projects/ROR-proto-EMBL">ROR Predictor prototype</a>, which enables matching author affiliations in research outputs with PMIDs to ROR IDs. Europe PMC also uses ROR IDs to enrich the DOI metadata of the <a href="https://www.crossref.org/community/grants/">research grants it registers with Crossref</a> on behalf of the Wellcome Trust.</p>
<p>“Linking research outputs via persistent identifiers makes related information more findable and helps track provenance. Incorporating ROR IDs for grant awards supports DORA-compatible research reporting for funders and institutions who need to understand the impact of research,” said Melissa Harrison, Team Leader of Literature Services at EMBL-EBI.</p>
<p>Said ROR Project Lead Maria Gould of the California Digital Library, &ldquo;Europe PMC has been remarkably forward-looking in its early use of ROR, and we are excited to see other services follow their example.&rdquo;</p>
<p>See also the Crossref news release, &ldquo;<a href="https://www.eurekalert.org/news-releases/984137">Europe PMC integrates ROR into its Grant Finder</a>.&rdquo;</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span><ul>
<li><a href="mailto:helpdesk@europepmc.org">Contact helpdesk@europepmc.org</a> for further information about Europe PMC&rsquo;s use of ROR.</li>
<li><a href="mailto:support@ror.org">Contact Amanda French</a>, ROR Technical Community Manager, for more information about ROR.</li>
<li>Register for an upcoming <a href="/events">ROR Community Call or other event</a> to hear about integrations like these and more.</li>
</ul>
</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Europe PMC, one of the largest providers of high-performance search tools for life sciences literature, has announced that its Grant Finder now incorporates Research Organization Registry (ROR) IDs to help users find active and completed grants awarded by Europe PMC funders.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: ROR at Rockefeller University Press and Silverchair</title><id>https://doi.org/10.71938/t63t-g186</id><link rel="alternate" href="https://ror.org/blog/2023-03-20-rup-silverchair-case-study/"/><published>2023-03-20T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Rockefeller University Press has incorporated ROR into many aspects of their workflow, and their platform provider Silverchair was there to help. This case study is based on the presentation about this integration given at the December 2022 ROR Community Call by Rob O&rsquo;Donnell, Senior Director of Publishing at RUP, and Emily Hazzard, Product Operations Analyst at Silverchair.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;When we get new deals, we set them up in eJournal Press systems. They&rsquo;re configured using ROR IDs, and the process is then completely driven by ROR. The system looks for a match between affiliation ROR IDs and deal ROR IDs. And once there&rsquo;s a match, we know the article is eligible for that deal.&rdquo;</p>
<p>&ldquo;I&rsquo;m excited that we&rsquo;re now live with our Open Access Switchboard eligibility workflow, which is known as &ldquo;E1&rdquo; in Switchboard lingo. If a manuscript is not recognized as part of a deal using what was shown in the last step, we use the ROR IDs we&rsquo;ve gathered to sequentially query each of those organizations to ask if they&rsquo;ll pay the gold OA fee for the article.&rdquo;</p>
<p>&ldquo;We include ROR IDs in our JATS files, and after an article is published, Silverchair includes those IDs in Crossref deposits.&rdquo;</p>
<p>&ndash; Rob O&rsquo;Donnell, Senior Director of Publishing, Rockefeller University Press</p>
<p>&ldquo;In the institutional management service in SiteManager, we&rsquo;re going to be introducing a way to search institutions based on the ROR ID because it&rsquo;s incredibly convenient.&rdquo;</p>
<p>&ldquo;Something else that we&rsquo;re planning on doing is introducing ROR-based reporting to support those Read and Publish agreements so that we can expose the relationships between those authors and institutions, and really see what&rsquo;s coming out of these institutions. Where is it published? Where does it live? And how do we serve those authors better?&rdquo;</p>
<p>&ndash; Emily Hazzard, Product Operations Analyst, Silverchair</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>We&rsquo;ve asked Rob O&rsquo;Donnell, Senior Director of Publishing at Rockefeller University Press, to tell us about some of the new things they&rsquo;re doing with ROR.</p>


<h3 id="hahahugoshortcode-s2-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p>Thanks for having me. Let me start by giving some pertinent information about <a href="https://rupress.org/">Rockefeller University Press (RUP)</a>. We&rsquo;re a small university press publishing 3 and 1/3 biomedical journals. The one-third is <a href="https://www.life-science-alliance.org/">Life Science Alliance</a>, which is a collaboration open access journal with <a href="https://www.cshlpress.com/">Cold Spring Harbor Laboratory Press</a> and <a href="https://www.embopress.org/">EMBO Press</a>. We&rsquo;re predominantly institutional subscription based, transformative, and we&rsquo;re using Read and Publish deals as a stepping stone on our way to becoming fully open access. We use <a href="https://www.ejournalpress.com/">eJournalPress (EJP)</a> submission, production, and billing systems. Our content is hosted mostly on <a href="https://www.silverchair.com/">Silverchair</a>; LSA&rsquo;s content is on <a href="https://www.highwirepress.com/">HighWire</a>. We&rsquo;re active with the <a href="https://oaswitchboard.org">OA Switchboard</a>, which is a metadata exchange hub for both reporting and eligibility querying.</p>
<figure><img src="/img/blog/rup/slide-01.png"
         alt="RUP Overview: three and one-third journals, institutional subscriptions, user of eJournalPress and Silverchair, active with OA Switchboard"/>
</figure>



<h3 id="hahahugoshortcode-s4-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p>This slide is basically an outline of what I&rsquo;m going to talk about, and I think it&rsquo;s a good visual that shows how much our workflows now revolve around ROR. We&rsquo;re using them really in every aspect of our workflow. I&rsquo;ll start at the top with how we use ROR IDs to model our Read and Publish deals and then work clockwise.</p>
<figure><img src="/img/blog/rup/slide-02.png"
         alt="ROR in RUP workflows: assess deal potential, configure Read and Publish deals, gather ROR IDs upon submission, perform OA Switchboard E1 check, do production check, send ROR IDs to Silverchair and Crossref"/>
</figure>



<h3 id="hahahugoshortcode-s6-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p>We have Read and Publish deals with over 300 institutions across all our deals (totaling &gt;600 ROR IDs). Now that includes consortia, but that&rsquo;s the number of institutions we have so far. To get started, our Institutional Sales Manager, Miguel Peralta, mapped our subscriber IDs to ROR IDs. We already had Ringgold for most of our subscribers, so we started there and we <a href="https://ror.readme.io/docs/mapping#mapping-ringgold-ids-to-ror">went from Ringgold to ISNI to ROR</a>. We also added those ROR IDs to our subscriber records in <a href="https://www.silverchair.com/the-silverchair-platform/tools-technology/">Silverchair&rsquo;s SiteManager</a>, and Emily will talk more about that later. Then we mapped archival author affiliations to ROR; we went back to 2017. Most of that mapping was done via <a href="https://dimensions.ai">Dimensions</a>. We have a subscription to Dimensions, and Dimensions still includes GRID IDs in their results &ndash; not sure if they will update those to ROR IDs. They still use GRID internally, which made it easy to <a href="https://ror.readme.io/docs/mapping#map-a-list-of-ids-to-ror">go from GRID to ROR</a>. Once we had ROR IDs for both authors and subscribers, we were able to match our published output with subscribers in order to model and create our deal proposals. We also use the mapped ROR IDs to alert previous authors at deal institutions that they can now publish at no cost to them.</p>
<figure><img src="/img/blog/rup/slide-03.png"
         alt="Read and Publish Deal Assessment: Mapped subscribers to ROR via Ringgold to ISNI to ROR, ROR IDs housed in Silverchair’s SiteManager, mapped archival author affiliations to ROR, mapped GRID via Dimensions and institution name and email suffix, allowed us to see publishing output per subscribing institution to help model deals and market those deals to former authors, enabled better workflow in place for new content"/>
</figure>



<h3 id="hahahugoshortcode-s8-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p><strong>When we get new deals, we set them up in eJournal Press systems. They&rsquo;re configured using ROR IDs, and the process is then completely driven by ROR. The system looks for a match between affiliation ROR IDs and deal ROR IDs. And once there&rsquo;s a match, we know the article is eligible for that deal.</strong> Because we have consortia and large institutions, we need to map all <a href="https://ror.readme.io/docs/relationships">sub-organizations</a> that are included in deals as well, and this is a manual process. I&rsquo;ve been wondering whether the <a href="https://psiregistry.org">IP Registry / PSI Registry</a> could help here. They now have ROR IDs in the registry. We use PSI and they have our subscribers broken down and clustered into consortia or large institutions with only the sub-institutions as included in our subscriptions or deals. If those ROR IDs were included in the PSI API, it could be a huge help. When there is a match, the corresponding author is alerted that they&rsquo;re eligible for a deal when they proceed to sign their license form. We are currently building other steps in the workflow to alert them earlier in the process.</p>
<figure><img src="/img/blog/rup/slide-04.png"
         alt="Read and Publish deal configuration and eligibility: Deals configured by ROR within eJournalPress submission and billing systems, hierarchies created and children mapped manually, opportunity for IP Registry integration, corresponding authors alerted to eligibility based on affiliation ROR IDs"/>
</figure>



<h3 id="hahahugoshortcode-s10-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p>When we first implemented this workflow we were only collecting ROR IDs for the corresponding author’s current address, which was a problem because that’s not necessarily a manuscript affiliation. Since then we’ve improved the process, and I show that in this short video. EJP has its own instance of the ROR database in their system. When the author is filling out their submission and starts typing the institution name the typeahead is looking up the ROR record in the EJP database. The author chooses the correct institution from the results list and is then presented with a green checkmark next to the institution name, an indication that it has been validated. We also have a new section asking the corresponding author for all of their affiliations. It’s the same process as just described for each affiliation. The video shows what happens if the author does not select from the typeahead menu, and they just hit Save, or if they choose a name that’s not in ROR – they get this message that basically says, “Look, if you leave it this way, you’re not going to be eligible for any free publishing.” Authors can add as many affiliations as needed, and all of those will be checked against our deals to see if the article is eligible. Our policy is that any corresponding author affiliation on the manuscript is eligible.</p>

<div style="position: relative; padding-bottom: 56.25%; height: 0; overflow: hidden;">
  <iframe src="https://www.youtube.com/embed/BmTjUYrYKOQ" style="position: absolute; top: 0; left: 0; width: 100%; height: 100%; border:0;" allowfullscreen title="Rockefeller University Press ROR-powered author affiliations"></iframe>
</div>



<h3 id="hahahugoshortcode-s12-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p>The next slide is about the <a href="https://oaswitchboard.org">OA Switchboard</a>. If you’re not familiar with the Switchboard, I encourage you to look into it! Not only is the functionality important to publishers, institutions, funders, and (indirectly) authors, but the organizations involved are incredibly collaborative, experimental, and working with the goal of industry-wide solutions. <strong>I’m excited that we’re now live with our Open Access Switchboard eligibility workflow, which is known as “E1” in Switchboard lingo. If a manuscript is not recognized as part of a deal using what was shown in the last step, we use the ROR IDs we’ve gathered to sequentially query each of those organizations to ask if they’ll pay the gold OA fee for the article.</strong> We do this during revision when there’s a high chance of acceptance. We don’t do it for every manuscript. That would be overkill. We’re also querying contributing author affiliations, even though we’re not collecting ROR IDs for all authors yet. This is done using <a href="https://www.youtube.com/watch?v=MzfeUJwtDBg&amp;list=PL4n_Cvd0PpoHfsM3_6VfhAovGIfL3Z79x&amp;index=8">smart matching</a> (originally developed by OA Switchboard&rsquo;s tech team, ELITEX, and now available open source) and that’s working well, too. Smart matching maps text-based affiliations to ROR IDs (via the ROR API) and then directs the queries to those institutions. The Switchboard also maps our tagged <a href="https://www.crossref.org/services/funder-registry/">Funder Registry IDs</a> to ROR and queries those organizations as well. So, we’re tagging all affiliations, all funders, and asking them, “Hey, would you be willing to pay for this?” The actual question is “Does this intended publication meet your requirements/policies? If so, will you cover the publication charges?” and each organization can reply yes or no, and cover full or partial payment of the APC. This eligibility querying workflow has the potential to facilitate funding for articles that are not covered under a Read and Publish deal.</p>
<p>And of course, we also use the Switchboard, via ROR tagging, to report on published articles. The “P1” workflow allows participating institutions and funders to see where and when the research from their organization or research they have funded is published by an OA switchboard-participating publisher. Data from all publishers appear in a single dashboard, with the ability to export as consistently formatted Excel or JSON.</p>
<figure><img src="/img/blog/rup/slide-05.png"
         alt="OA Switchboard: Eligibility E1 check, smart matching, corresponding author ROR IDs used as is, contributing author ROR IDs captured via Appetence smart matching, Funder IDs mapped to ROR by Appetence, post-publication P1 reporting routed via ROR"/>
</figure>



<h3 id="hahahugoshortcode-s14-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p>And just a quick mention: we&rsquo;ve recently introduced a step in our production workflow where production editors will check for any missing ROR IDs. If they find that an institution is missing from the ROR database, they <a href="https://curation-request.ror.org">request that they be added to ROR</a>. In some cases, authors like to use a medical school or a hospital if that is where they are rather than the parent organization. And that ends up coming across production without a ROR. So Adam, I&rsquo;m sorry if you&rsquo;ve been getting <a href="https://github.com/ror-community/ror-updates/issues">requests for med schools</a>.</p>


<h3 id="hahahugoshortcode-s15-hbhb-adam-buttrick"><figure class="round-figure"><img src="/img/adam-sq-200.jpg"
         alt="Avatar of Adam Buttrick"/>
</figure>
 Adam Buttrick 
</h3>
<p>That&rsquo;s okay. That&rsquo;s <a href="/adam-buttrick">my job</a>.</p>


<h3 id="hahahugoshortcode-s16-hbhb-rob-odonnell"><figure class="round-figure"><img src="/img/blog/rup/rodonnell.jpg"
         alt="Avatar of Rob O&#39;Donnell"/>
</figure>
 Rob O&rsquo;Donnell 
</h3>
<p>We have, in some cases, mapped medical schools to the parent ROR ID in the EJP ROR database so that, if someone starts typing a medical school, that name is in the typeahead, but the parent ROR ID is used under the hood. </p>
<figure><img src="/img/blog/rup/slide-06.png"
         alt="Production Check and Request: Production editors check manuscripts for missing ROR IDs, request to add via ROR request form, will add a top-level ROR to manuscript metadata when appropriate"/>
</figure>

<p>Emily is now going to talk more about what Silverchair is doing. And I&rsquo;m grateful that they&rsquo;re doing quite a bit! <strong>But briefly, we <a href="https://ror.readme.io/docs/jats">include ROR IDs in our JATS files</a>, and after an article is published, Silverchair includes those IDs in Crossref deposits.</strong> As I said earlier, we also have our ROR IDs in Silverchair&rsquo;s SiteManager, which checks entitlements. So the goal is to be able to report easily, via Silverchair, on Publish output and Read output for any institution.</p>
<p>And finally, we&rsquo;re in a pilot with <a href="https://researchgate.net">ResearchGate</a>. Not all of our content is Open Access, and we are sending ResearchGate content that is access-controlled, so we need entitlement checks on the ResearchGate side. We do that via <a href="https://scholar.google.com/intl/en/scholar/publishers.html#otherpolicies">Google Scholar Subscriber Links</a> and ROR IDs! With <a href="https://en.wikipedia.org/wiki/Anurag_Acharya">Anurag Acharya</a>&rsquo;s approval, Silverchair has added ROR IDs to our Subscriber Links and ResearchGate uses those to grant access to individuals from our subscribing and R&amp;P institutions.</p>
<figure><img src="/img/blog/rup/slide-07.png"
         alt="Downstream: Sending ROR IDs to Silverchair and Crossref, about to start including ROR IDs in final JATS files, Silverchair poised to deposit to Crossref, goal is holistic reporting of publish and read metrics based on ROR, potential use of ROR for syndication entitlement check on ResearchGate"/>
</figure>



<h3 id="hahahugoshortcode-s19-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s amazing. That was really wonderful. I mean: all the applause. We will now hear from Emily Hazzard, Product Operations Analyst at Silverchair, about Silverchair&rsquo;s part in this. Emily?</p>


<h3 id="hahahugoshortcode-s20-hbhb-emily-hazzard"><figure class="round-figure"><img src="/img/blog/rup/ehazzard.jpg"
         alt="Avatar of Emily Hazzard"/>
</figure>
 Emily Hazzard 
</h3>
<p>Hi, everyone. I&rsquo;ve just got the one slide, and we&rsquo;re going to go into dark mode. Rob already talked about a lot of the things that we&rsquo;re doing, so if there are questions, just let me know. But for those of you who I haven&rsquo;t met, I&rsquo;m Emily. I&rsquo;m on the Product team at <a href="https://www.silverchair.com/">Silverchair</a>. And my role is between the high-level visioning stage and the detailed requirements stage, so I&rsquo;m involved in a lot of things ROR. We are mainly implementing things based on what our publishers find the most important, and we&rsquo;re always open to ideas. I love persistent identifiers: I could talk about them for a long time. And that&rsquo;s why we&rsquo;re all here. So I&rsquo;ll talk a little bit about what we currently support around ROR and a couple of things on the roadmap.</p>
<figure><img src="/img/blog/rup/slide-08.png"
         alt="Current &#43; future support for ROR IDs at Silverchair: Publishers can add ROR IDs to institutional information stored in our system; when a ROR ID has been added to an institution, it is included in Subscriber Links for Google Scholar; publishers can include ROR IDs alongside author affiliations in articles, proceedings, books, and chapters; when an author affiliation includes one or more ROR IDs, that information is sent in Crossref, PubMed, and fulltext deposits; on the roadmap, we will introduce the capability to search institutions in our system based on ROR ID, and we plan to develop reports that surface the connections between institutions and authors"/>
</figure>

<p>One of the things that we support is connecting institutions to ROR IDs. Publishers like Rockefeller are able to go into our <a href="https://www.silverchair.com/the-silverchair-platform/tools-technology/">SiteManager system</a>, and then add ROR IDs to the institutional information that we store. This allows us to include ROR IDs in Google Scholar Subscriber Links, which Rob has already mentioned, and we&rsquo;re going to be able to use those in other ways. Step 1 was making sure that we had the space for them in our system so that we can do everything that we want to do in the future. As you saw from that screenshot, we do currently include them in Subscriber Links, and that&rsquo;s mainly being used to support the ResearchGate use case that Rob mentioned. This also means they are available to Google Scholar, but I don&rsquo;t have a lot of insight as to what Google Scholar is doing with those ROR IDs now. If they choose to use them, I&rsquo;m happy to share updates about that in future calls.</p>
<p>We also allow publishers to include ROR IDs alongside author affiliations in articles, proceedings, books, and chapters for the content that&rsquo;s loaded to our publishing platform. That means that we&rsquo;re able to then auto-magically downstream that information in <a href="https://www.crossref.org">Crossref</a> deposits and <a href="https://pubmed.ncbi.nlm.nih.gov/">PubMed</a> deposits, as well as full-text deposits. Once we&rsquo;ve got that information in our system, all of that information is going to be propagated out to all of those different indexing services, which is exciting. <strong>As far as things on our roadmap, in the institutional management service in SiteManager, we&rsquo;re going to be introducing a way to search institutions based on the ROR ID just because it&rsquo;s incredibly convenient.</strong> Sometimes it&rsquo;s easier to find something based on a persistent identifier than something that might take a little bit of disambiguating (to find out how it might have been spelled or transliterated). <strong>Something else that we&rsquo;re planning on doing is introducing ROR-based reporting to support those Read and Publish agreements so that we can expose the relationships between those authors and institutions, and really see what&rsquo;s coming out of these institutions. Where is it published? Where does it live? And how do we serve those authors better?</strong></p>
<p>So that&rsquo;s what we&rsquo;re doing right now. I&rsquo;ve got lots of ideas for things that I am interested in. Again, talk to me if you have more questions. Are there any questions about the things that we&rsquo;re currently doing or that&rsquo;s on our roadmap here?</p>


<h3 id="hahahugoshortcode-s22-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I suppose I&rsquo;m a little curious as to whether all of the users of Silverchair get all of these features at once, or are you doing it more client by client?</p>


<h3 id="hahahugoshortcode-s23-hbhb-emily-hazzard"><figure class="round-figure"><img src="/img/blog/rup/ehazzard.jpg"
         alt="Avatar of Emily Hazzard"/>
</figure>
 Emily Hazzard 
</h3>
<p>That&rsquo;s a great question. So the SiteManager piece, adding ROR IDs to the institutional information, that&rsquo;s available to any client right now who&rsquo;s using that service. And then connecting authors to ROR IDs and including that downstream, that&rsquo;s available to everyone on our latest version of the platform. We try to auto-deploy as much as we can, especially when we know it&rsquo;s going to be abundantly useful for things like a persistent identifier, so that&rsquo;s one of the things that I always try to make sure we can do where we can do it.</p>


<h3 id="hahahugoshortcode-s24-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. Thank you.</p>


<h3 id="hahahugoshortcode-s25-hbhb-emily-hazzard"><figure class="round-figure"><img src="/img/blog/rup/ehazzard.jpg"
         alt="Avatar of Emily Hazzard"/>
</figure>
 Emily Hazzard 
</h3>
<p>Absolutely. Well, thanks, everybody. If you have any other questions, reach out.</p>


<h3 id="hahahugoshortcode-s26-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-200.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thanks for telling us all about your work with ROR, Rob and Emily.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span><a href="/documents/2022-12-08-ROR-RUP-Silverchair-Slides.pdf">Download the slides</a> as a PDF. Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Rockefeller University Press has incorporated ROR into many aspects of their workflow, and their platform provider Silverchair was there to help. This case study is based on the presentation about this integration given at the December 2022 ROR Community Call.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Parents, Children, and Other Relationships in ROR Records</title><id>https://doi.org/10.71938/qntp-5c92</id><link rel="alternate" href="https://ror.org/blog/2023-02-27-parents-children-and-other-relationships-in-ror/"/><published>2023-02-27T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>ROR Core Team</name><uri>https://ror.org/about/team/#core-team</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>One common question we receive about ROR registry records is whether they reflect organizational hierarchy and relationships between organizations &ndash; yes, they do! <a href="https://ror.org/about/faqs/#does-ror-support-relationships-and-hierarchies">ROR does indeed support parent-child organizational hierarchies</a> as well as other types of relationships between organizations. In this blog post, we go into more detail about how ROR supports relationships and hierarchies, how these are expressed in the metadata, how we curate them, and how users like you can explore and leverage this metadata. </p>


<h2 id="relationships-in-ror-at-a-glance">Relationships in ROR at a glance 
</h2>
<p>The ROR registry currently includes records for nearly 105,000 organizations (to be precise, 104,594 at the time of this writing). Across the entire ROR dataset, there are more than 22,000 total relationships between organizations (22,057), which means that <strong>approximately 21% of ROR records have at least one relationship to another record in the registry</strong>.</p>
<p>Parent-child hierarchies are the most common relationship type, with more than 17,000 instances. The most common scenario is that an organization has only one parent, but in some cases an organization has multiple parents (1,891 organizations, or about 1.8% of total records, have multiple parents). Several national-level and large-scale research organizations, such as the Czech Academy of Sciences (<a href="https://ror.org/053avzc18">https://ror.org/053avzc18</a>), have more than 100 of their child organizations enumerated in ROR. </p>
<p>Since its <a href="https://ror.org/blog/2022-03-17-first-independent-release/">first independent release</a> in March 2022 as part of the planned divergence from GRID, ROR has created over 1,000 new relationships between research organizations in the registry. We do this in collaboration with the global research community through our <a href="https://github.com/ror-community/ror-updates#ror-updates">transparent curation process</a>, and this service is entirely free of charge.</p>


<h2 id="relationships-in-the-ror-data-structure">Relationships in the ROR data structure 
</h2>
<p>ROR records store both structural and temporal connections in the <code>relationships</code> field. Connections and relationships between records can therefore be mapped and graphed. The relationship types supported are <code>Parent</code>, <code>Child</code>, <code>Related</code>, <code>Successor</code>, and <code>Predecessor</code>. An organization can have multiple relationships, but each relationship must be classified as only one of the above types. </p>
<p>The United States Department of Energy, for instance (<a href="https://ror.org/01bj3aw27">https://ror.org/01bj3aw27</a>), has many &ldquo;child&rdquo; organizations such as national research laboratories. Those child organizations themselves often have their own children representing different research units and funding bodies. Pictured below is a vertical &ldquo;family tree&rdquo; of the Department of Energy created from ROR records with an <a href="https://github.com/ror-community/ror-utilities/tree/main/organization-tree-scripts">organization tree script</a> showing DOE&rsquo;s children and grandchildren (laterally related organizations are not shown in this view). This structure aligns with how the Department of Energy tracks both its research outputs and the research projects it has supported using ROR IDs and <a href="https://www.crossref.org/services/funder-registry/">Crossref Funder Registry IDs</a>. </p>
<figure class="blog-figure"><img src="/img/blog/hierarchies/ror-doe-hierarchy.png"
         alt="List of US Department of Energy children and grandchildren expressed as items in an indented list from the ROR organization tree script written by Sandra Mierz."/><figcaption>
            <p>List of US Department of Energy children and grandchildren expressed as items in an indented list from the ROR organization tree script written by Sandra Mierz.</p>
        </figcaption>
</figure>

<p>Relationships are displayed on individual records in the ROR <a href="https://ror.org/search">search UI</a>, as for example the record for the US <a href="https://ror.org/040gcmg81">National Cancer Institute</a>, whose parent is the<a href="https://ror.org/01cwqze88"> National Institutes of Health</a> (NIH) and which has multiple children and multiple related organizations.</p>
<figure class="blog-figure"><img src="/img/blog/hierarchies/ror-natl-cancer-inst.png"
         alt="ROR record for the US National Cancer Institute showing the parent organization as the National Institutes of Health as well as three child organizations and two related organizations."/><figcaption>
            <p>ROR record for the US National Cancer Institute showing the parent organization as the National Institutes of Health as well as three child organizations and two related organizations.</p>
        </figcaption>
</figure>

<p>The <code>relationships</code> element in the ROR JSON schema is an array, so organizations can have multiple children, multiple parents, and multiple lateral relationships.</p>
<figure class="blog-figure"><img src="/img/blog/hierarchies/ror-metadata-relationships.png"
         alt="Relationships in the JSON of the ROR record for the U.S. National Cancer Institute."/><figcaption>
            <p>Relationships in the JSON of the ROR record for the U.S. National Cancer Institute.</p>
        </figcaption>
</figure>

<p>If an organization ceases operations and passes on its work to another organization, that relationship is also reflected in ROR through the &ldquo;Predecessor&rdquo; or &ldquo;Successor&rdquo; relationship type. For instance, the Canadian Polar Commission has been succeeded by the Polar Knowledge Canada, and this is reflected in the relationship metadata.</p>
<figure class="blog-figure"><img src="/img/blog/hierarchies/ror-cpc.png"
         alt="Inactive ROR record for the Canadian Polar Commission stating that the organization has been succeeded by Polar Knowledge Canada."/><figcaption>
            <p>Inactive ROR record for the Canadian Polar Commission stating that the organization has been succeeded by Polar Knowledge Canada.</p>
        </figcaption>
</figure>

<figure class="blog-figure"><img src="/img/blog/hierarchies/ror-relationships-successor.png"
         alt="Metadata in the inactive Canadian Polar Commission ROR record showing that the organization has been succeeded by Polar Knowledge Canada."/><figcaption>
            <p>Metadata in the Canadian Polar Commission ROR record showing that the organization has been succeeded by Polar Knowledge Canada.</p>
        </figcaption>
</figure>



<h2 id="curation-policies-and-practices-with-relationships-and-hierarchies">Curation policies and practices with relationships and hierarchies 
</h2>
<p>ROR&rsquo;s essential purpose is to connect research outputs to research organizations, and therefore it is not focused on capturing all subdivisions of a given organization, such as a university&rsquo;s academic departments. <strong>The more granular the registry, the less suited it is to address the use case of tracking outputs at the institutional level.</strong> Likewise, large-scale reconciliation of research outputs, such as that done by <a href="https://openalex.org/">OpenAlex</a> to assign ROR IDs to millions upon millions of affiliation strings in their global academic graph, can become more difficult and time-consuming relative to the degree of depth. One of the things that users appreciate about ROR is that it provides a manageable level of granularity, which makes the dataset more immediately usable as well as easier to maintain than other sources of organizational data.</p>
<p>That being said, it is still important for the registry to be able to capture relationships and hierarchies between and within organizations. Through our curation process, we make careful determinations in consultation with the global research community about how to include and express these connections in ROR metadata. For example, when deciding whether a prospective child organization should be added to ROR or not, we consider factors such as whether it has an independent status/identity and funding sources from its parent organization, how the organization appears in affiliation usage, and whether a separate ROR ID for the organization would make the task of tracking outputs easier or more challenging. </p>
<p>As a global registry made freely and openly available for users and integrators around the world and maintained through a transparent community-based process, we also consider regional specificity in curation decisions involving relationships and hierarchies. This includes being aware of linguistic variations &ndash; for example, government agencies in many countries (Australia, Canada, and others) are called &ldquo;departments,&rdquo; which is not the same thing as a university &ldquo;department&rdquo; in some countries &ndash; as well as the specificity of local institutional structures, such as the unique nature of research institutes in France&rsquo;s higher education system. </p>
<p>Another important aspect of ROR&rsquo;s curation practices when it comes to relationships and hierarchies is optimizing ROR&rsquo;s interoperability with other organization identifiers. ROR IDs map to other open identifiers such as ISNI, Wikidata, and Crossref Funder IDs. As part of ongoing work to reconcile as many ROR IDs as possible to these identifiers, we make decisions about adding certain child and related organizations that can help to optimize these mappings. This is particularly important in the case of the Crossref Funder Registry, which is expected to be incorporated into ROR in the future. Although we are proud that <strong>90% of existing funding assertions in Crossref have been made with a Funder ID that already maps to a corresponding ROR record</strong>, we also understand the need to support the long tail of funding references and levels of hierarchy that are available in the Funder Registry, and we will continue to address this in the coming months. </p>


<h2 id="use-cases-and-current-integrations">Use cases and current integrations 
</h2>
<p>ROR users can and do take advantage of relationships and hierarchies in ROR for a number of purposes, including but not limited to the following: </p>
<ul>
<li>
<p><strong>Large, complex institutions such as university systems and multinational companies and nonprofits want to track research outputs for the entire system as well as for each constituent organization.</strong></p>
</li>
<li>
<p><strong>Funders want to find all research outputs funded by the top-level agency or foundation as well as by subsidiary agencies and related research facilities.</strong></p>
</li>
<li>
<p><strong>National funders and Open Access programs want to monitor both national and institution-level adherence to Open Access policies.</strong> </p>
</li>
<li>
<p><strong>Publishers want to manage institution-level subscriptions and Open Access agreements, each of which may include or exclude particular institutional subdivisions.</strong></p>
</li>
</ul>
<p>One example of a system that has cleverly integrated ROR to enable finding research outputs both from a single organization and from related organizations is <a href="https://lens.org">The Lens</a>, which offers large-scale search of patents and scholarly works. In December of 2022, The Lens <a href="https://about.lens.org/release-8-7/">announced</a> that it had added a new ROR-based feature: a field in its structured search called &ldquo;Institution ROR ID Lineage&rdquo; that lets a user find all the scholarly works associated with an organization&rsquo;s ROR ID plus all the scholarly works associated with that organization&rsquo;s children. The Lens was already using ROR in its institutional data model, but this fascinating new feature shows an appreciation of what ROR&rsquo;s relationships field can do. Searching by the ROR ID for the University of London (<a href="https://ror.org/04cw6st05">https://ror.org/04cw6st05</a>) alone produces just over 98,000 results, whereas searching by the ROR ID lineage for the University of London produces over 780,000 results from the University of London and all of its 21 children.</p>
<figure><img src="/img/blog/hierarchies/ror-lens-ulondon.jpg"
         alt="The Lens&#39;s structured search for scholarly works by ROR ID showing over 98,000 works associated with the University of London alone. "/><figcaption>
            <p>The Lens&rsquo;s structured search for scholarly works by ROR ID showing over 98,000 works associated with the University of London alone.</p>
        </figcaption>
</figure>

<figure><img src="/img/blog/hierarchies/ror-lens-ulondon-lineage.jpg"
         alt="The Lens&#39;s structured search for scholarly works by ROR ID showing over 780,000 works associated with the University of London and its subsidiary organizations. The Lens also offers the ability to filter results by sub-organization."/><figcaption>
            <p>The Lens&rsquo;s structured search for scholarly works by ROR ID showing over 780,000 works associated with the University of London and its subsidiary organizations. The Lens also offers the ability to filter results by sub-organization.</p>
        </figcaption>
</figure>

<p>Germany&rsquo;s <a href="https://open-access-monitor.de">Open Access Monitor</a>, similarly, uses ROR&rsquo;s hierarchies to allow its users to track open access data for large, complex German research organizations at the parent level, the child level, or in clusters of selected related organizations. The <a href="https://ror.org/01hhn8329">Max Planck Society</a>, for instance, has nearly 100 subsidiary research organizations, and because the OA Monitor maps all the data it acquires from Dimensions, Scopus, and Web of Science to ROR IDs, it can give its users information about the open access compliance of all Max Planck Society organizations together, only one of them, or any combination of them. </p>
<figure><img src="/img/blog/hierarchies/ror-oam-max-planck-select.png"
         alt="In Germany&#39;s ROR-powered Open Access Monitor, a user can select all, one, or any set of the child research organizations of the Max Planck Society. "/><figcaption>
            <p>In Germany&rsquo;s ROR-powered Open Access Monitor, a user can select all, one, or any set of the child research organizations of the Max Planck Society.</p>
        </figcaption>
</figure>

<figure><img src="/img/blog/hierarchies/ror-oam-max-planck-oa-by-year.png"
         alt="In Germany&#39;s ROR-powered Open Access Monitor, a user who has selected all 102&#43; Max Planck Society organizations can see aggregate data on the number of publications in each Open Access category by year for the entirety of the Max Planck Society system. Users can also see the same data for any single Max Planck Society organization or any combination of Max Planck Society organizations."/><figcaption>
            <p>In Germany&rsquo;s ROR-powered Open Access Monitor, a user who has selected all 102+ Max Planck Society organizations can see aggregate data on the number of publications in each Open Access category by year for the entirety of the Max Planck Society system. Users can also see the same data for any single Max Planck Society organization or any combination of Max Planck Society organizations.</p>
        </figcaption>
</figure>

<p>Finally, we are hearing from a number of publishers who are interested in using ROR to manage open access publishing agreements with research organizations. Our favorite example of a publisher who is already using ROR for this purpose is <a href="https://rupress.org/">Rockefeller University Press</a> (RUP). In a recent OASPA/JISC webinar on publishing workflows, representatives of RUP explained and demonstrated several different ways in which they use ROR, including for the purpose of managing open access agreements. Their systems compare the ROR ID of the corresponding author&rsquo;s affiliation to the ROR IDs for agreements, and when there is a match, the author is notified that their work falls under the terms of that agreement. RUP&rsquo;s implementation shows the value of having ROR IDs incorporated in multiple parts of a publishing workflow and demonstrates that ROR&rsquo;s degree of granularity is already more than sufficient to increase the efficiency of such workflows. Watch the full RUP presentation and/or the full webinar below:</p>

<div class="blog-video">
  <iframe src="https://www.youtube.com/embed/Jn21FiGiNCo?t=2359" allowfullscreen title="OASPA and JISC webinar on publishing workflows, January 26, 2023"></iframe>
</div>

<p>We hope the preceding examples can serve as models for other projects that might use ROR&rsquo;s relationships and hierarchies for similar purposes. If you too are using ROR&rsquo;s relationships or hierarchies, we&rsquo;d love to hear more about it &ndash; <a href="https://bit.ly/ror-integration-form">tell us more about your integration</a> and we&rsquo;ll feature you on our website, or get in touch via <a href="mailto:support@ror.org">support@ror.org</a> if you need help getting started.</p>


<h2 id="final-takeaways">Final takeaways 
</h2>
<p>ROR&rsquo;s support for relationships and hierarchies is specifically informed by our core use case of connecting research organizations to research outputs and motivated by our aim to provide an open and easy-to-use dataset with inclusive global coverage. </p>
<p>We hope this post has helped to provide more detail about how relationships and hierarchies are supported in ROR, how we make decisions about relationship structures in the curation process, and how this metadata is being leveraged by users and integrators. </p>
<p>To learn more, check out <a href="https://ror.readme.io">ROR&rsquo;s technical documentation</a>, and get in touch via <a href="mailto:support@ror.org">support@ror.org</a> with any questions. If you have feedback about relationship structures to add to or update in the registry, let us know via the <a href="https://curation-request.ror.org">curation request form</a>.</p>
<hr>
<p><em>This post was jointly written by members of the <a href="/about//team/#core-team">ROR core team</a>: Adam Buttrick (Metadata Curation Lead), Amanda French (Technical Community Manager), Maria Gould (Project Lead), and Liz Krznarich (Technical Lead).</em></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR supports parent-child organizational hierarchies as well as other types of relationships between organizations. This blog post explains how organizational hierarchies and relationships are expressed in ROR metadata, offers statistics on hierarchies and relationships in ROR, summarizes ROR's curation practices for hierarchies and relationships, and gives examples of real-life use cases of ROR hierarchies and relationships.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: ROR in Caltech Repositories</title><id>https://doi.org/10.71938/qjyk-j024</id><link rel="alternate" href="https://ror.org/blog/2023-02-23-caltech-case-study/"/><published>2023-02-23T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>In this case study we interview Tom Morrell, Research Data Specialist at Caltech Library and key contributor to the open source research data management system InvenioRDM, about Caltech&rsquo;s early adoption of ROR IDs in their repository systems and why InvenioRDM is architected with ROR IDs for funders and author affiliations.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;Basically as soon as ROR came out and it was supported in DataCite, once they said, &lsquo;We&rsquo;ve got a spot in the metadata to put ROR,&rsquo; we pretty quickly implemented ROR for our thesis repository. That was a really easy, easy case, because we know everybody who has a thesis in the thesis repository is affiliated with Caltech.&rdquo;</p>
<p>&ldquo;We had an initial version of the data repository [&hellip;] and in that one we didn&rsquo;t have any affiliation identifiers. There was an affiliation field, but no identifier, because we implemented this before ROR existed, and we didn&rsquo;t like any of the other identifiers that were available.&rdquo;</p>
<p>&ldquo;[InvenioRDM has] been built so that the bones assume that ROR exists, and that&rsquo;s really nice, because we&rsquo;ve got that open identifier baked into the software stack.&rdquo;</p>
<p>&ldquo;At the moment we don&rsquo;t have to do a ton of reporting back to the funders to say what datasets were associated with a given grant, but we&rsquo;re anticipating that we&rsquo;re going to have to do that in the next few years, so having the ability to pull out exactly what records are associated with a given funder is going to be very important. We have it in the system now, so instead of just having people type in the text and the different variations of &lsquo;National Institutes of Health&rsquo; or &lsquo;National Science Foundation,&rsquo; instead, you have them type it in, it assigns the ROR to it, and then we&rsquo;ll be able to pull records out via the ROR.&rdquo;</p>
<p>&ndash; Tom Morrell, Research Data Specialist, Caltech Library</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for joining me to talk about your ROR implementation. If you could, please start by telling us your name, title, and organization.</p>


<h3 id="hahahugoshortcode-s2-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Hi, I&rsquo;m Tom Morrell. I&rsquo;m the Research Data Specialist at <a href="https://library.caltech.edu/library/home">Caltech Library</a>.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Perfect. Can you tell us about Caltech Library and the unit that you&rsquo;re in?</p>


<h3 id="hahahugoshortcode-s4-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Caltech Library serves all the information needs of Caltech as a campus. And Caltech is a kind of unique institution. We&rsquo;re small in terms of numbers. We have about 300 faculty, but we have a really big impact in a lot of research areas &ndash; in Astronomy, Astrophysics, Biology, Neuroscience. We&rsquo;ve got a lot of high-impact work, just done with a very small staff. So what the library does is we serve everybody on campus. We serve traditional library information needs in terms of journals, for instance, but we also have a <a href="https://library.caltech.edu/search/repositories">very strong repository program</a> in terms of making the research content of Caltech publicly available. That includes publications (<a href="https://authors.library.caltech.edu/">CaltechAUTHORS</a>) and theses (<a href="https://thesis.library.caltech.edu/">CaltechTHESIS</a>)  as well as what I spend most of my time working on, which is datasets and research software (<a href="https://data.caltech.edu">CaltechDATA</a>).</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And when did when did Caltech first start its repositories? You have both an institutional repository and a data repository, correct?</p>


<h3 id="hahahugoshortcode-s6-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>I don&rsquo;t remember off the top my head the original date. We were very early. We were part of some of the initial discussions up at Los Alamos about the concept of institutional repositories and making content open. We have over <a href="https://authors.library.caltech.edu/">100,000 items in our main institutional repository</a>, which, as a percentage of the output of the institution is very, very high, so we&rsquo;ve got lots of dedicated librarians who&rsquo;ve put a lot of effort into collecting content and writing up quality metadata for the content and making that publicly available. And one of the things that I really enjoy being able to do is to figure out ways of making that content more discoverable and more valuable, using things like ROR.</p>


<h3 id="hahahugoshortcode-s7-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. So where and when did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s8-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Oh, that&rsquo;s a good question. It was probably at one of the conferences very early on, when ROR was just beginning, and it was likely in the context of <a href="https://datacite.org">DataCite</a> metadata. That&rsquo;s where I&rsquo;ve spent a lot of my time, is figuring out how we can best describe things like our datasets and software in DataCite metadata. And one of the challenges always was, you know, How do we do affiliations? And before ROR there wasn&rsquo;t really an easy, open way of saying &ldquo;This the affiliation that I mean.&rdquo; So when I heard of ROR, I was like, &ldquo;This is really just the perfect solution for how we handle affiliations and identifying where people are coming from.&rdquo;</p>


<h3 id="hahahugoshortcode-s9-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Were you the primary advocate of implementing ROR at your organization?</p>


<h3 id="hahahugoshortcode-s10-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>I would say yes, but we had lots of other librarians who were also very excited and interested in this. Because it&rsquo;s going to be helpful not only for discoverability by making our metadata better, but it&rsquo;ll also be really helpful for reporting. One of the types of reports that we have to do often is compliance reports for funders, and we track who&rsquo;s collaborating with who. At the moment those are challenging to put together, because you need to know everybody&rsquo;s affiliation, and trying to match up what a given affiliation is when you have random strings that are coming in from publications is really challenging. So we have librarians who are very excited about the possibility of actually having identifiers for these affiliations. That makes a lot of the reporting stuff a lot easier.</p>


<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And just to clarify your role: are you responsible primarily for the data repository, or do you also have a role on the institutional repository?</p>


<h3 id="hahahugoshortcode-s12-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>My primary role is the data repository, but I also help out with the institutional repository as well on the technical end. We&rsquo;re currently in the process of migrating the institutional repository (<a href="https://authors.library.caltech.edu/">CaltechAUTHORS</a>) to <a href="https://inveniosoftware.org/products/rdm/">InvenioRDM</a>, so I&rsquo;m leading the group that&rsquo;s doing that migration.</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s actually very exciting.</p>


<h3 id="hahahugoshortcode-s14-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>So yeah, I kind of am the bridge. We have research services librarians who work with the researchers and make all the decisions for the institutional repository: How should we describe these articles? How should we tag and organize and collect them? And they enter them into the system. Where I come in is more on the technical end: How should we actually manage the metadata? How can we do bulk updates to the institutional repository? And then we&rsquo;ve got software developers in the <a href="https://library.caltech.edu/about/directory#dld">Digital Library Development group</a> that do more of the you know, nuts and bolts of really doing the software development. So I&rsquo;m kind of in an intermediate role between the librarians and the software developers.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Gotcha. So your <a href="https://thesis.library.caltech.edu/">thesis repository</a>, I believe, currently runs on <a href="https://www.eprints.org/uk/">EPrints</a>, and your data repository runs on <a href="https://inveniosoftware.org/products/rdm/">InvenioRDM</a>, and you have ROR integrated into both right now?</p>


<h3 id="hahahugoshortcode-s16-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Yes, we do. I&rsquo;ll run through the history of how we did the integration. <strong>Basically as soon as ROR came out and it was supported in DataCite, once they said, &ldquo;<a href="https://blog.datacite.org/identify-your-affiliation-with-metadata-schema-4-3/">We&rsquo;ve got a spot in the metadata to put ROR</a>,&rdquo; we pretty quickly implemented ROR for our thesis repository. That was a really easy, easy case, because we know everybody who has a thesis in the thesis repository is affiliated with Caltech.</strong> [Note: For an interesting analysis of affiliations in university repositories, see Habermann, T. (2022, June 24). <a href="https://metadatagamechangers.com/blog/2022/6/23/universitiesdatacite">Universities@DataCite</a>. Metadata Game Changers blog.]</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It&rsquo;s one single ROR ID!</p>


<h3 id="hahahugoshortcode-s18-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>It&rsquo;s one single ROR ID. And for the DOI minting, we were not particularly happy with the built-in plugin. EPrints generally, we&rsquo;ve used it for a very long time, and it&rsquo;s been a great system, but it is definitely showing its age. So we started doing some of our development work outside of EPrints, because that allows us to be much faster and more efficient than writing things in Perl. So the way we do our DOI minting for the theses, we basically have <a href="https://github.com/caltechlibrary/epxml_to_datacite">an external script that pulls out the metadata, reformats it, and then mints the DOI</a>. So we were able to pretty easily add ROR IDs for the theses. Each thesis has an author name, and then we put in the affiliation with the ROR ID. And then we went through and then bulk updated all of the DOIs with the new affiliation and ROR ID. That was our first implementation, and it was very straightforward because we knew what the ROR should be, we knew where it would go, and we had the tooling to basically automatically add it.</p>


<h3 id="hahahugoshortcode-s19-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>About how many theses do you think that was, that you added the ROR to in the DOI? Do you know, roughly?</p>


<h3 id="hahahugoshortcode-s20-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>It was approximately 10,000 theses.</p>


<h3 id="hahahugoshortcode-s21-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Fantastic. I&rsquo;m gonna ask you a very random question now, and there&rsquo;s no shame if you answer it in the negative. Do you by any chance have the <a href="https://ror.org/05dxps055">ROR ID for Caltech</a> memorized?</p>


<h3 id="hahahugoshortcode-s22-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>I have about the first 3 characters. I don&rsquo;t have the full one memorized.</p>


<h3 id="hahahugoshortcode-s23-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It&rsquo;s meant to be random, so I&rsquo;m not surprised, but I just wondered, since you&rsquo;ve had so much experience with that particular ROR ID. So that&rsquo;s ROR in CaltechTHESIS. Tell us about the data repository and InvenioRDM. How did you decide on that system? And tell us about ROR in that system. [Note: CaltechDATA on InvenioRDM <a href="https://library.caltech.edu/blog/CaltechDATA-InvenioRDM">launched in September 2022</a>.]</p>


<h3 id="hahahugoshortcode-s24-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>When we were looking at the data repository we wanted a system that was open source, but also one where it was really easy for researchers to do their own deposits. In comparison to the institutional repository where you get the publication, and you can look at it and say &ldquo;This the title, and these are the authors&rdquo; and so forth, we have a random data file. You really need the researcher to be involved in the process of describing what the dataset is and what papers it&rsquo;s connected to. So we wanted to make sure the researcher had a very easy and straightforward way of making a submission to the repository, and one of the things that we liked about Invenio and InvenioRDM is it&rsquo;s based on the really successful open repository <a href="https://zenodo.org">Zenodo</a>, which has gotten a lot of traction, because it&rsquo;s very easy to go upload your dataset and get them DOIs and get them preserved in a way that other people can access them. So we really like the ease of use, and we like that it&rsquo;s an <a href="https://github.com/inveniosoftware/invenio-app-rdm">open source project</a>, and we like that it was written in Python, so it&rsquo;s easy for us to do additional development and make additional contributions to the <a href="https://inveniosoftware.org/getinvolved/">community</a>. And one of the exciting things for us about InvenioRDM has been to see how the community has grown over the last couple of years. We were one of the first members in the group, and it&rsquo;s now got lots of new people joining and working on rolling out their own repositories for their own institutions. So from a technical side, it&rsquo;s nice to be able to make contributions into a codebase that then a lot of other people are going to use.</p>


<h3 id="hahahugoshortcode-s25-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yeah, I can imagine that. And <a href="https://blog.zenodo.org/2022/12/07/2022-12-07-zenodo-on-inveniordm/">Zenodo just announced, I think a month or two ago, that they are officially migrating to InvenioRDM themselves</a>.</p>


<h3 id="hahahugoshortcode-s26-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>It&rsquo;s very exciting that that&rsquo;s public now and that we&rsquo;ve got a more firm timeline for it. There&rsquo;s a lot of nice harmonization if folks are working together on a project. It makes everything more efficient.</p>


<h3 id="hahahugoshortcode-s27-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>So my understanding is that InvenioRDM has ROR basically already built into every part of it. Did you have to do any additional customization to it?</p>


<h3 id="hahahugoshortcode-s28-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Yes. I&rsquo;ll talk about our migration process. There&rsquo;s what we did with our data, and then there&rsquo;s what we hope to be able to do in the future with InvenioRDM. <strong>We had an initial version of the data repository which was on another version of Invenio, and in that one we didn&rsquo;t have any affiliation identifiers. There was an affiliation field, but no identifier, because we implemented this before ROR existed, and we didn&rsquo;t like any of the other identifiers that were available.</strong> So as part of the migration, we wanted to start to add RORs into the existing affiliations that we had. And <a href="https://library.caltech.edu/prf.php?account_id=84551">librarian Kathy Johnson</a> did a great job. We used the <a href="https://github.com/Metadata-Game-Changers/RORRetriever">RORRetriever from Metadata Game Changers</a>.</p>


<h3 id="hahahugoshortcode-s29-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Ah, yes.</p>


<h3 id="hahahugoshortcode-s30-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>We used that little automation to take all of the strings that we had for affiliations and make a guess as to what the ROR might be. And then Kathy went through and verified whether that was correct or not. So we did kind of a minimal set of data enhancements, finding RORs for our existing records.</p>
<figure class="blog-figure"><img src="/img/blog/caltech/migrate-caltechdata-script.png"
         alt="Code from the migration script for CaltechDATA"/><figcaption>
            <p>The migration script for CaltechDATA, which includes a lot of ROR mapping, is available  <a href="https://github.com/caltechlibrary/inveniordm-migrate/blob/master/migrate_caltechdata.py">on GitHub</a>.</p>
        </figcaption>
</figure>

<p>And then <a href="https://inveniordm.docs.cern.ch/reference/metadata/#creators-1-n">in the configuration of InvenioRDM, you can basically load in ROR for both the affiliation field</a>, so all the affiliations, as well as for funders. So we&rsquo;ve done that. When you go in to add an affiliation, it does a little search, and it&rsquo;ll auto-complete based on ROR. And then similarly for the funders it will auto-complete the funder name and you can click which one you want. So that&rsquo;s just out of the box configuration.</p>
<figure class="blog-figure"><img src="/img/blog/caltech/caltech-data-item-affiliation.png"
         alt="Entering an affiliation into CaltechData"/><figcaption>
            <p>Entering an affiliation into CaltechDATA with InvenioRDM.</p>
        </figcaption>
</figure>

<figure class="blog-figure"><img src="/img/blog/caltech/caltech-data-item.png"
         alt="An item in CaltechDATA showing author affiliations powered by ROR"/><figcaption>
            <p>An item in CaltechDATA showing author affiliations powered by ROR at <a href="https://doi.org/10.22002/ee2yc-fg857">https://doi.org/10.22002/ee2yc-fg857</a>. See full <a href="https://api.datacite.org/dois/10.22002/ee2yc-fg857?affiliation=true">metadata in the DataCite API</a>.</p>
        </figcaption>
</figure>

<p>In terms of integration, we did some data cleanup, we configured InvenioRDM how it works out of the box, and that all is great. But we also have a lot of improvements that we want to do for the future that we hope we can contribute back, and things that the InvenioRDM community in general wants to work on. There&rsquo;s being able to pull in the latest changes with inactive versus active RORs and have that show up in the interface, there&rsquo;s having the country show up so we can get better disambiguation for organizations that have the same name, stuff like that. So there&rsquo;s still a number of areas where I think we can improve InvenioRDM and how it handles RORs. <strong>But it&rsquo;s been built so that the bones assume that ROR exists, and that&rsquo;s really nice, because we&rsquo;ve got that open identifier baked into the software stack.</strong></p>
<figure class="blog-figure"><img src="/img/blog/caltech/live-stock.jpg"
         alt="Old-fashioned drawing of the skeleton of a cow with letters indicating key parts as in a diagram"/><figcaption>
            <p>Live stock - a cyclopedia for the farmer and stock owner, 1914, Internet Archive Book Images, No restrictions, <a href="https://commons.wikimedia.org/wiki/File:Live_stock_-_a_cyclopedia_for_the_farmer_and_stock_owner_including_the_breeding,_care,_feeding_and_management_of_horses,_cattle,_swine,_sheep_and_poultry_with_a_special_department_on_dairying_-_being_(14797912313).jpg">via Wikimedia Commons</a>.</p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s35-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Sounds great. It really is a terrific community, too, InvenioRDM. I lurk on <a href="https://discord.com/invite/8qatqBC">the Discord</a>, but I have not been hugely active in it.</p>


<h3 id="hahahugoshortcode-s36-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>The community is the best part of it.</p>


<h3 id="hahahugoshortcode-s37-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s great. So can you tell me some of the benefits of having ROR in both the thesis repository system and the data repository system?</p>


<h3 id="hahahugoshortcode-s38-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Yeah. In terms of the thesis repository, it&rsquo;s allowed Caltech to be much more visible. If you <a href="https://commons.datacite.org/ror.org/05dxps055">do a search in DataCite for Caltech</a>, you will get all of our theses, and it&rsquo;ll pull up the disciplines and the tags. You just put it in the system, and it just works, because we&rsquo;ve attached the ROR to all the affiliations. So in terms of accurately describing the output of the institution, it&rsquo;s helped hugely in terms of just accurately saying, &ldquo;This scholarly content has come from Caltech.&rdquo;</p>
<figure class="blog-figure"><img src="/img/blog/caltech/datacite-caltech.png"
         alt="Screenshot of the results page for Caltech on DataCite Commons showing the first three of a total nearly 42,000 works filterable by publication year, work type, license, language, field of science, and registration agency, with the first two results being datasets and the third being a dissertation"/><figcaption>
            <p>ROR-powered Caltech organization page on DataCite Commons at <a href="https://commons.datacite.org/ror.org/05dxps055">https://commons.datacite.org/ror.org/05dxps055</a>.</p>
        </figcaption>
</figure>

<p>In terms of the data repository, we&rsquo;re still figuring out where it&rsquo;s going to be useful in terms of reporting and cleanliness. One of the areas where it&rsquo;s going to be very helpful is in terms of funders. <strong>At the moment we don&rsquo;t have to do a ton of reporting back to the funders to say what datasets were associated with a given grant, but we&rsquo;re anticipating that we&rsquo;re going to have to do that in the next few years, so having the ability to pull out exactly what records are associated with a given funder is going to be very important. We have it in the system now, so instead of just having people type in the text and the different variations of &ldquo;National Institutes of Health&rdquo; or &ldquo;National Science Foundation,&rdquo; instead, you have them type it in, it assigns the ROR to it, and then we&rsquo;ll be able to pull records out via the ROR.</strong> We don&rsquo;t have as many specific use cases where we can say, &ldquo;Yes, this is the reporting we&rsquo;re using this for today,&rdquo; but we&rsquo;re anticipating it&rsquo;s going to be really, really helpful, because there <a href="https://www.whitehouse.gov/ostp/news-updates/2022/08/25/ostp-issues-guidance-to-make-federally-funded-research-freely-available-without-delay/">probably will be more compliance reporting we have to do</a>.</p>


<h3 id="hahahugoshortcode-s40-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. So what problems have you identified with ROR, if any? What do you wish ROR did that it isn&rsquo;t currently doing?</p>


<h3 id="hahahugoshortcode-s41-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>One of the challenges we&rsquo;ve found is in terms of what name shows up as the preferred name. Currently there is a little bit of variability as to what name is the main name for an organization versus other names. Particularly for international organizations, you know: Is the English name the one that shows up in the preferred name, or is it the Spanish name that shows up in the preferred name? We&rsquo;ve run into some of that when we were doing some of the cleanup work, figuring out which ROR is really the right one for a given affiliation. And that&rsquo;s something that also comes up in InvenioRDM when we&rsquo;re going back and looking at old records where we have just affiliation strings. We have to make a decision. Do we go with the name that is associated with the ROR or do we leave the name that somebody typed in? There&rsquo;s a decision point.</p>


<h3 id="hahahugoshortcode-s42-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s43-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>How do we respect the data that the user entered in, which might not be the cleanest or in the right form that we want, but also not change the way the name is shown? That&rsquo;s something that we&rsquo;re still working through, how we make decisions about how the name shows up in the system and on the specific records.</p>


<h3 id="hahahugoshortcode-s44-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, that&rsquo;s interesting. Most ROR records have a lot of different names in different fields. There is the primary name, which, because we inherited it from GRID, tends to be in English. I&rsquo;m not sure if they had that as a policy or if that was just their practice. But we definitely don&rsquo;t have that policy, so <a href="https://ror.org/blog/2022-08-31-name-metadata-in-ror/#common-themes-with-name-metadata">we&rsquo;ve been taking lots and lots of requests from international organizations</a> to have the primary name in the name field be in the primary language of the country where that institution is located, when that is not English. But yeah, that&rsquo;s interesting. And I&rsquo;m sure you&rsquo;ve seen our <a href="https://ror.org/blog/2022-12-14-schema-scheming/">schema change proposals</a>. We have lots and lots of proposed name-related changes.</p>


<h3 id="hahahugoshortcode-s45-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Yeah, you&rsquo;re all working on it, so it&rsquo;s not like I&rsquo;m worried about this at all. I think you&rsquo;ve got it all under control. But that is an area where we did experience some challenges in terms of the data. And we have some general challenges with InvenioRDM in figuring out how we do the display of things.</p>


<h3 id="hahahugoshortcode-s46-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yeah, and even then, I&rsquo;m not sure what we could do to help with that display issue, really.</p>


<h3 id="hahahugoshortcode-s47-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>No. It&rsquo;s on our end to figure out how we want to handle that.</p>


<h3 id="hahahugoshortcode-s48-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Interesting. I think that&rsquo;s about all I have for you. What else do you want to tell us about the data repository, or the thesis repository, or your integrations of ROR generally?</p>


<h3 id="hahahugoshortcode-s49-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>I don&rsquo;t know if I have anything else. I&rsquo;m a big supporter of ROR, and I&rsquo;ve been really thrilled to see how it&rsquo;s evolved, particularly over the last couple of years. I feel like ROR has really gotten into its stride in terms of how it&rsquo;s managing the curation process, making the data available, providing both useful <a href="https://ror.readme.io/docs/data-dump">data dumps</a> and <a href="https://ror.readme.io/docs/rest-api">API features</a>. I think it&rsquo;s going to be a really powerful tool for allowing us to do all sorts of queries and identifying how people are associated with organizations and funders.</p>


<h3 id="hahahugoshortcode-s50-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great! That&rsquo;s great to hear. Thank you so much, Tom.</p>


<h3 id="hahahugoshortcode-s51-hbhb-tom-morrell"><figure class="round-figure"><img src="/img/blog/caltech/tom-morrell.jpg"
         alt="Avatar of Tom Morrell"/>
</figure>
 Tom Morrell 
</h3>
<p>Well, thank you. It&rsquo;s been great to chat.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this case study we interview Tom Morrell, Research Data Specialist at Caltech Library and key contributor to the open source research data management system InvenioRDM, about Caltech's early adoption of ROR IDs in their repository systems and why InvenioRDM is architected with ROR IDs for funders and author affiliations.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR Turns Four: Highlights from the 2023 Annual Community Meeting</title><id>https://doi.org/10.71938/yymx-c109</id><link rel="alternate" href="https://ror.org/blog/2023-02-13-ror-turns-four/"/><published>2023-02-13T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Four years ago, ROR was first <a href="/blog/2019-02-10-announcing-first-ror-prototype">introduced to the world</a> at an open community meeting the day before PIDapalooza.</p>
<p>Since then, we have continued to celebrate ROR&rsquo;s anniversary every year with a big public event to bring together the broad network of ROR users and supporters and reinforce ROR&rsquo;s commitment to developing open, sustainable, community-driven infrastructure. (In previous blog posts, we&rsquo;ve shared recaps of the <a href="/blog/2020-02-10-ror-ing-in-portugal">2020</a>, <a href="/blog/2021-02-03-ror-annual-meeting">2021</a>, and <a href="/blog/2022-02-14-new-year-at-ror">2022</a> meetings.)</p>
<p>For the 2023 annual meeting, we celebrated four years of ROR with four exciting virtual sessions attended by hundreds of participants from around the world.</p>


<h3 id="session-1-ror-community-update">Session 1: ROR Community Update 
</h3>
<p>In the first sesssion - a general community update - the ROR team reviewed key highlights from the past year and discussed plans for the coming year. We also shared ways for community members to get more involved in ROR activities (if they are not already participating).</p>

<div class="blog-video">
  <iframe src="https://www.youtube.com/embed/GIxqvBIx4r0" allowfullscreen title="ROR Community Update | 2023 ROR Annual Community Meeting"></iframe>
</div>



<h3 id="session-2-open-drop-in-hour">Session 2: Open Drop-In Hour 
</h3>
<p>In the second session, we held an open drop-in session (not recorded) to meet informally with anyone interested in chatting about ROR. This is a model for similar sessions we&rsquo;d like to hold throughout the year, especially to be able to reach people located in different timezones or who are not always able to attend the bimonthly ROR community calls. Stay tuned for announcements about future drop-in times!</p>


<h3 id="session-3-introduction-to-the-ror-api">Session 3: Introduction to the ROR API 
</h3>
<p>This year&rsquo;s meeting also featured the introduction of a new session focused on how to work with the ROR API. Technical Community Manager Amanda French delivered an engaging tutorial on how to use the ROR API and get up and running with basic queries. If you weren&rsquo;t able to attend the session, watch the recording and check out the <a href="/tutorials/intro-ror-api">workshop materials</a>!</p>

<div class="blog-video">
  <iframe src="https://www.youtube.com/embed/Lf0z5ob_J9Y" allowfullscreen title="Introduction to the ROR API | 2023 ROR Annual Community Meeting"></iframe>
</div>



<h3 id="session-4-strategies-for-matching-affiliation-strings-to-ror-ids">Session 4: Strategies for Matching Affiliation Strings to ROR IDs 
</h3>
<p>The last session, &ldquo;Strategies for Matching Affiliation Strings to ROR IDs,&rdquo; featured a panel of users (<a href="https://openalex.org">OpenAlex</a>, <a href="https://allenai.org/">Allen Institute for AI</a>, <a href="https://www.osti.gov/">DOE/OSTI</a>, and <a href="https://www.crossref.org">Crossref</a>) sharing their work to develop automated approaches to reconciling affiliation data with ROR. We are eager to bring people together to develop collaborative solutions to shared problems, and the panel was a great example of putting this into practice.</p>

<div class="blog-video">
  <iframe src="https://www.youtube.com/embed/Tx5y7lX030U" allowfullscreen title="Strategies for Matching Affiliation Strings to ROR IDs | 2023 ROR Annual Community Meeting"></iframe>
</div>



<h3 id="four-more-years---and-then-some-more">Four more years - and then some more! 
</h3>
<p>Four years is a huge milestone for ROR! At this stage in our journey, we are proud and excited that ROR&rsquo;s open data and <a href="https://ror.readme.io/docs/rest-api">API</a> are being used around the world and in <a href="/community#adopters">more than 50 known integrations</a>, that registry records are being actively and regularly maintained through a transparent community-based <a href="/registry">process</a>, that ROR is supported in Crossref, DataCite, and ORCID and being recommended in national PID strategies (most recently, in <a href="https://ardc.edu.au/article/strategic-investment-in-identifiers-could-save-24-million-and-38000-person-days-per-year/">Australia</a>), that ROR&rsquo;s three governing organizations are committed to ROR&rsquo;s <a href="/blog/2022-10-10-strengthening-sustainability">long-term sustainability</a>, that ROR has been selected by <a href="https://scoss.org">SCOSS</a>, and that we are part of a growing movement of initiatives to develop and follow <a href="https://openscholarlyinfrastructure.org">best practices</a> for community-led open infrastructure.</p>
<p>We could not have reached this milestone without your support - whether you have been part of ROR since the very beginning, or whether you are a newer member of the pride. Thank you! Here&rsquo;s to ROR-ing together for the next four years and then some more!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Four years ago, ROR was first introduced to the world at an open community meeting the day before PIDapalooza.
Since then, we have continued to celebrate ROR&amp;rsquo;s anniversary every year with a big public event to bring together the broad network of ROR users and supporters and reinforce ROR&amp;rsquo;s commitment to developing open, sustainable, community-driven infrastructure.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: ROR in FAIRsharing</title><id>https://doi.org/10.71938/hd7y-q648</id><link rel="alternate" href="https://ror.org/blog/2023-01-24-fairsharing-case-study/"/><published>2023-01-24T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>In this installment of the ROR Case Studies series, we talk with Allyson Lister, Content and Community Lead for FAIRsharing, a cross-disciplinary registry of scientific standards, databases, and policies, about how and why FAIRsharing used ROR to help make organizations first-class citizens in their data model.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;If we&rsquo;re going to integrate with somebody like ROR, we have to commit to it, you know. It was intensive at first, but mainly because of the creation of the original scripts and the batch of curation checks, as our records are all curated. But we&rsquo;ve had a couple of these three-month checks since, and it&rsquo;s been an hour or two maximum of checking, with the main curation team each doing maybe fifteen minutes or so of checking.&rdquo;</p>
<p>&ldquo;The integration itself was smooth. Those challenges I mentioned, the fact that we had to do string comparisons, and the fact that we don&rsquo;t have the same scope, those aren&rsquo;t <em>large</em> challenges. Honestly, they&rsquo;re just natural differences when you&rsquo;re doing a bulk match against another repository.&rdquo;</p>
<p>&ldquo;So we have something like 3,500 organizations in FAIRsharing, which is nothing compared to, what is it? A hundred thousand [in ROR], or something like that? &hellip; Ramon took our organizations and pulled the information from your API, and calculated an &rsquo;edit distance,&rsquo; so he looked at the name and the abbreviation and the home page fields for ROR and for FAIRsharing and he calculated the distance between the two terms for all of them, finding the least distance, the closest match.&rdquo;</p>
<p>&ndash; Allyson Lister, Content and Community Lead, FAIRsharing</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>So, Allyson, please tell me your name, title, and organization.</p>


<h3 id="hahahugoshortcode-s2-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>I can do that. I am <a href="https://orcid.org/0000-0002-7702-4495">Allyson Lister</a>, and I am the <a href="https://fairsharing.org">FAIRsharing</a> Content and Community Lead. I work with <a href="https://orcid.org/0000-0001-9572-0972">Ramon Granell</a>, who is a Research Knowledge Software Engineer, who unfortunately isn&rsquo;t able to join us. We&rsquo;re both staff at the University of Oxford at an institute called the <a href="https://oerc.ox.ac.uk/">Oxford e-Research Centre</a> (OeRC), which is a cross-disciplinary center.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>So can you tell me both about that Institute and about FAIRsharing? What do they do, and what makes them special?</p>


<h3 id="hahahugoshortcode-s4-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>The OeRC started in 2006, and was created as a cross-disciplinary research Centre in digital methodology, working across the humanities, communications, engineering, life sciences and more. And for FAIRsharing, we started in about 2009 as the result of <a href="https://www.science.org/doi/abs/10.1126/science.1180598">an article in <em>Science</em></a> that was authored by a number of different research councils, research institutes, and funding agencies and led by my boss, Professor <a href="https://eng.ox.ac.uk/people/susanna-assunta-sansone/">Susanna-Assunta Sansone</a>. As a result, BioSharing was created, which was intended to collect all the existing data policies within the life sciences to ensure they could be compared and harmonized according to the resources they recommend. You know, the standards they recommend, the databases they recommend. And this was so popular that it changed in 2011 from this original blog site to a website launched from the University of Oxford. And so it continued to grow over the next few years, including under the auspices of an RDA working group. That&rsquo;s the <a href="https://www.rd-alliance.org/">Research Data Alliance</a>. Are you familiar with them?</p>


<h3 id="hahahugoshortcode-s5-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yep.</p>


<h3 id="hahahugoshortcode-s6-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>They are a fantastic, absolutely amazing, volunteer-led community organization for research data. And so that went along quite merrily for a few years, until the publication of the <a href="https://doi.org/10.1038/sdata.2016.18">FAIR principles</a> in 2016, which Susanna and two other members of our group at the time were authors on, among a bunch of other authors. And so, when the FAIR principles were published, we quickly realized that the best thing to do was to change the scope of BioSharing to be more than life sciences. And so it became FAIRsharing in 2017. And now it&rsquo;s cross-discipline, looking to store information about databases, standards, and data policies across all research areas, because these are really the pillars of the FAIR principles; they give you the ability to describe the places you store your data, the way in which you format your data, and who is asking you to store and structure it in a certain way (such as funders, journals, societies). And so this is where we are now: a curated registry. Or, in reality, a curated set of three registries for (meta)data standards, databases, and data policies.</p>
<figure><img src="/img/blog/fairsharing/fairsharing-home.png"
         alt="FAIRsharing home"/><figcaption>
            <p>FAIRsharing home page</p>
        </figcaption>
</figure>

<p>FAIRsharing doesn&rsquo;t just describe those resources; my favorite part is that it links them together. So it creates this network of all the relationships, the landscape of resources for a particular domain. It allows you to explore what&rsquo;s available for your particular research area. And it&rsquo;s been adopted not only by the RDA, in which it continues to be an active member, but also a number of other international organizations and funders, such as <a href="https://eosc-portal.eu/">EOSC</a> and <a href="https://force11.org">FORCE11</a>. We have all these different organizations and publishers &ndash; journal publishers, funders. And the operational team is advised by <a href="https://fairsharing.org/communities#governance">Executive and Stakeholder Boards</a> that we use to make sure our direction is right for the community that we serve. And I think that&rsquo;s about it. Later on I&rsquo;m going to touch upon my favorite bit, which is that I&rsquo;m running this <a href="https://fairsharing.org/community_curation">community curation program</a> that is bringing our domain experts even closer into the project to help them showcase their resources.</p>


<h3 id="hahahugoshortcode-s8-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, wonderful! So where and when did you personally first hear about ROR?</p>


<h3 id="hahahugoshortcode-s9-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>I felt like I&rsquo;ve always known about ROR, at least ever since we started looking outward. The first few years, it was all about curating the resources. So we were very focused on the databases and standards and policies, and we always linked to organizations. But we didn&rsquo;t have any way of unambiguously referring to them other than our own internal curation checks. And then at the beginning of 2022, when we redeveloped FAIRsharing and created a brand new data model and a brand new front end, we said, &ldquo;What we need is for organizations to be first-class citizens.&rdquo; And so I immediately thought of ROR. Because what you get is an unambiguously, persistently-identified set of research organizations that will fix a lot of the problems that we had found. It wasn&rsquo;t a huge issue, but there were cases where we had redundancy in our organizations. And so we did a big push to align our organizations with ROR organizations.</p>


<h3 id="hahahugoshortcode-s10-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s11-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>FAIRsharing is a part of the <a href="https://datareadiness.eng.ox.ac.uk/">Data Readiness Group</a>. As well as working with data transparency, we also look at research integrity and making data FAIR for humans and machines. And so at FAIRsharing we are looking to incorporate many different international community activities around research data management. So ROR was a natural place to go to for organizations.</p>


<h3 id="hahahugoshortcode-s12-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Well, it&rsquo;s interesting, because I think the timeline of FAIRsharing and ROR overlap quite a bit, because the planning for ROR began really in 2016, and went on through 2019 when it officially launched, which is sort of right when FAIRsharing was making the move to become FAIRsharing.</p>


<h3 id="hahahugoshortcode-s13-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Exactly. We became FAIRsharing officially in 2017, but by 2018, 2019 we were really starting to snowball into the other communities. The life sciences had already heard of us, because we are a part of ELIXIR. We are a <a href="https://elixir-europe.org/platforms/interoperability/rirs">recommended interoperability resource in ELIXIR</a>, which is one of the parts of <a href="https://www.eosc-life.eu/">EOSC Life</a> within the broader EOSC context. I was at the <a href="https://eosc-portal.eu/events/eosc-symposium-2022">EOSC symposium in Prague</a> a few weeks ago, and everyone was clear about the interoperability aspects being really important. And I think that&rsquo;s something that you need to be very proud of at ROR, because you are the one connecting and making sure that people are saying what they mean to say when they&rsquo;re talking about organizations.</p>


<h3 id="hahahugoshortcode-s14-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And that does lead into a question that came up when you were talking about your own awareness of ROR which is: did you consider any other organizational identifiers when you were refactoring FAIRsharing?</p>


<h3 id="hahahugoshortcode-s15-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>You know, I&rsquo;m trying to remember if we did, or if once I heard of ROR, it just made sense. Ramon has told me that your API was really easy to use, and this is really important when developing an integration like this one.</p>


<h3 id="hahahugoshortcode-s16-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s very good to hear!</p>


<h3 id="hahahugoshortcode-s17-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>He said the API was never the issue. The issue was the classical natural language processing issue, you know, of comparing, making the initial comparison against what we already had from 2011 onwards to what you guys had. But I&rsquo;m trying to remember if there were any other major contenders. I have to say that ROR is the one that you always hear of in presentations. ROR is the one that you&rsquo;re always referring to when people are talking about it, and it just seems &hellip; I don&rsquo;t know exactly what your perspective is, but from my perspective it&rsquo;s the <em>de facto</em> way of describing organizations. And there are bits of the ROR scope that don&rsquo;t align with FAIRsharing. I&rsquo;m not saying that&rsquo;s a criticism of ROR. There are some places where we can&rsquo;t match our organizations to ROR, and that&rsquo;s not a limitation as much as just an acknowledgment of the differences in the way that people describe an organization. Because you could go on in workshops for days about &ldquo;What is an organization?&rdquo; And I&rsquo;m sure you have, being ROR! So it&rsquo;s just about deciding what you want to describe, and clearly listing what those requirements are, and for ROR that&rsquo;s very clear. You go the on the website and you can click straight away and find out <a href="https://ror.org/about/faqs/#scope">what your scope is</a>.</p>


<h3 id="hahahugoshortcode-s18-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. So let&rsquo;s get into the &ldquo;How&rdquo; at this point. Describe how you&rsquo;ve implemented ROR in FAIRsharing. Tell us about that.</p>


<h3 id="hahahugoshortcode-s19-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Well, in our old version we had a very nice web form that a lot of people used, that allowed you to “free-text add” everything about an organization, and while that organization could be reused, it was an object local only to FAIRsharing. Equally, while there weren’t a hundred different versions of &ldquo;University of Oxford&rdquo;, we did see duplications and mistakes creeping in. This was also true for ORCID IDs before our integration as a trusted partner with ORCID and for our subject areas and our technical domains, which were free text in the earliest versions of our project. So we wanted to align with community ontologies. We wanted to link up with ORCID. We wanted to link up with ROR. So this was our opportunity when we re-wrote the data model. You know, I wrote in my notes for this question that I wanted to mention xkcd and Randall Munroe. I have to!</p>


<h3 id="hahahugoshortcode-s20-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I know the one you&rsquo;re talking about, of course!</p>
<figure class="blog-figure"><img src="/img/blog/fairsharing/standards.png"
         alt="xkcd.com 927 Standards"/>
</figure>



<h3 id="hahahugoshortcode-s22-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Yes, the reinventing of the wheel. <a href="https://xkcd.com/927/">The 14 standards we have, we must make another one to align them all, and then we have 15 standards.</a> We try our hardest when we&rsquo;re all aligning, wanting to align with FAIR principles, to ensure that we&rsquo;re not making this kind of mistake. Inevitably there&rsquo;s going to be some, but we do our best at every stage not to do that. So our subject tags and our technical domain tags all come from community ontologies. We use nearly 60 different community ontologies that we&rsquo;ve merged together to try to create all of the tags that our users had already put into the system. So you know, most people would say, &ldquo;Let&rsquo;s find an ontology we like, and that&rsquo;s what we&rsquo;ll provide our users.&rdquo; But we had had seven or eight years of users putting whatever they wanted, self-describing their records.</p>


<h3 id="hahahugoshortcode-s23-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s24-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>And so we were working from the bottom up, and so we ended up having to look in lots of different places. But for organizations, thankfully, it was more straightforward. We had a lot of organizations. Our definition of organization was quite deliberately loose. And so when we went to ROR, we couldn&rsquo;t match everything, but it was still a useful match for what we did have. So not reinventing the wheel, using community terminologies, linking up with ORCID, linking up with ROR. FAIRsharing records can have maintainers. We also have our community curators, and both of these are the people who can edit records. So you either have to be the owner of the resource &ndash; a standard developer or a database developer or a policymaker &ndash; or you can be one of our domain experts that we&rsquo;ve given particular roles and can edit multiple records. And they are trying to describe their resource in a way that is pertinent to them in their community. And so we try very hard not to put too many requirements on what they have. We provide fifty, at least, maybe sixty different metadata fields, but only something like three of them are required, because we understand that for each community, especially when we&rsquo;re working cross-disciplinary, they&rsquo;re going to have different needs for what they need to describe. But, saying that, we always like to have an organization. So even though it&rsquo;s not a required field, we&rsquo;ll come back to them and say &ldquo;We&rsquo;re not going to publish it till you put an organization in,&rdquo; because we want to explore with them what they think of as an organization. So they can say that an organization maintains a resource, or collaborates on it, or funds it. A general &ldquo;associated with,&rdquo; if it&rsquo;s sort of nebulous and they still want to include someone. Like often EOSC Life or EOSC will get an &ldquo;associated with,&rdquo; because your project is part of and supported by them, but it&rsquo;s not a direct connection. And so we have a lot of different relationship types that link our records to our organizations. And so ROR being a large, open, community-driven database of organizations that has lots of different fields, and having relationships among the organizations, incorporating ROR into the new data model helps us speed up the creation of organizations in FAIRsharing for our users.</p>
<figure><img src="/img/blog/fairsharing/create-org-01.png"/>
</figure>

<figure><img src="/img/blog/fairsharing/create-org-02.png"/>
</figure>

<figure><img src="/img/blog/fairsharing/create-org-03.png"/>
</figure>

<p>FAIRsharing doesn’t just allow them to search ROR while they&rsquo;re making a record, but also provides the metadata from ROR for them so they don&rsquo;t have to put it in themselves. So it&rsquo;s a really nice way to ensure that we&rsquo;re accurately describing what they need. Now in terms of direct implementation, what we did is &ndash; or I shouldn&rsquo;t say, what &ldquo;we&rdquo; did. I helped, Ramon mainly did, and the rest of our curation team backed him up. So we have something like 3,500 organizations in FAIRsharing, which is nothing compared to, what is it? A hundred thousand [in ROR], or something like that?</p>


<h3 id="hahahugoshortcode-s28-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>About 104,000 right now.</p>


<h3 id="hahahugoshortcode-s29-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>A hundred and four thousand. Yes, and so clearly we&rsquo;re not an organization registry, but people over the years have added many organizations. Ramon took our organizations and pulled the information from your API, and calculated an &rsquo;edit distance,&rsquo; so he looked at the name and the abbreviation and the home page fields for ROR and for FAIRsharing and he calculated the distance between the two terms for all of them, finding the least distance, the closest match. [*Note: More information is available in Ramon’s <a href="https://blog.fairsharing.org/?p=239">blog post</a> about this work, and Ramon has <a href="https://github.com/ror-community/ror-utilities">made the scripts available in the ror-utilities GitHub repository</a>.]</p>


<h3 id="hahahugoshortcode-s30-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s31-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>And then he provided a set of spreadsheets, where the majority have a cutoff distance. So if the edit distance was a certain value or or lower, then we didn&rsquo;t really have to look at it and could discount them immediately. At the other end, there were a few hundred high-scoring matches that were very clearly matches. And then the curation team focused on those where the edit distance started moving away from optimal, but wasn&rsquo;t so far away that it was clearly nonsense.</p>


<h3 id="hahahugoshortcode-s32-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>So this was a one-time process?</p>


<h3 id="hahahugoshortcode-s33-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>This was a one-time process, but now gets repeated on a smaller scale, I think it&rsquo;s every three months. Because we always have new people adding organizations, and of course we encourage them to add organizations through the route we have with ROR. That means they should search ROR, because we have it built in, we have the connection to your API in our edit interface. But they don&rsquo;t have to do that route when they put an organization in, and so every three months we run everything again, and we check all the new matches or any changed matches, and make sure they still make sense. So it&rsquo;s still something that has to carry on, it will have to carry on for as long as we integrate with you, or we&rsquo;ll just draw away over time.</p>


<h3 id="hahahugoshortcode-s34-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s really fascinating, actually. And I love doing these interviews, because I&rsquo;m always hearing about new ways to solve different problems. So first of all, that is one thing that we <a href="https://ror.readme.io/docs/forms">recommend</a>, that nobody require the entry of a ROR into their system, because there certainly are institutions that are not in ROR that should be, or that are not in ROR because of scope issues, but that are in scope for your system. Oddly enough, you can require an organization without requiring a ROR ID, right, but you&rsquo;re not even requiring organizations, correct?</p>


<h3 id="hahahugoshortcode-s35-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>We&rsquo;re not, but we generally don&rsquo;t allow it unless they put something in. So it&rsquo;s a soft requirement, shall we say.</p>


<h3 id="hahahugoshortcode-s36-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yeah. But what is really interesting is what people do with those things that don&rsquo;t match a ROR. I haven&rsquo;t actually heard of anybody doing this kind of repetitive batch matching before. I know there may be other people doing it, but I think that&rsquo;s really fascinating.</p>


<h3 id="hahahugoshortcode-s37-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Thank you! Others seem to agree with you, as Ramon <a href="https://blog.fairsharing.org/?p=388">presented this work</a> last fall at the <a href="https://rsecon2022.society-rse.org/">Research Software Engineer Conference</a>! If we&rsquo;re going to integrate with somebody like ROR, we have to commit to it, you know. It was intensive at first, but mainly because of the creation of the original scripts and the batch of curation checks, as our records are all curated. But we&rsquo;ve had a couple of these three-month checks since, and it&rsquo;s been an hour or two maximum of checking, with the main curation team doing maybe fifteen minutes or so of checking each. So it&rsquo;s not been a high overhead since then, and I think as long as you keep it up, you&rsquo;re fine. But what we found is often that people can be overly verbose when they describe an organization, and they include, for instance, the name of the organization and all of the abbreviations in the same line and the same string, and maybe even the location, like a state and the country, or a city and the country. And they might use foreign language words. Some resources naturally have, as you know, working in ROR, multiple different names, depending on whether it&rsquo;s in their native language, or in English, which they often add. And then home pages change over time, so sometimes the home page we have wasn&rsquo;t the same as yours, even though the name was the same. Or sometimes the home page was the same, but one had an &ldquo;https&rdquo; and one had an &ldquo;http,&rdquo; or one had a &ldquo;/home&rdquo; and one had an &ldquo;index.html.&rdquo;</p>
<figure><img src="/img/blog/fairsharing/matching-ghent.jpg"
         alt="Matching a FAIRsharing organization to a ROR ID"/>
</figure>



<h3 id="hahahugoshortcode-s39-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yeah. Yep.</p>


<h3 id="hahahugoshortcode-s40-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>All of these different calculations. And so the automated procedure can only go so far because it will find a decent edit distance if there&rsquo;s these ancillary changes, but ultimately the core is the same. So it&rsquo;s interesting to really see how it works. And so, as of this month, Ramon&rsquo;s given me some numbers: he says we have 3,462 organizations, and of those just over 1,500 are mapped to ROR, so we&rsquo;re closing on 50%. But I don&rsquo;t know if we’ll ever be going much higher than that. I think it&rsquo;ll probably sit around that because of the scope issues. For us, 50% is not a “failure”.</p>


<h3 id="hahahugoshortcode-s41-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I&rsquo;m sort of interested in that: whether these are organizations that should be in ROR or whether it&rsquo;s primarily a scope issue.</p>


<h3 id="hahahugoshortcode-s42-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>I don&rsquo;t actually know. I&rsquo;ve forgotten, do you include companies?</p>


<h3 id="hahahugoshortcode-s43-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yes. We do. The companies that we tend to include are companies that produce research, and that researchers will cite as their affiliated organization when they publish research. So because the primary use case of ROR is researcher affiliation &ndash; certainly not the only way that people use ROR at all, but the primary one &ndash; we include <a href="https://ror.org/00mgss748">3M</a> and places like that, where people conduct and publish research affiliated with that company.</p>


<h3 id="hahahugoshortcode-s44-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Yeah. So things like laboratories. A lot of people want to say that my group at this university is the one who made this, and so we allow those. Consortia. So with the <a href="https://fairsharing.org/IVOA">International Virtual Observatory</a> for astronomy. They make a lot of standards, and while not really an organization, they are the ones who are responsible for it. So we need to talk about them as an organization. And there are these programs that are part of the government, for example the <a href="https://fairsharing.org/organisations/356">Canada Vigilance Program</a> which is related to the COVID-19 dashboard there. These programs are part of a government group or institute, but we allow them to self-identify. And so what we define as an organization becomes nebulous, or perhaps intentionally fuzzy. We said, &ldquo;We&rsquo;ll take all the ROR organizations that we want, but we&rsquo;ll also have these other organizations outside of ROR’s scope, because this is how people want to provide attribution for resources.&rdquo; And it&rsquo;s a kind of personal decision. They feel, and I think rightly, tied to who owns a particular resource. And some will just put in their university, while others will put in the university and the department and the laboratory &ndash; all these different levels that they want to show. And so we try to be agnostic about what it is, and let them choose for themselves.</p>


<h3 id="hahahugoshortcode-s45-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That makes total sense. And as you doubtless know, we inherited our seed data from GRID and have just begun curating the ROR registry independently in the spring of 2022, so we&rsquo;ll see where it goes. Labs are in scope for ROR, but I suspect that we&rsquo;re missing a lot of labs that would be in scope for ROR. And then for a lot of labs in a university, I mean, you know, there are sort of official ones, and then there are just sort of research groups that call themselves a lab that may not have an official existence.</p>


<h3 id="hahahugoshortcode-s46-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Yeah. See, that&rsquo;s interesting, because we didn&rsquo;t get any matches with labs, so I presumed that the labs weren&rsquo;t part of ROR. Interesting to know that that&rsquo;s not necessarily the case.</p>


<h3 id="hahahugoshortcode-s47-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right. So I&rsquo;m looking randomly, but here&rsquo;s the <a href="https://ror.org/00yee3n23">Laboratory of Astrophysics of Bordeaux</a>, for instance.</p>


<h3 id="hahahugoshortcode-s48-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Ah, yes, I see. So they are there, and they have the right parent organization. I mean, we&rsquo;d be very happy to pull ROR IDs for laboratories. For example, the <a href="https://fairsharing.org/organisations/1230">He Group</a>: they are a really prolific ontology development group, and they&rsquo;re heavily involved with the <a href="https://obofoundry.org/">OBO Foundry</a> which is a big life science and biomedical ontology community, and so they have loads of records. But that wasn&rsquo;t a match in ROR. Maybe because they never submitted themselves? I don&rsquo;t know. But we&rsquo;d be very happy to have a ROR link for things like that. And those are the sorts of things that didn&rsquo;t come up with matches. Whether or not they actually are in scope, and we just need better communication, perhaps.</p>


<h3 id="hahahugoshortcode-s49-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That was the other thing I was going to ask, is whether you or anyone on the FAIRsharing team ever says, &ldquo;Oh, let&rsquo;s add this to ROR.&rdquo; Not that that&rsquo;s your job!</p>


<h3 id="hahahugoshortcode-s50-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>We have thought about it, and I think once or twice I did use <a href="https://curation.ror.org">the form</a>. But it&rsquo;s time, isn&rsquo;t it. And when you are, as you and we both are, manually curated resources, you always have a pinch point of curation, which is one of the reasons we are building community curators. And no matter how hard you try, it’s difficult to keep up. Now, we have a commitment that we are trying to look at every record at least once a year, and together with our maintainers, who are meant to look at their records once a year, and the community curators, that&rsquo;s our goal.</p>


<h3 id="hahahugoshortcode-s51-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Right.</p>


<h3 id="hahahugoshortcode-s52-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>So I know that in that sense it&rsquo;s hard for us, so if we don&rsquo;t find the match but we&rsquo;re not sure, or if we do find a match but there are metadata fields that don&rsquo;t match, we tend to just keep ours, but add the ROR link. It could be that we have a more up to date web page, for instance, or sometimes it&rsquo;s the other way around. And so that&rsquo;s why we haven&rsquo;t sort of <em>en masse</em> shunted all the metadata fields from ROR to FAIRsharing, because that would require another check. But there&rsquo;s no reason why we couldn&rsquo;t investigate sharing of some kind of the results of these three-monthly runs in sort of a more automated fashion. So if you had time to look at what we found and see where the differences are, we&rsquo;d give you a list of differences. But I think for that we&rsquo;d have to talk a little bit more about an official collaboration so we could say &ldquo;This is what the goals are, this is what the end result will be.&rdquo; Then something like the output of some of these regular runs might inform the curation you do, rather than having to go and fill out the form every time.</p>


<h3 id="hahahugoshortcode-s53-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, certainly, and our curation process is also new, and we have a nice community curation process as well, but we have a large volume of requests, even with just that single form, right. So we are doing wonderfully right now, but I don&rsquo;t know that we have the capacity to do a lot of things. However, we have come across a number of these projects where they do have data about organizations. We do know that we want to examine or analyze it, but we just don&rsquo;t have the capacity to do it. So you know, that does suggest a grant-funded project. But we are continuing to work on building a curation process that is really robust and scalable, and it would be great to try to create a sort of an abstractable way to do this with anyone who is adopting ROR. Because almost anybody who is adopting ROR has this issue. They have data about things that are not in ROR &ndash; maybe because they shouldn&rsquo;t be, but maybe they should be. So we&rsquo;re sort of just beginning to look at that. But we have one Curation Lead and something like half a dozen community curators, so we&rsquo;re not looking for work right now. And I have to say we&rsquo;ve had something like <a href="https://github.com/ror-community/ror-updates/issues?q=is%3Aissue+is%3Aclosed">1,500 curation requests this year</a>. We get requests all the time, and they have to be evaluated and curated. We are proud of that, too, like you are, that these are well-curated records. We haven&rsquo;t touched all of them, certainly, but we are doing a lot of curation. [*Note: in 2022 ROR <a href="https://ror.org/blog/2022-12-20-year-in-review/">added 2,010 new records and updated 10,073 records</a> &ndash; about 10% of the registry &ndash; from external curation requests and internal quality assurance.]</p>


<h3 id="hahahugoshortcode-s54-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>You don&rsquo;t have to explain to me. It&rsquo;s the same issue with any curated resource, I think. My first job was with Swiss-Prot and TrEMBL; they&rsquo;re now called <a href="https://doi.org/10.25504/FAIRsharing.s1ne3g">UniProt</a>, which is a protein sequence database. You&rsquo;re coming from the library sciences, so you may not have come across them, but I was a biologist first, and then I moved into data science and I was a database programmer for them in Cambridge. Not the university, but an <a href="https://www.ebi.ac.uk/">institute</a> there. And they had a team of, probably, between the main group of curators in Cambridge and another group in Geneva, forty or fifty Ph.D.-level curators that were manually curating, and they could spend more than a week on one record, because they&rsquo;d read all the publication material and could do extremely thorough curation. And there was simply no way they were going to keep up with all protein sequence submissions. A manually-curated resource is an amazing resource, but you&rsquo;re always going to be fighting against scarcity.</p>


<h3 id="hahahugoshortcode-s55-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I&rsquo;ve done some work with, you know, crowdsourcing types of projects, and the ideal is to automate as much as you can, and to build tools that help the manual curators. If you have some kind of continuum between &ldquo;everything is manually curated&rdquo; and &ldquo;everything is automatic,&rdquo; my ideal for good quality metadata is that you automate as much as you can, and you build tools to help the human curators, and that produces to my mind the very best metadata. If you do only one or only the other, humans make mistakes, machines are <em>very</em> dumb and make mistakes, so you know, you really have to find that hybrid model that works well for everybody.</p>


<h3 id="hahahugoshortcode-s56-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>It&rsquo;s true, because when we&rsquo;re curating a resource, we have to look at all these different types of websites with all these different types of things. We can&rsquo;t really screen scrape; it&rsquo;s not really been an option. But with the advent of things like <a href="https://bioschemas.org/">Bioschemas</a> and <a href="https://schema.org">schema.org</a>, that kind of thing might happen in the future, but it&rsquo;s not there yet. Certainly for our integrations, we look to automate as much as we can with checks, with curations like the ROR integration for organizations. Now is such an exciting time, because when we made the new data model, we wanted to think of an organization as a digital object. Before that it was the record, the description of the resource that was at the center of everything in FAIRsharing, and that&rsquo;s still important to us. But we&rsquo;re broadening our idea of what&rsquo;s a node in our graph, if you see what I&rsquo;m saying. There&rsquo;s a graph for every single record in FAIRsharing. So you go to the <a href="https://doi.org/10.25504/FAIRsharing.wkggtx">Dryad record</a>, you go to the <a href="https://doi.org/10.25504/FAIRsharing.1jKfji">ROR record</a>, and you look at the graph, and it shows you all of its relationships that it has with all of the other resources that implement it, or the databases that share data with it. Hidden and not displayed yet, but still there in a graph, are the organizations, because the organizations themselves have their own digital object in FAIRsharing. And our next iteration of the graph will showcase that, so not only will you be able to see the resource landscape, you&rsquo;ll see the resource landscape integrated with the organizations that develop and maintain and fund it. And if you look at our organization pages now, like the one for GBIF, you can see that we&rsquo;ve put all the information there on its page.</p>
<figure><img src="/img/blog/fairsharing/organization-page-gbif.png"
         alt="GBIF Organization page on FAIRsharing"/>
</figure>

<p>That&rsquo;s <a href="https://fairsharing.org/organisations/1166">our GBIF record</a>. You can see that they sit as part of a larger ecosystem. So GBIF has seventeen records that it&rsquo;s related to in some way, that it maintains, funds, anything. And it has its ROR ID and it has its country, because a lot of people like to find resources built in certain areas, and it has the users that are maintainers or community curators. In this case, there are community curators who are affiliated with the organization to broaden the scope of that graph. And then what you get from that is the ability to represent organizations as first-class citizens in FAIRsharing through relationships with people like ROR, and through the links they have to their resources, which are described in FAIRsharing. And our GBIF community curator <a href="https://fairsharing.org/users/5832">Kyle Copas</a> has particularly focused on the organizations within his consortium.</p>
<figure><img src="/img/blog/fairsharing/gbif-use-case.png"
         alt="GBIF user story"/>
</figure>

<p>And so GBIF said, &ldquo;Well, we are part of this <a href="https://doi.org/10.3897/rio.8.e81136">BiCIKL</a> consortium, and we need to find out, for instance, all the resources that are associated with our consortium members and if we&rsquo;re missing anything. So how can we build a graph?&rdquo; And Kyle said, &ldquo;I know! FAIRsharing has graphs. But they&rsquo;re not complete yet.&rdquo; And he came to us and he became an early adopter. This was right at the very beginning of our <a href="https://fairsharing.org/community_curation">community curator program</a>. He&rsquo;s one of the very first. And he used his expert knowledge to enrich the curation on our records related to his consortium. For him, it was the records and the relationships they had with their organizations that was most interesting. This is where it all ties back to ROR, and organizations as important members of an ecosystem of resources. As a community curator, he received <a href="https://orcid.org/0000-0002-6590-599X">attribution for his role with us on his ORCID profile</a>.</p>
<figure class="blog-figure"><img src="/img/blog/fairsharing/orcid-kyle-copas.png"
         alt="ORCID profile of Kyle Copas"/>
</figure>

<p>His organization gained this enriched view of not just the resources, but the organizations that were relevant to their consortium. And they created a tool that uses our API. So essentially, Kyle came to us, curated with us, and then pulled all that data back out using our API. So it was a whole happy family of everyone getting what they wanted. Us getting high quality curation, them getting a view of all the organizations that were a member of the consortium, and the associated record curation, it was just a lovely collaboration. He began with a few months of quite concerted curation, and now he can move on to more of a maintenance mode. We’re looking for our community curators to get as much back from their role with us as we get from them. So this is the way in which we see organizations gaining the spotlight: through their relationship with the records that we store in FAIRsharing. And I want to show you the other way we show organizations, which is on our community curator pages.</p>
<div class='centered'><figure><img src="/img/blog/fairsharing/community-curator-profile.jpg"
         alt="Stephen Serjeant FAIRsharing profile" width="50%"/>
</figure>
</div>
<p>So Stephen Serjeant is from the <a href="https://www.open.ac.uk/">Open University</a>, and so we show that relationship. And we also have his infrastructure: <a href="https://projectescape.eu/">ESCAPE</a> is part of EOSC. And so it showcases their organizations as well as the community curator.</p>


<h3 id="hahahugoshortcode-s61-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>It&rsquo;s a contribution in kind from those organizations.</p>


<h3 id="hahahugoshortcode-s62-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Yes. So we want to make sure that they know they are valued, and make sure they&rsquo;re getting something back. And it&rsquo;s clear to their organization, as well as them, that they&rsquo;re getting attribution. They&rsquo;re getting expertise. Their curator pages show all the edits they&rsquo;ve done. One of our curators has done something like eight or nine hundred edits. And she can add that information to her CV, and it has been automatically added by us to her ORCID profile. Back on her FAIRsharing profile, we can see how much work she’s done, we can see the organization she works for. And we invite them to write guest blog posts, and contribute to our educational material. This means that our community curators have a number of concrete points of attribution they can point to. We have a Slack workspace so they can chat with each other. We&rsquo;re trying to build something that works for them as much as it does for us, because unfortunately what we can&rsquo;t offer them is money.</p>


<h3 id="hahahugoshortcode-s63-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Can I ask, do you have criteria for your community curators? Do they have to have a background in biological sciences or that kind of thing?</p>


<h3 id="hahahugoshortcode-s64-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Only if it is biological science records that they want to update. The most important thing is a willingness to contribute. For example, Genveieve, one of our Humanities curators, began by curating humanities records. But then she realized that she actually also needed to update a related schema, a persistent identifier record we have. As a result, she&rsquo;s gained that expertise. Now we ask that each curator self-identifies with a domain, and you can see that on their profile tiles. But they can edit any record in the registries, because those records are so heavily connected. We couldn’t just give them write access to one subject area, because there are too many links across disciplines. And so, although they have areas which they call their own, they can edit whatever they need to. Anything that gets edited goes into a queue to be checked out by our in-house curation team. So they don&rsquo;t have to worry about making a horrible mistake. Because they&rsquo;ve said that: &ldquo;What if I make a mistake?&rdquo;</p>


<h3 id="hahahugoshortcode-s65-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>That&rsquo;s fascinating. I&rsquo;ve done work with community transcription projects, because I actually come from the humanities as well.</p>


<h3 id="hahahugoshortcode-s66-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Is that Project Gutenberg and things like that?</p>


<h3 id="hahahugoshortcode-s67-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p><em>Like</em> that, not Project Gutenberg itself, but other projects in the humanities and archives world. You know, there are all of these handwritten manuscripts that you just can&rsquo;t OCR, you know. So I worked with Zooniverse and things like that.</p>


<h3 id="hahahugoshortcode-s68-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Zooniverse! That is what Stephen, our curator from the Open University, is involved in.</p>


<h3 id="hahahugoshortcode-s69-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Oh, really? Oh, I love Zooniverse. And Wikipedia too, I mean I&rsquo;ve done things like that. But Zooniverse, I still am on their newsletter, and I had one project, and I&rsquo;m always wishing that I had time to go and actually do transcription and give volunteer time to all of the projects on Zooniverse. It&rsquo;s like when I retire, I&rsquo;m going to be one of these people who spends all of my time classifying images and transcribing medieval documents and whatnot, you know, because it&rsquo;s so fun, it really is.</p>


<h3 id="hahahugoshortcode-s70-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>I completely agree with you. I got distracted when I visited, because when Stephen said he would update the Zooniverse FAIRsharing record, I went to look on my own, and, oh no, I&rsquo;m going to spend the next twenty minutes, when I should be working, navigating through all this! It was just too interesting.</p>


<h3 id="hahahugoshortcode-s71-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yeah, and the National Archives and the Smithsonian and the Library of Congress in the United States all have very fun crowdsourced transcription, crowdsourced classification things &ndash; they call it that, more than &ldquo;curation,&rdquo; you know. And then the New York Public Library, they had a whole collection of historic menus that they did a wonderful project on. They built a wonderful tool, and had a wonderful project for people to transcribe all of these historic menus. People ate a lot of oysters in the early twentieth century. You&rsquo;d be surprised. Like, lots of oysters.</p>


<h3 id="hahahugoshortcode-s72-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>I was surprised at how black currants weren&rsquo;t a thing in America, and then apparently, they were transmitting some disease back in the thirties or forties and so they had to all stop growing them, and then nobody wanted them decades later when they were allowed to grow them again.</p>


<h3 id="hahahugoshortcode-s73-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I am so thrilled to know that! I actually watch a lot of British gardening TV, or did, and yes, black currants and red currants are always, you know, a sort of a staple plant in those programs, and you do not find those here at all.</p>


<h3 id="hahahugoshortcode-s74-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>You don&rsquo;t get grape juice here. As a child you&rsquo;re given black currant juice, that&rsquo;s what you&rsquo;re given all the time. Grape juice? Why would you have grape juice? You have to go look at the fancy juice area in the grocery store to find the grape juice. Isn&rsquo;t it funny? Anyway.</p>


<h3 id="hahahugoshortcode-s75-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Anyway. All right, back to ROR. I&rsquo;m going to try to keep as much of that in the interview as I can.</p>


<h3 id="hahahugoshortcode-s76-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>You should definitely include the black currants. I went down a bit of a <a href="https://en.wikipedia.org/wiki/Blackcurrant/history">Wikipedia black hole</a> when I suddenly realized everyone was black currant here, and no one was grape, and why weren&rsquo;t we having black currants in the States. And so that&rsquo;s why.</p>


<h3 id="hahahugoshortcode-s77-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yeah.</p>


<h3 id="hahahugoshortcode-s78-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>So, yes. The integration itself was smooth. Those challenges I mentioned, the fact that we had to do string comparisons, and the fact that we don&rsquo;t have the same scope, those aren&rsquo;t <em>large</em> challenges. Honestly, they&rsquo;re just natural differences when you&rsquo;re doing a bulk match against another repository. Compared to our match procedure of our database registry with <a href="https://www.re3data.org/">re3data</a>, or when we match to ORCIDs, the ROR integration was more straightforward. And so Ramon, who is a machine learning and natural languages sort of guy, he&rsquo;s the person we go to anytime we want to do one of those things. So the first integration was just mapping from our organizations to yours, and a few months later we had the time to take a step further and integrate the search of ROR while someone is editing the record, which we <a href="https://blog.fairsharing.org/?p=385">wrote a blog post about</a>.</p>


<h3 id="hahahugoshortcode-s79-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Back when it was published I did read that.</p>


<h3 id="hahahugoshortcode-s80-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>And then <a href="https://github.com/FAIRsharing/fairsharing.github.io/pull/1952">the next step, which we&rsquo;ve nearly finished, is the integration of the relationships</a>, because from the very beginning in our data model, we allowed for a parent child relationship with organizations, because it only makes sense. But we hadn&rsquo;t populated it yet, right? And so we were going to start by populating it every time a new organization is added. If it has a parent-child relationship in ROR, we&rsquo;ll map that across when we map all the other metadata. Then we’ll look at our existing records and see where those parent-child relationships can be added as well.</p>


<h3 id="hahahugoshortcode-s81-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Wonderful. So could you talk a little bit more about what benefits to you and your users the ROR implementation is providing?</p>


<h3 id="hahahugoshortcode-s82-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Yes. Well, I mean clearly, before we integrated with you, it was much harder to ensure that we only had one copy of each organization, and so we definitely had cases where we would think we would have two resources, both developed by the same organization, but they actually had two different instances of the organization in our database. They were so similar, they would have been caught by something like integration with ROR, and they are being caught now. And so this sort of thing happens much less frequently. Access to a large, well-maintained resource helps with the creation and maintenance of organizations on our side. And having a higher-quality set of organizations has already helped our community curators, our maintainers, our consortium members, as they&rsquo;re not stopped if there&rsquo;s an issue with the organizations. Essentially, you can think of ROR not just as a database, but as a controlled vocabulary. You have a persistent identifier, and you have &ldquo;label&rdquo; and associated metadata. There&rsquo;s no reason you couldn&rsquo;t represent your organizations as a graph. And so it&rsquo;s this kind of alignment with community registries and terminologies that helps anyone align with FAIR and enable FAIR data. And so when we use ROR, we are utilizing standardized formats, registries. We are also being interoperable. Whenever you have these persistent identifiers to identify digital objects in a repository, you are naturally making yourself more interoperable with other resources. You are also going to be really machine-readable: when you pull from the API, you don&rsquo;t just get a string, &ldquo;University of Oxford,&rdquo; which doesn&rsquo;t do much for those programs. But you do get a persistent identifier, and you gain machine-readability. So all these little things. Every time we take a step in that direction we&rsquo;re enabling FAIR for us, but also for the consumers of our resource.</p>


<h3 id="hahahugoshortcode-s83-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Fantastic. What would it help you if ROR did in the future?</p>


<h3 id="hahahugoshortcode-s84-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Well, there&rsquo;s a few things we already spoke about, one of which sounds like a lot of fun. If we had time and funds &ndash; which is where every sentence starts with these sorts of things &ndash; then being able to look at the differences in scope that we have and you have. Like we just discovered today, laboratories: you have them, but most of ours don&rsquo;t match, and so knowing how we can feed back what should be relatively high-quality organizational metadata to you in a way that is, as you were saying, more partially computationally done rather than having to be done on a manual level. That is how we can help each other: we&rsquo;re getting information from you, and anytime we have something that you may want, we give it to you. That&rsquo;s one way of doing it. Again, that&rsquo;s opening a can of worms in a way, because there&rsquo;s so much we could do.</p>


<h3 id="hahahugoshortcode-s85-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Absolutely. All right. Is there anything else you&rsquo;d like to say before we wrap up?</p>


<h3 id="hahahugoshortcode-s86-hbhb-allyson-lister"><figure class="round-figure"><img src="/img/blog/fairsharing/allyson-lister.jpg"
         alt="Avatar of Allyson Lister"/>
</figure>
 Allyson Lister 
</h3>
<p>Oh, I should mention that part of the reason we&rsquo;re able to work so effectively with ROR and others is the collaborations that the Data Readiness group has within Oxford. Susanna’s role as the <a href="https://researchsupport.admin.ox.ac.uk/research-practice">Academic Lead for Research Practice</a> is also focused on ensuring that good practices around planning, executing, and reporting research are established and widely used across all divisions and groups at the University. Good research practices also include good research data management and FAIR data. The <a href="https://www.bodleian.ox.ac.uk/home">Bodleian Libraries</a> also plays a key role in this activity, as do all libraries. So that is another way for ROR to reach out, because the libraries are often very much the focal point of these sort of concerted mechanisms for being FAIR and being open. It was our collaboration with Bodleian Libraries that gave us our route to mint DOIs, and to become a trusted organization with ORCID, so that we can push our community curator awards to ORCID. So I think that&rsquo;s probably all I wanted to say. Thank you very much for having me and FAIRsharing!</p>


<h3 id="hahahugoshortcode-s87-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Well, thank you so much for speaking to me. It was lovely to talk to you.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this installment of the ROR Case Studies series, we talk with Allyson Lister, Content and Community Lead for FAIRsharing, a cross-disciplinary registry of scientific standards, databases, and policies, about how and why FAIRsharing used ROR to help make organizations first-class citizens in their data model.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Joy Owango and Natasha Simons Join ROR Steering Group</title><id>https://doi.org/10.71938/5sar-5118</id><link rel="alternate" href="https://ror.org/blog/2023-01-23-steering-group-additions/"/><published>2023-01-23T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is excited to welcome two new members to its Steering Group this year: <strong>Joy Owango</strong> from the Training Centre in Communication (TCC Africa) and <strong>Natasha Simons</strong> from the Australian Research Data Commons (ARDC).</p>
<p><strong>About Joy Owango</strong></p>
<p>Joy Owango is Founding Director of the <a href="https://www.tcc-africa.org/">Training Centre in Communication</a> (TCC Africa), the first African-based training centre to teach effective communication skills to scientists. TCC Africa is in partnership and housed at the University of Nairobi, Kenya, and provides capacity support in improving African researchers output and visibility through in scholarly and science communication.</p>
<p>Joy has successfully created  Global North–South collaborations, where the Centre promotes Open Science dialogue and provides equitable access to open-source research discovery solutions to African governments and their respective academic communities. Through these partnerships, TCC Africa actively creates awareness through advocacy on Open Science in Africa. Joy also currently sits on the Steering Committee of the International Science Council’s project on the future of scientific publishing.</p>
<p><strong>About Natasha Simons</strong></p>
<p>Natasha Simons is Associate Director, Data &amp; Services, for the <a href="https://ardc.edu.au/">Australian Research Data Commons</a> (ARDC). Based in Brisbane, she leads a team who are passionate about building infrastructure to enable FAIR data and delivering ARDC&rsquo;s suite of national information infrastructure services (persistent identifiers, vocabularies, metadata, and discovery portals). Natasha collaborates internationally to solve challenges that improve FAIR data infrastructure, policies, skills, and practices.</p>
<p><strong>About the ROR Steering Group</strong></p>
<p><a href="/blog/2019-11-22-meet-the-ror-steering-group/">Established in late 2019</a>, the ROR Steering Group is an advisory body that supports the registry’s strategic planning and decision-making activities and reflects ROR&rsquo;s global community of stakeholders. Group members provide feedback on ROR strategy and help drive awareness and adoption of ROR IDs through their respective networks and communities. The Steering Group comprises ROR’s <a href="/about#governance">operating organizations</a> (California Digital Library, Crossref, and DataCite), who serve in a permanent capacity, as well as members representing external organizations, who serve rotating terms of 2-4 years.</p>
<p>As Joy and Natasha join the 2023 cohort, two members of the group’s original cohort—Daniel Hook from Digital Science, and Clifford Lynch from the Coalition for Networked Information—will be cycling off. We are grateful to Daniel and Cliff for their guidance and support since ROR’s early days!</p>
<p>The complete current ROR Steering Group cohort for 2023 is:</p>
<ul>
<li>Matt Buys, <a href="https://datacite.org/">DataCite</a> (permanent member)</li>
<li>John Chodacki, <a href="https://cdlib.org/">California Digital Library</a> (CDL) (permanent member)</li>
<li>Lautaro Matas, <a href="https://www.lareferencia.info/">LA Referencia</a></li>
<li>Ritsuko Nakajima, <a href="https://www.jst.go.jp/">Japan Science and Technology Agency</a> (JST)</li>
<li>Joy Owango, <a href="https://www.tcc-africa.org/">Training Centre in Communication</a> (TCC Africa)</li>
<li>Ed Pentz, <a href="https://www.crossref.org">Crossref</a> (permanent member)</li>
<li>Judy Ruttenberg, <a href="https://www.arl.org/">Association of Research Libraries</a> (ARL)</li>
<li>Kathleen Shearer, <a href="https://www.coar-repositories.org/">Confederation of Open Access Repositories</a> (COAR)</li>
<li>Chris Shillum, <a href="https://orcid.org/">ORCID</a></li>
<li>Natasha Simons, <a href="https://ardc.edu.au/">Australian Research Data Commons</a> (ARDC)</li>
</ul>
<p>In addition to the Steering Group, ROR engages community input and participation through a larger community advisory group. This group, which is open to anyone, meets regularly (currently bimonthly) to discuss ROR activities, share about implementations, and give feedback on upcoming work. Those who are interested in participating in the group should email <a href="mailto:info@ror.org">info@ror.org</a> to join.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is excited to welcome two new members to its Steering Group this year: Joy Owango from the Training Centre in Communication (TCC Africa) and Natasha Simons from the Australian Research Data Commons (ARDC).</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR's Year in Review: 2022</title><id>https://doi.org/10.71938/6vea-5c14</id><link rel="alternate" href="https://ror.org/blog/2022-12-20-year-in-review/"/><published>2022-12-20T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Year in Review"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>2022 has been a momentous year for ROR! We experienced significant growth, reached critical milestones, and established a foundation for long-term sustainability. As the year comes to a close, we&rsquo;re taking a moment to reflect on our activities and give thanks to our community for supporting us along the way.</p>


<h2 id="2022-highlights">2022 highlights 
</h2>
<p>Activities and milestones we&rsquo;re proud of in 2022 include:</p>
<ul>
<li>
<p><strong>Expanding the ROR core team</strong> (we hired curation lead <a href="/adam-buttrick">Adam Buttrick</a> and technical community manager <a href="/amanda-french">Amanda French</a>, and shifted <a href="/liz-krznarich">Liz Krznarich</a> to technical lead).</p>
</li>
<li>
<p><strong>Updating the registry independently and implementing an end-to-end community-based curation process</strong>: we <a href="https://ror.org/blog/2022-03-17-first-independent-release/">launched ROR&rsquo;s independent curation and data release process</a> and began publishing registry updates on a regular basis in response to community feedback. We also onboarded new curators to ROR&rsquo;s <a href="https://ror.org/registry/#curation-advisory-board">curation advisory board</a>, expanding beyond the original pilot cohort. </p>
</li>
<li>
<p><strong>Seeing ROR adoption grow</strong>: new <a href="https://ror.org/community#adopters">integrations</a> were launched by organizations including <a href="https://ror.org/blog/2022-11-28-case-study-osf/">Open Science Framework</a>, Beilstein Institut, Science Data Bank, FAIRsharing, and the Swiss National Science Foundation. <a href="https://www.crossref.org/blog/a-ror-some-update-to-our-api/">Crossref also launched support for ROR IDs</a> in its schema and APIs.</p>
</li>
<li>
<p><strong>Accelerating technical development work</strong> according to a new <a href="https://github.com/ror-community/ror-roadmap">product development process</a>.</p>
</li>
<li>
<p><strong>Improving ways to search and query registry data</strong>: we implemented <a href="https://ror.readme.io/docs/api-advanced-query">&ldquo;advanced search&rdquo; functionality</a> in the ROR API and <a href="https://ror.readme.io/changelog/2022-10-24-affiliation-matching-improvements-api-only">improvements to the ROR API affiliation matching function</a>, and <a href="https://github.com/ror-community/ror-roadmap/issues/8">fixed some search-related bugs</a>. We also implemented new handling to support <a href="https://ror.org/blog/2022-12-07-handling-org-status/">organization status changes</a>.</p>
</li>
<li>
<p><strong>Planning for future changes to the ROR schema/API</strong>: we finalized a <a href="https://docs.google.com/document/d/18nl6pq0kdCU5ApcdbNjKnV7xHIw9eEY7DJG1WHjaLSs/edit">plan for schema and API versioning</a> based on community input, and drafted a <a href="https://ror.org/blog/2022-12-14-schema-scheming/">proposal for the first set of changes</a> to come in late 2023.</p>
</li>
<li>
<p><strong>Updating <a href="https://ror.org">ROR&rsquo;s website</a> and <a href="https://ror.org/search">search interface</a></strong> to meet optimize accessibility and improve functionality, including adding more details to the ROR record display.</p>
</li>
<li>
<p><strong>Growing the ROR Steering Group</strong> by welcoming <a href="https://ror.org/blog/2022-04-11-new-steering-group-members/">three new members </a>(<strong>Lautaro Matas</strong> from <a href="https://www.lareferencia.info/">LA Referencia</a>, <strong>Kathleen Shearer</strong> from <a href="https://www.coar-repositories.org/">COAR</a>, and <strong>Chris Shillum</strong> from <a href="https://orcid.org">ORCID</a>). </p>
</li>
<li>
<p><strong>Formalizing a revised resourcing model for ROR</strong> based on long-term <a href="https://ror.org/blog/2022-10-10-strengthening-sustainability">support from our three operating organizations</a>.</p>
</li>
<li>
<p><strong>Being <a href="https://ror.org/blog/2022-11-22-scoss-selects-ror/">recognized by SCOSS</a></strong> as essential open infrastructure.</p>
</li>
</ul>


<h2 id="2022-by-the-numbers">2022 by the numbers 
</h2>
<p>Another way to understand ROR&rsquo;s growth in 2022 is looking at our activities in numbers.</p>
<ul>
<li>
<p>ROR API usage grew to 13+ million requests per month in 2022, up from ~6 million per month at the end of 2021 and ~3 million per month at the end of 2020. </p>
</li>
<li>
<p>We published eighteen <a href="https://github.com/ror-community/ror-updates/releases">registry data releases</a>, which comprise the addition of 2,010 new organization records and the application of 11,135 updates to 10,073 existing records (about 10% of the total registry).</p>
</li>
<li>
<p>The <a href="https://doi.org/10.5281/zenodo.6347574">ROR data dump</a> had approximately 2,900 unique downloads</p>
</li>
<li>
<p>Community participation remained strong, with 50+ attendees at ROR&rsquo;s bi-monthly community advisory meetings, 3,000+ email newsletter recipients, and 2,100+ Twitter followers. </p>
</li>
<li>
<p>ROR team members presented at 16 conferences and community events. </p>
</li>
</ul>


<h2 id="looking-ahead">Looking ahead 
</h2>
<p>We&rsquo;re proud of what we accomplished in 2022, and we&rsquo;re looking forward to another productive year. In 2023, we&rsquo;ll be focusing on accelerating ROR integrations, scaling our technical infrastructure, and improving coverage and quality of the ROR dataset.</p>
<p>As always, in all of this work, we will continue to depend on community input and involvement to make sure we are reflecting your needs and use cases. Thank you for your continued engagement and support!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">2022 has been a momentous year for ROR! We experienced significant growth, reached critical milestones, and established a foundation for long-term sustainability.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Schema Scheming: Evolving the ROR Data Model</title><id>https://doi.org/10.71938/njsy-kh44</id><link rel="alternate" href="https://ror.org/blog/2022-12-14-schema-scheming/"/><published>2022-12-14T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><author><name>Liz Krznarich</name><uri>https://orcid.org/0000-0001-6622-4910</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>We&rsquo;ve run several calls for feedback in 2022, but here&rsquo;s one you&rsquo;ll definitely want to chime in on: our <strong><a href="https://docs.google.com/document/d/1JNDMoKmjR2y0quWXwFfoJTsIttbltJVN0l5Wddw1cIk/edit?usp=sharing">draft proposal of major changes to the ROR data model for schema version 2.0</a></strong>. Comments are open until <strong>February 5, 2023</strong>, and we hope to release version 2.0 of the ROR schema and API in late 2023.</span></div>
	</div>
</div>
<p>When <a href="https://ror.org/blog/2022-03-17-first-independent-release">ROR diverged from GRID in March of 2022</a>, it became possible for the first time to make changes to ROR&rsquo;s data model. Our first priority was to handle inactive organizations and other organization status changes, and the community solution, as it turned out, didn&rsquo;t involve any changes to the schema. We did the technical work to implement these organization status changes, and they were <a href="https://ror.org/blog/2022-12-07-handling-org-status/">moved into production on December 1st, 2022</a>. At the same time, knowing that both ROR&rsquo;s data model and its API will need to change in the future, we asked for feedback on a proposal to handle schema and API versioning, and the ROR community agreed on a <a href="https://docs.google.com/document/d/18nl6pq0kdCU5ApcdbNjKnV7xHIw9eEY7DJG1WHjaLSs/edit?usp=sharing">set of practices</a> meant to make schema and API changes as painless as possible for ROR users.</p>
<p>With that necessary work done, <strong>we are now proposing a number of significant changes to <a href="https://ror.readme.io/docs/ror-data-structure">metadata elements in ROR records</a></strong> &ndash; removing some, changing others, adding a few, and generally rationalizing the whole schema. We expect ROR schema 2.0 to be a major overhaul: we think it&rsquo;s best to do a lot of big changes at once and get them over with so that future changes can be less disruptive.</p>
<p>ROR&rsquo;s current top-level <a href="https://ror.readme.io/docs/ror-data-structure">data structure</a>:</p>
<figure class="blog-figure"><img src="/img/ror-schema-1-0.png"
         alt="ROR data structure with 16 elements including name, id, addresses, etc."/>
</figure>

<p>There are <strong>four major categories of alteration</strong> we&rsquo;re proposing for ROR schema 2.0:</p>
<ul>
<li>
<p>Standardizing, clarifying, and reorganizing <strong>name fields</strong> (<code>name</code>, <code>aliases</code>, <code>labels</code>)</p>
</li>
<li>
<p>Removing repetitive, excessively granular <strong>location</strong> information retrieved from the GeoNames API (sub-fields in <code>addresses</code>)</p>
</li>
<li>
<p>Rethinking <strong>web domain</strong> information (<code>email_address</code>, <code>ip_addresses</code>, <code>links</code>)</p>
</li>
<li>
<p>Restructuring <strong>external identifiers</strong> to allow future addition or removal of identifier types without schema changes (sub-fields in <code>external_ids</code>)</p>
</li>
<li>
<p>Adding <strong>administrative information such as created and last modified dates</strong> </p>
</li>
</ul>
<p>These suggested changes come from ROR users, from metadata best practices, and from our own sense of what would make most sense for ROR. </p>
<p>To give us feedback, <strong>please leave comments in the <a href="https://docs.google.com/document/d/1JNDMoKmjR2y0quWXwFfoJTsIttbltJVN0l5Wddw1cIk/edit?usp=sharing">ROR Schema v2.0 draft proposal </a></strong> Google doc. If you prefer to give feedback privately or cannot access the Google doc, you can email your comments to <a href="mailto:support@ror.org">support@ror.org</a>. Your thoughts are more than welcome &ndash; we rely on your input to guide us!</p>
<div class="container">

<iframe src="https://docs.google.com/document/d/e/2PACX-1vR0k1VZwRDO7fF6c9R0FJWRa2Dpjedn_5IXkjdjjYbBKSgsqUA6UTVPufFrvcqsXnQiJa2wwjK1tfMH/pub?embedded=true" width="100%" height="700" frameborder="0" allowfullscreen="allowfullscreen" allow="geolocation *; microphone *; camera *; midi *; encrypted-media *" class="responsive-iframe"></iframe>

</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We&amp;rsquo;ve run several calls for feedback in 2022, but here&amp;rsquo;s one you&amp;rsquo;ll definitely want to chime in on: our draft proposal of major changes to the ROR data model for schema version 2.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Handling Organization Status Changes in ROR</title><id>https://doi.org/10.71938/6395-0b14</id><link rel="alternate" href="https://ror.org/blog/2022-12-07-handling-org-status/"/><published>2022-12-07T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Liz Krznarich</name><uri>https://orcid.org/0000-0001-6622-4910</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>2022 was a big year for ROR in many ways, but it was especially notable from a tech standpoint. In early 2022, <a href="https://ror.org/blog/2022-03-17-first-independent-release/">we built tools and processes that allowed ROR to begin curating the registry separately from GRID</a>, which quickly opened possibilities for tackling projects that had been waiting on the back burner until ROR became fully independent.</p>
<p>One of these projects was handling cases where an organization’s status in ROR has changed - for example, it merged with another organization, it ceased to operate, or it was added to ROR in error.</p>


<h2 id="the-challenge-organizations-change-over-time">The challenge: Organizations change over time 
</h2>
<p>ROR’s data model previously did not support capturing status changes or information about successor organizations (in the case where another organization carries on the work of a defunct organization). This led to a backlog of curation requests to update records for organizations that had merged, split, or disbanded that couldn’t be processed.</p>
<p>Another sticky situation ROR faced was that the original MVR (minimum viable registry) approach for ROR and the temporary syncing arrangement with GRID that was in place while ROR developed its independent curation infrastructure did not account for how to update organization records that GRID removed. Each time ROR ingested a new GRID dataset, any organizations no longer present in the new GRID dataset were removed from ROR. As a result, 1,295 previously-assigned ROR IDs failed to resolve. Clearly a problem for an identifier intended to be persistent!</p>
<p>The practice of deactivating records in GRID also entailed stripping the metadata associated with the records, so we needed to come up with a solution for restoring these deactivated records in ROR, reflecting the organizations’ statuses and metadata appropriately, and ensuring the affected ROR IDs would resolve.</p>


<h2 id="developing-a-solution-it-takes-a-village">Developing a solution: It takes a village 
</h2>
<p>As with all things ROR, developing and implementing a solution took a village (er, a community). We’re grateful to all those who participated in the steps along the way to making this project a reality:</p>
<ul>
<li><strong>June 2022: <a href="https://docs.google.com/document/d/1CK3-Q9T1jeZ-CDvsAZoeg5Ng7ljzVau6iE_NzN8Kw88">Draft proposal</a> presented at community advisory meeting</strong>, opened for public comment through July 2022. This proposal contained several metadata options for expressing organization status changes and successor organizations, as well as options for changes to ROR tools/services.</li>
<li><strong>Aug 2022: <a href="https://docs.google.com/document/d/13zFXGOuaEcgJlXz6gw9IOZUaP_khZ0d5r0nODFDyFfE">Final draft proposal</a> presented at community advisory meeting</strong>
We received lots of input during the public comment period, and the community leaned heavily toward the simplest metadata solution: use the existing status and relationship fields.</li>
<li><strong>Aug 2022: Identified and recreated the removed records using historic GRID data</strong>, including their successor organizations (as identified by GRID)</li>
<li><strong>Oct 2022: <a href="https://github.com/ror-community/ror-roadmap/discussions/95">Beta testing</a> of the proposed approach</strong> in the API, web search interface and data dump changes</li>
<li><strong>Dec 2022: <a href="https://ror.readme.io/changelog/2022-12-01-organization-status-changes">Production release</a></strong> of the API, web search interface and data dump changes and restoration of GRID-removed records</li>
</ul>


<h2 id="putting-it-into-practice">Putting it into practice 
</h2>
<p>After six months of planning, community input and implementation, we are thrilled to say that  all the changes needed to support non-active organizations were released to production on 1 Dec 2022, and records removed by GRID that previously failed to resolve have been restored! These changes are all considered “non-breaking” and include:</p>
<ul>
<li>Two new values, <strong>inactive</strong> and <strong>withdrawn</strong>, appear in the status field in a small number of ROR records. Previously, all records in ROR had a status of <strong>active.</strong> Inactive indicates that an organization has ceased operation (or ceased producing research outputs), while withdrawn indicates that a record was created in error or determined by the ROR curation team to no longer be in scope for ROR.</li>
<li>Two new relationship types, <strong>Predecessor</strong> and <strong>Successor</strong>, appear in the relationships field in a small number of ROR records.</li>
<li>API and <a href="https://ror.org/search">web search interface</a> results now default to records with a status of <strong>active</strong> only.</li>
<li>New filters and parameters in the API and the <a href="https://ror.org/search">web search interface</a> allow users to find and exclude records based on status</li>
<li>Data dump continues to include all records (with all statuses/relationship types). The first data dump that includes the new statuses and relationships types is v1.15 (<a href="https://doi.org/10.5281/zenodo.7387951">https://doi.org/10.5281/zenodo.7387951</a>).</li>
</ul>
<p>Check out the <a href="https://ror.readme.io/changelog/2022-12-01-organization-status-changes">release notes</a>, our <a href="https://ror.readme.io/">documentation</a>, and the examples below for full details about the changes.</p>


<h2 id="examples">Examples 
</h2>


<h3 id="inactive-organization">Inactive organization 
</h3>
<p>JDSU (United States) <a href="https://ror.org/01a5v8x09">https://ror.org/01a5v8x09</a> split into two separate companies, and one of the new companies, Viavi Solutions (United States) <a href="https://ror.org/059a9e323">https://ror.org/059a9e323</a>, carries on its work. JDSU (United States) <a href="https://ror.org/01a5v8x09">https://ror.org/01a5v8x09</a> now has a status of inactive and points to Viavi Solutions (United States) <a href="https://ror.org/059a9e323">https://ror.org/059a9e323</a> as its successor. Viavi Solutions (United States) <a href="https://ror.org/059a9e323">https://ror.org/059a9e323</a> now includes a predecessor relationship to JDSU (United States) <a href="https://ror.org/01a5v8x09">https://ror.org/01a5v8x09</a>. <a href="https://github.com/ror-community/ror-updates/issues/485">See the curation request</a> that led to these changes.</p>
<figure><img src="/img/jdsu-inactive.png" width="100%"/>
</figure>



<h3 id="restored-record-that-previously-did-not-resolve">Restored record that previously did not resolve 
</h3>
<p>Midwestern University was assigned ROR ID <a href="https://ror.org/00zg0xv61">https://ror.org/00zg0xv61</a> and was subsequently removed by GRID. Prior to Dec 2022, its record did not resolve in ROR.</p>
<figure><img src="/img/grid-removed-404.png" width="100%"/>
</figure>

<p>The ROR curation team found that this record had been removed by GRID because it was a duplicate of an existing record with the same name <a href="https://ror.org/046yatd98">https://ror.org/046yatd98</a>, and should not have been added. <a href="https://ror.org/00zg0xv61">https://ror.org/00zg0xv61</a> now has a status of withdrawn and points to <a href="https://ror.org/046yatd98">https://ror.org/046yatd98</a> as its successor.</p>
<figure><img src="/img/midwestern-withdrawn.png" width="100%"/>
</figure>

<p>As always, please let us know of any issues or opportunities for improvement by submitting a <a href="https://github.com/ror-community/ror-roadmap/issues">Github issue to the ROR roadmap</a> or contacting us at <a href="mailto:support@ror.org">support@ror.org</a>. To request new ROR records or updates to existing records, use the <a href="https://curation-request.ror.org">curation request form</a>.  Note that we do accept requests for new records with a status of inactive, in order to support community use cases around matching affiliation information in older metadata to ROR IDs.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">2022 was a big year for ROR in many ways, but it was especially notable from a tech standpoint. In early 2022, we built tools and processes that allowed ROR to begin curating the registry separately from GRID, which quickly opened possibilities for tackling projects that had been waiting on the back burner until ROR became fully independent.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Case Study: ROR in the COS Open Science Framework</title><id>https://doi.org/10.71938/gdr4-d136</id><link rel="alternate" href="https://ror.org/blog/2022-11-28-case-study-osf/"/><published>2022-11-28T00:00:00+00:00</published><updated>2025-11-07T11:35:16-05:00</updated><author><name>Amanda French</name><uri>https://orcid.org/0000-0002-4325-1809</uri></author><category term="Case Studies"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>This conversation with Eric Olson of the Center for Open Science is the first in a new series of interviews, &ldquo;Case Studies in ROR Integration,&rdquo; a series designed to provide in-depth detail on why and how people are choosing to integrate ROR IDs into their systems.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h2 id="key-quotations">Key quotations 
  <a href="#key-quotations" aria-label="Key quotations">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="Key quotations"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h2>
<p>&ldquo;[W]hen we mint DOIs for all the content types that we have DOIs for – which right now are preprints, preregistration documents, and project spaces – we send the ROR IDs from the affiliation data that’s associated with the user profile with the metadata for each contributor.&rdquo;</p>
<p>&ldquo;The really cool stuff that we have just done with the ROR integration will allow us to continue to build in the direction of describing relationships.&rdquo;</p>
<p>&ldquo;[W]e want to be part of that movement: being able to visualize the relationships between one DOI and all the people and places and other things that it’s related to.&rdquo;</p>
<p>&ndash; Eric Olson, Product Manager, Center for Open Science</p>
</span></div>
	</div>
</div>


<h3 id="hahahugoshortcode-s1-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Thank you so much for agreeing to tell us about your ROR integration. Can you start by telling us your name, title, and organization?</p>


<h3 id="hahahugoshortcode-s2-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>I&rsquo;m Eric Olson. I am the Product Manager at the <a href="https://cos.io">Center for Open Science</a>.</p>


<h3 id="hahahugoshortcode-s3-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Do you work on a particular product? Or are you a product manager who works on multiple products?</p>


<h3 id="hahahugoshortcode-s4-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>We do have multiple products, but the product that we are actively developing and maintaining is the <a href="https://osf.io">Open Science Framework (OSF)</a>, which itself has many, many tools involved in that one platform. But you know, as a community-driven organization, there are other products in the landscape that are involved, like ROR. So that all falls under our infrastructure product team.</p>
<figure class="blog-figure"><img src="/img/blog/osf/cos-osf-infrastructure.png"
         alt="Pyramid schematic with five levels in the following order from bottom to top: infrastructure, user experience, communities, incentives, and policy. Also shown is the COS Infratructure vision statement: To empower communities, institutions, and funders to advance rigor and transparency of research."/><figcaption>
            <p>Graphic provided by the Center for Open Science</p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s6-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Tell us about the Center for Open Science.</p>


<h3 id="hahahugoshortcode-s7-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>Our mission is to increase the openness, integrity, and reproducibility of research. We are a culture change organization. And that mission can&rsquo;t be accomplished by any one change or any one stakeholder. The OSF, while it&rsquo;s extremely important to us and has over half a million users, is just one means of accomplishing our mission.</p>
<p>COS works on the diffusion of innovations. There are early adopters who, if you give them the most exciting practice, they see the value in it right away. And they want to run right to that. All of us in the infrastructure space, including in persistent identifiers, are in exactly the same boat. The question is, &ldquo;How do we even make it possible for researchers to do those innovative open science things that they&rsquo;re super excited about?&rdquo; They really want to do them, but they need something to enable those things to happen.</p>
<p>So on the infrastructure team, our primary goal is to make it possible for researchers to participate in the open practices that we are advocating. We get those early adopters, and because early adopters are usually very exciting, and what they&rsquo;re practicing is very exciting, they also pull people with them because they just have a gravity to how enthusiastic they are and can demonstrate benefits to what they&rsquo;re doing. So we don&rsquo;t just throw the tool at them and say, &ldquo;Go use this.&rdquo; We continue to iterate on it, make it better, make it easier for researchers to use, easier for the next group who might not be the earliest adopters, but who are willing to give this a chance if we make it continue to make it easier for them. And that&rsquo;s what we do at the infrastructure level.</p>
<p>We have other teams at the center that work with policymakers, funders, journals, publishers. They&rsquo;re advocating for practices and policies that align with the mission: openness, transparency, integrity, and reproducibility. We also have a research team, what we call the metascience team, that looks at the &ldquo;science of science&rdquo; and determines if the practices that we advocate and the tools that we provide are actually moving us toward the mission. And if they aren&rsquo;t: Do we need to take a step back and figure out what we&rsquo;re missing? Where are we not meeting researchers where they are?</p>
<p>Our theory of change for creating more openness is that researcher communities determine that &ldquo;this is the thing we should do.&rdquo; And researchers don&rsquo;t need to be forced to do it, even though they might have been encouraged to do it by their funders and their institutions. We give them the tools to do it. And then those communities just make that a normal thing that researchers are expected to do, and then the rest of their community is going to continue to come along with it, because that&rsquo;s the expectation. So it&rsquo;s getting these open, transparent practices in scholarly communication to be among those norms that that researchers and research communities and research institutions normalize as part of their practices and their expectations.</p>


<h3 id="hahahugoshortcode-s8-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great. So since we are talking specifically about the ROR integration into OSF, can you tell us a little bit more about OSF, the Open Science Framework?</p>


<h3 id="hahahugoshortcode-s9-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>The OSF is a research, management, and planning platform that, as I mentioned before, has a lot of tools, because it wants to provide tools at each stage of the research lifecycle &ndash; elements that the researchers can rely on so that they can bundle lots of stuff: their planning, their files, their data that&rsquo;s an outcome or an input into their research, their collaborators, their protocols. All those things might have previously been scattered in lots of different places, so if someone asked them, &ldquo;Can you put all this together for me along with your paper so that I can try to reproduce your study?&rdquo;, the amount of effort involved, if it were even possible at all, would be pretty significant. Whereas on the OSF, you can have your data stored on the OSF or linked to another storage provider, you can write out your protocols, you can do what we call &ldquo;pre-register&rdquo;, plan out your studies even before you go and gather your data and document them with a timestamp. And have all of those be discoverable and do all that in one location. And we don&rsquo;t want to duplicate what other tools do, so we integrate with storage providers, with citation managers, and with persistent identifier infrastructure to take advantage of the really cool stuff they already do that our users already use. And so those just become part of those available workflows in the OSF.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-screenshot.png"
         alt="Screenshot of Open Science Framework with text There&#39;s a better way to manage your research - OSF is a free, open platform to support your research and enable collaboration"/><figcaption>
            <p><a href="https://www.osf.io">https://www.osf.io</a></p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s11-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>So let&rsquo;s move on to ROR, The Research Organization Registry. Do you remember where and when you personally first heard about ROR?</p>


<h3 id="hahahugoshortcode-s12-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>Yeah, I can. And it&rsquo;s been really exciting over the last year or so to see so much manifesting in the ROR space. Because I first heard of it in 2018 while I was on the ORCID team. We obviously heard a lot about persistent identifier progress in the community, and at <a href="https://www.pidapalooza.org/past-events">PIDapalooza in Girona, Spain</a>, there was a kickoff of &ldquo;What is this thing going to be?&rdquo; And I was fortunate that I was there early and was able to attend that with several members of the team that you&rsquo;re on now, like <a href="/liz-krznarich">Liz Krzarnich</a>.</p>
<p>And it was really cool to see very early what <a href="https://ror.org/blog/2018-12-02-the-ror-of-the-crowd/">the anticipated concerns were</a>. Not just with the technology, really, that was not what they were concerned about at all. It was &ldquo;How do we make this continue to be a community-driven initiative like ORCID and Crossref and DataCite?&rdquo; You know, we&rsquo;re already living that! So it&rsquo;s &ldquo;How do we create something that is not going to reproduce the problems that the other organization identifiers have, but instead can be something that is community-owned and governed and understood?&rdquo; I&rsquo;ve always been a fan of PIDs that can represent the key elements of our work in scholarly communication. And there were previously and there are still other organization identifiers, but to see one that actually aligns with what the rest of the PID infrastructure that we&rsquo;re working with in scholcomm was really a huge opportunity. And it&rsquo;s really exciting to see it all take off and be where we are.</p>


<h3 id="hahahugoshortcode-s13-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>ROR didn&rsquo;t exist at that point as a technology.</p>


<h3 id="hahahugoshortcode-s14-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>No. I saw the first version at the next PIDapalooza. We saw <a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/">the first beta of the ROR index</a> the next year in Dublin.</p>


<h3 id="hahahugoshortcode-s15-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>And we&rsquo;re now at the stage where although there are plenty of things we can do, the tech is really very mature, compared to then.</p>


<h3 id="hahahugoshortcode-s16-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>Yeah. It was barely past the <a href="https://ror.readme.io/docs/grid">inheriting GRID</a> phase at the time. And there were still lots of questions about &ldquo;What do we represent here?&rdquo; Hierarchy was always a big debate, and it still is.</p>


<h3 id="hahahugoshortcode-s17-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Yep, <a href="/about/faqs/#is-my-organization-in-scope-for-ror">yes indeed</a>. So who were the primary advocates of implementing ROR at your organization?</p>


<h3 id="hahahugoshortcode-s18-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p><a href="https://www.cos.io/blog/how-osf-meets-desirable-characteristics-for-data-repositories">The OSF has supported PIDs for years</a> &ndash; DOIs and even Ark IDs have been in the OSF for years, though not always implemented in the way that they are now. We always knew that this was the direction that the community was headed, and we&rsquo;re carving out the space for it. And ORCID IDs were also added prior to my arrival. And when I got here, ROR just seemed like a really great opportunity, because we were working with these institutional partners to give them insights into what their communities are already doing on the OSF. And because we have high confidence that those users are part of those member institutions, that we know who they are, it just made sense to also send ROR IDs for all these DOIs we&rsquo;re minting on behalf of the users that come from those institutions. And so for those institutions, it instantly adds some additional value for them.</p>
<p>And it starts to complete the loop, you know, for PIDs. <a href="https://doi.org/10.5860/crln.83.9.398">We have PIDs now for people, places, and things</a> to go along with globally unique identifiers that we use within the platform. There&rsquo;s still room for us to grow there and make it more nuanced and add more identifiers, and those are coming. But for the product team, ROR was definitely on our radar very early on when we were aligning with our institutional partners a couple of years ago.</p>


<h3 id="hahahugoshortcode-s19-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Why did you decide to implement ROR at the time you did? What was it that sort of pushed you over the edge from being a supporter to being an integrator?</p>


<h3 id="hahahugoshortcode-s20-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>The OSF has so many workflows, and so many tools that we support, and so many integrations with other PID providers and with other storage and citation managers, that prioritization of new integrations in particular is a bit of a negotiation for us internally. As we continue to mature the OSF so that that promise of having tools across the research lifecycle can really be met, the key is iterating sort of a piece at a time so that as we mature, we don&rsquo;t sacrifice other parts of our infrastructure. Having some maturity on our institutional models was part of that. And we continue to work on making those services better and providing more reliable metrics for our institutional partners. And this was a part of that: it provides more insights for our partners, our institutional members, while also continuing to implement more parts of <a href="https://www.project-freya.eu/en/pid-graph/the-pid-graph">the PID Graph</a>.</p>
<p>Since then, we&rsquo;ve added a couple of more pieces to this, so it&rsquo;s kind of PID dominoes. Starting earlier this year, it just made sense to start doing these updates and getting the metadata ready to take in more information than we ever did previously. We <a href="https://github.com/CenterForOpenScience/osf.io/pull/9810/commits">made a lot of changes</a> to what kinds of information we can collect from users, or know about users and their content, so that we could send those to DataCite and Crossref in the most effective ways. And we had to align all of that up so that it works, without making assumptions about a user and their data. It took a bit of trial and error there. But now we&rsquo;re in a place where we have more metadata options that are coming soon, and we can continue to send this new metadata to DataCite and Crossref.</p>
<p>So really, the timing lined up, because we have a lot of PID and metadata projects that we&rsquo;ve just completed or that we&rsquo;re in the process of completing through the beginning of next year that are going to support a lot of researcher needs as well as funder and institutional needs. Being able to see Who funded this research? What institution is it associated with? Who are the collaborators? All of those will be able to have PIDs, so it&rsquo;s not a mystery anymore to unlock the box. Where&rsquo;d this come from? What&rsquo;s it about? Who provided support for it? It&rsquo;ll all be reliable connections that describe those relationships.</p>


<h3 id="hahahugoshortcode-s21-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I really like your image of PID dominoes, because you hear all these metaphors about networks and ecosystems and landscapes that try to capture how PIDs interact: a DOI for the output and a ROR for the institution and an ORCID for the researcher. But there is a sense in which everyone needs to adopt all the PIDs at once in order to create the full picture.</p>
<figure class="blog-figure"><img src="/img/blog/osf/domino_effect_003.gif"
         alt="Domino effect"/><figcaption>
            <p><a href="https://commons.wikimedia.org/wiki/File:Domino_effect_003.gif">https://commons.wikimedia.org/wiki/File:Domino_effect_003.gif</a></p>
        </figcaption>
</figure>



<h3 id="hahahugoshortcode-s23-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>Yeah, and it&rsquo;s overwhelming to hear that if you&rsquo;re an institution, you know, as just one corner of all the places that provide and use PIDs. To say, &ldquo;You have to get everybody on your campus signed up for ORCID IDs, get their affiliations on, and make sure they <a href="https://info.orcid.org/documentation/integration-guide/syncronizing-with-orcid/">have their syncs turned on</a>&rdquo; &ndash; which are just miracle workflows, you know, but you still have to get them to do it, and it sounds like a whole lot. And so part of why we at the Center for Open Science do this is if we&rsquo;re going to advise that these are good things to have your researchers do, this is part of open science, it really only makes sense that we would enable the things that we&rsquo;re advising them to do. So it&rsquo;s just walking the walk that we articulate as the right path.</p>


<h3 id="hahahugoshortcode-s24-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>I&rsquo;m going to ask you now to describe how you&rsquo;ve implemented ROR in your system. Where does it fit into your workflow? Does it fit into multiple places in your workflow? Can users see the ROR ID? What ROR tech are you using? Is it the API or the data dump? I&rsquo;m actually also interested in this idea of affiliation verification since you&rsquo;ve mentioned that a couple of times. Can you talk about why that&rsquo;s important for you?</p>


<h3 id="hahahugoshortcode-s25-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>Currently, the way we have this implemented is that we have a service within the OSF where institutions can choose to have public content from their affiliated researchers be visually identified with that institution.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-institutions.png"
         alt="OSF Institutions graphic with the tagline &#39;OSF Institutions increases rigor and transparency across the research lifecycle by revealing researcher activity, institutional affiliations, and trends to enable more effective compliance and reporting&#39; "/><figcaption>
            <p>Graphic provided by the Center for Open Science</p>
        </figcaption>
</figure>

<p>And the way we do that is that an institution <a href="https://www.cos.io/products/osf-institutions">becomes a Service Partner with us</a>, and we set up single sign-on (SSO) into the OSF using their credentials, using their systems, which for a number of institutions is a critical part of identity management. I won&rsquo;t speak for all institutions here, but for some of them, they don&rsquo;t want users to be using a lot of tools without something that gives the institution the ability to give the &ldquo;go or no go.&rdquo; And so that&rsquo;s what we provide, is a single sign-on access for each institutional service partner. And if the user attempts to sign in as a member of an institution that they don&rsquo;t belong to, then they will not get access to the OSF that way, although they can access it in a different way.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-institutions-login.png"
         alt="OSF Institutions login screen with the tagline &#39;If your institution has partnered with OSF, please select its name below and sign in with your institutional credentials. If you do not currently have an OSF account, this will create one for you.&#39; "/><figcaption>
            <p><a href="https://accounts.osf.io/login?campaign=institution&amp;institutionId=&amp;service=">https://accounts.osf.io/login?campaign=institution&amp;institutionId=&amp;service=</a></p>
        </figcaption>
</figure>

<p>If they do sign in as a member of an institution, then they now have that institution associated with their profile. And they can be part of multiple institutions; I myself am part of two.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-institutions-admin.png"
         alt="OSF Institutions administrative dashboard page showing the page for the Center for Open Science with users, user handles, departments, and projects listed"/><figcaption>
            <p>Graphic provided by the Center for Open Science</p>
        </figcaption>
</figure>

<p>And when they are working on content, they can add a visual brand based on their affiliations, and they can add that branding to their content.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-registration-nosek.jpg"
         alt="Screenshot of the OSF pre-registration &#39;Investigating variation in replicability: A “Many Labs” Replication Project&#39; "/><figcaption>
            <p><a href="https://osf.io/scayl/resources">https://osf.io/scayl/resources</a></p>
        </figcaption>
</figure>

<p>And then publicly those will all aggregate on the OSF side on a discovery page associated with that institution.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-cos-projects.png"
         alt="OSF Institutions browse page for the Center for Open Science"/><figcaption>
            <p><a href="https://osf.io/institutions/cos">https://osf.io/institutions/cos</a></p>
        </figcaption>
</figure>

<p>The user and the content can have multiple affiliations. If you and I are both creators of the same piece of content, we can each have a different affiliation, and we can each have multiple affiliations. But I can&rsquo;t make that claim on your behalf; you&rsquo;d be adding it yourself. And so that&rsquo;s how we deal with it within the OSF&rsquo;S ecosystem. We verify based on that single sign-on that a user is associated with an institution.</p>
<p><strong>And then when we mint DOIs for all the content types that we have DOIs for &ndash; which right now are preprints, preregistration documents, and project spaces &ndash; we send the ROR IDs from the affiliation data that&rsquo;s associated with the user profile with the metadata for each contributor.</strong></p>
<p>So you&rsquo;ll see for each contributor what their affiliations are rather than getting them all blended together in the metadata that we send to Crossref and to DataCite.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-datacite-nosek.png"
         alt="OSF DOI metadata in DataCite showing ROR IDs for creator Brian Nosek for both the University of Virginia and the Center for Open Science"/><figcaption>
            <p>OSF DOI metadata in DataCite - <a href="https://api.datacite.org/application/vnd.datacite.datacite+json/10.17605/osf.io/scayl">https://api.datacite.org/application/vnd.datacite.datacite+json/10.17605/osf.io/scayl</a></p>
        </figcaption>
</figure>

<figure class="blog-figure"><img src="/img/blog/osf/osf-crossref-nosek.png"
         alt="OSF DOI metadata in Crossref showing ROR IDs for creator Brian Nosek for both the University of Virginia and the Center for Open Science"/><figcaption>
            <p>OSF DOI metadata in Crossref <a href="http://api.crossref.org/works/10.31234/osf.io/xahgh">http://api.crossref.org/works/10.31234/osf.io/xahgh</a></p>
        </figcaption>
</figure>

<p>So that&rsquo;s where we&rsquo;ve started, because we know that those individuals are part of those institutions, so we have very high confidence that those affiliations belong with those individuals and their content.</p>
<p>And we are looking at ways to implement ROR beyond that so that you can self-identify your institution. There are a number of technical things we have to work through to do that, so that we&rsquo;re enabling them in ways that for one takes advantage of the <a href="https://ror.readme.io/docs/rest-api">ROR API</a> so that <a href="https://ror.readme.io/docs/forms">users choose based on a ROR query rather than a free-text field</a>, which certainly would not help us verify anything, because it could be wrong, or just different than the way the institution includes their information. So that&rsquo;s something we&rsquo;re still working on. Currently, we associate a ROR ID with each of our current institutional partners and with each new one that comes aboard. And, yeah, that&rsquo;s it.</p>


<h3 id="hahahugoshortcode-s33-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Gotcha. What challenges did you run into as you were integrating ROR into your systems?</p>


<h3 id="hahahugoshortcode-s34-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>Part of the challenge was that <a href="https://github.com/CenterForOpenScience/osf.io/pull/9810">we were adding ROR to the metadata that we send to DataCite and Crossref</a> at the same time as we were adding other additional information to send and preparing for even more information that&rsquo;s coming. Well, actually, some of that we&rsquo;ve added since we did the ROR work, and there&rsquo;s more coming soon. So we made a lot of changes and prepared for even more changes all at the same time. And so each time we would, you know, try to make the change, as soon as we hit one error, the whole thing would fail &ndash; which was good, because we didn&rsquo;t want to upload a bunch of data that was wrong. So it took us three attempts to get it all lined up and ready to be sent properly and update it so that the ROR IDs are available for all content going forward as well as backfilling content that is eligible for ROR IDs to be added to that metadata. So we updated a lot of DOIs with that rollout.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>
<h3 id="cos-dois-with-ror-ids-as-of-11112022">COS DOIs with ROR IDs as of 11/11/2022 
  <a href="#cos-dois-with-ror-ids-as-of-11112022" aria-label="COS DOIs with ROR IDs as of 11/11/2022">
<svg xmlns="http://www.w3.org/2000/svg" viewBox="0 0 640 512" class="ha ha-link" width="16" height="16" alt="COS DOIs with ROR IDs as of 11/11/2022"><path d="M579.8 267.7c56.5-56.5 56.5-148 0-204.5c-50-50-128.8-56.5-186.3-15.4l-1.6 1.1c-14.4 10.3-17.7 30.3-7.4 44.6s30.3 17.7 44.6 7.4l1.6-1.1c32.1-22.9 76-19.3 103.8 8.6c31.5 31.5 31.5 82.5 0 114L422.3 334.8c-31.5 31.5-82.5 31.5-114 0c-27.9-27.9-31.5-71.8-8.6-103.8l1.1-1.6c10.3-14.4 6.9-34.4-7.4-44.6s-34.4-6.9-44.6 7.4l-1.1 1.6C206.5 251.2 213 330 263 380c56.5 56.5 148 56.5 204.5 0L579.8 267.7zM60.2 244.3c-56.5 56.5-56.5 148 0 204.5c50 50 128.8 56.5 186.3 15.4l1.6-1.1c14.4-10.3 17.7-30.3 7.4-44.6s-30.3-17.7-44.6-7.4l-1.6 1.1c-32.1 22.9-76 19.3-103.8-8.6C74 372 74 321 105.5 289.5L217.7 177.2c31.5-31.5 82.5-31.5 114 0c27.9 27.9 31.5 71.8 8.6 103.9l-1.1 1.6c-10.3 14.4-6.9 34.4 7.4 44.6s34.4 6.9 44.6-7.4l1.1-1.6C433.5 260.8 427 182 377 132c-56.5-56.5-148-56.5-204.5 0L60.2 244.3z"/></svg>
  </a>
</h3>
<ul>
<li><a href="https://api.datacite.org/dois?affiliation=true&amp;query=(creators.affiliation.affiliationIdentifier:*%20AND%20NOT%20creators.affiliation.affiliationIdentifier:%22%22)%20OR%20(contributors.affiliation.affiliationIdentifier:*%20AND%20NOT%20contributors.affiliation.affiliationIdentifier:%22%22)&amp;client-id=cos.osf">COS DataCite DOIs with ROR IDs </a>: 11545</li>
<li><a href="https://api.crossref.org/works?query.publisher-name=Center+for+Open+Science&amp;filter=has-ror-id:t&amp;rows=0">COS Crossref DOIs with ROR IDs</a>: 1634</li>
</ul>
</span></div>
	</div>
</div>
<p>I guess the key challenge is that we did more than ROR all at once, so we had to really get it lined up properly in order for it to work.</p>


<h3 id="hahahugoshortcode-s36-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>What benefits to you and your users is this ROR integration currently providing?</p>


<h3 id="hahahugoshortcode-s37-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p>A big part of it for us is that this is something that we get to tell our partners and other institutions beyond us. Implementing the PID graph, as we talked about before, creates a lot of opportunities. If you have the ORCID IDs and you&rsquo;re minting DOIs, or if you&rsquo;re using some other identifier for a specific use case, and you can use ROR IDs, then you have a lot of information that&rsquo;s unlocked that way.</p>
<p>And these things are <em>going</em> to be used. If they&rsquo;re not in use yet, they&rsquo;re <em>going</em> to be adopted by the other stakeholder groups that researchers are working with and rely on, like publishers and like funders. These things are coming. So we want to &ndash; not just be out ahead, that&rsquo;s not the goal, but we want to say that we are advocates of this adoption, and that we&rsquo;re so confident in the reliability of this great infrastructure that we&rsquo;re going to do it. And we&rsquo;re going to do it now: we&rsquo;re not going to wait for it to get to the tipping point. Even though we&rsquo;re on the precipice of that, I&rsquo;m sure. But we want to get there and help that happen.</p>
<p>For some of our partners, this is not really something that&rsquo;s on their radar yet. Or it might be, but it&rsquo;s under the surface a little bit. And that&rsquo;s okay. We&rsquo;re still going to talk about it, that this is something that&rsquo;s really advantageous. And they&rsquo;ll be seeing more of it one way or another, so we want to enable those things in our workflows, enable ROR, enable more relationships between DOIs and other DOIs, and with person and place IDs and grant and funder IDs, those are all either there already, or coming.</p>
<p>And then it&rsquo;s about all those lifecycle pieces in the OSF. Getting to a consistency with these cool features, so that they&rsquo;re not just in one part of OSF but in all the tools in the platform.</p>
<figure class="blog-figure"><img src="/img/blog/osf/osf-research-lifecycle.png"
         alt="OSF research lifecycle circular graphic showing ten steps in the research lifecycle from Search and Discover to Publish Report"/><figcaption>
            <p>Graphic provided by the Center for Open Science</p>
        </figcaption>
</figure>

<p>We&rsquo;re still catching up with updating some parts of the OSF to take advantage of all this cool stuff. But that really is the goal, is to be able to demonstrate the things that we articulate as very valuable. We want to take part in those practices not just because they benefit our tech, although we thoroughly believe that they do, but because we want to lead by example. It&rsquo;s part of our theory of change to help not just the early adopters, but everyone &ndash; to make it easier for researchers to find the change they need, to see that these things are possible.</p>
<p>And if the OSF can provide that, that&rsquo;s great. But if someone just reads about all this cool stuff that we&rsquo;re talking about and realizes that another tool is actually better because it does these things in a different way, that&rsquo;s okay too. That&rsquo;s still pushing toward our mission, in our theory of change, when people work with other tools, tools that a lot of times we work with ourselves. So I think that&rsquo;s the key, is that it&rsquo;s not just a benefit to the OSF but really to our entire mission. And this is really a critical opportunity to describe the change we want to see in open infrastructure.</p>


<h3 id="hahahugoshortcode-s39-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>So that leads right into the next question, which is: What benefits &ndash; maybe specifically tools, reports, that kind of thing &ndash; do you hope or expect to get from ROR integration in the future?</p>


<h3 id="hahahugoshortcode-s40-hbhb-eric-olson"><figure class="round-figure"><img src="/img/blog/osf/eric-olson-100.png"
         alt="Avatar of Eric Olson"/>
</figure>
 Eric Olson 
</h3>
<p><strong>The really cool stuff that we have just done with the ROR integration will allow us to continue to build in the direction of describing relationships.</strong> We&rsquo;ve already added ways to describe relationships in our registration workflows: you can add the data and the materials and the code, all the outcomes or inputs into your research, you can link the DOIs and describe those relationships. Having the ROR IDs and the ORCID IDs be effectively implemented and displayed for both ORCID and ROR is something we still have to get to. But that&rsquo;s something we have for one-quarter of the workflows that we have.</p>
<p>And now the question is: Can we enable this for more parts of the platform so that there are even more ways to describe all of the things, whether they&rsquo;re on the OSF or not? That&rsquo;s the key here: you can have your things stored with a DOI somewhere else, you can have your code on Zenodo, you can have your data stored in your institutional repository, you can have your paper published by <em>Nature,</em> all of those things you have different DOIs for that articulate the connection to your preregistration, and then we send all that to DataCite, including the ROR ID if we have it. And so right now the task is to capture all of this information.</p>
<p>And then, you know, really what we are looking for, just like many other institutions and tools, is How can we learn from what&rsquo;s been captured? How do we provide a picture, you know, a window into that? We have analytics tools that are available in the OSF currently, but they aren&rsquo;t getting into this yet, where they are exploring these relationships. And that&rsquo;s going to be going to be a while yet, but that&rsquo;s really what we&rsquo;d like to take advantage of. And we&rsquo;re not alone in this, because we have open APIs that can get a lot of this information, so we don&rsquo;t have to be the ones that develop it, even.</p>
<p><strong>But we want to be part of that movement: being able to visualize the relationships between one DOI and all the people and places and other things that it&rsquo;s related to.</strong> And that really is the promise of the PID graph: being able to click a button and see what other institutions and what other regions worked on this project. Is there data connected to this paper? Who are all the individuals that contributed, and then if I click on the individual, can I see all their other stuff? How do I break that open? There&rsquo;s still work to do there, but we&rsquo;re getting the data to enable it now. We&rsquo;re at that point where we&rsquo;re collecting what we want. The Funder IDs are coming soon. So it&rsquo;s going to be rolling into the next step of exposing all of that in creative and effective ways. That will really be the next step. But all this data is going to be available, so anybody can start to explore it. And our role, just like we said before, will be to make that easier as a next step to get at that information.</p>


<h3 id="hahahugoshortcode-s41-hbhb-amanda-french"><figure class="round-figure"><img src="/img/amanda-sq-100.png"
         alt="Avatar of Amanda French"/>
</figure>
 Amanda French 
</h3>
<p>Great! This was tremendously interesting &ndash; thanks for telling us all about your ROR integration.</p>
<div class='callout green'>
	<div class="row">
		<div class="col-md-12"><span>Questions? Want to be featured in a ROR case study? Contact <a href="mailto:support@ror.org">support@ror.org</a>.</span></div>
	</div>
</div>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">This conversation with Eric Olson of the Center for Open Science is the first in a new series of interviews, &amp;ldquo;Case Studies in ROR Integration,&amp;rdquo; a series designed to provide in-depth detail on why and how people are choosing to integrate ROR IDs into their systems.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">SCOSS Selects ROR as Essential Open Infrastructure</title><id>https://doi.org/10.71938/y947-eq16</id><link rel="alternate" href="https://ror.org/blog/2022-11-22-scoss-selects-ror/"/><published>2022-11-22T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>We are thrilled to announce that ROR has been selected by <a href="https://scoss.org">SCOSS</a> as essential open infrastructure and will be highlighted in the latest SCOSS campaign, which <a href="https://scoss.org/4thpledgingroundannouncment/">kicked off this week</a>.</p>
<p>SCOSS - the Global Sustainability Coalition for Open Science Services - is an initiative that focuses on identifying non-commercial infrastructures that are key for open science and worthy of community investment. In order to be selected, applicants must go through a competitive and rigorous evaluation process, and only a few infrastructures are approved every cycle.</p>
<p>It is an honor for ROR to receive this selection, and we are excited that ROR is continuing to be recognized as essential open scholarly infrastructure.</p>
<p><em>Image credit: <a href="https://twitter.com/scossfunding">@scossfunding</a> on Twitter</em>
<figure><img src="/img/scoss-card.png" width="75%"/>
</figure>
</p>
<p>With this selection, ROR joins the esteemed company of other community-based open infrastructure initiatives such as <a href="https://www.lareferencia.info/en/">LA Referencia</a> and <a href="https://datadryad.org/">Dryad</a> (both selected for the current cycle), along with past awardees <a href="https://doaj.org/">DOAJ</a>, <a href="https://v2.sherpa.ac.uk/romeo/">Sherpa Romeo</a>, <a href="https://www.doabooks.org/">DOAB</a>, <a href="https://pkp.sfu.ca/">PKP</a>, <a href="https://opencitations.net/">OpenCitations</a>, <a href="https://arxiv.org/">arXiv</a>, <a href="https://www.redalyc.org/">Redalyc</a>/<a href="http://amelica.org/">AmeliCA</a>, and <a href="https://dspace.lyrasis.org/">DSpace</a>. ROR also carries the distinction of being the first persistent identifier (PID) to receive SCOSS recognition!</p>
<p>SCOSS&rsquo;s selection of ROR is another prominent signal of ROR&rsquo;s growing presence in coordinated infrastructure strategies, such as those described in recent reports by <a href="https://research.jiscinvolve.org/wp/2022/07/13/moving-ahead-with-the-uk-national-pid-strategy/">Jisc</a> and the <a href="https://ardc.edu.au/article/strategic-investment-in-identifiers-could-save-24-million-and-38000-person-days-per-year/">Australian Research Data Commons</a>, and in recommendations outlined by US research data stakeholder groups to <a href="https://www.arl.org/resources/implementing-effective-data-practices-stakeholder-recommendations-for-collaborative-research-support/">implement effective data practices</a> and <a href="https://www.aau.edu/accelerating-public-access-research-data">accelerate public access</a>.</p>
<p>As an early signatory to the <a href="/blog/2020-12-16-aligning-ror-with-posi">Principles of Open Scholarly Infrastructure</a>, ROR has been committed to carrying out a careful and responsible approach to managing the initiative. We began the process of applying for SCOSS at the same time that we were firming up an overarching sustainability model for ROR that we <a href="/blog/2022-10-10-strengthening-sustainability/">announced in October</a> and that is based on a shared resourcing commitment by  ROR&rsquo;s three operating organizations <a href="https://cdlib.org">California Digital Library</a>, <a href="https://www.crossref.org">Crossref</a>, and <a href="https://datacite.org">DataCite</a>. We knew that we would need to have a working sustainability framework in place regardless of whether our SCOSS application was successful, and that if the SCOSS application was successful, it would be an opportunity to highlight how our three organizations&rsquo; shared commitment can be supplemented by outside investment by community stakeholders.</p>
<p>ROR has always and will always continue to be community-led open infrastructure. Broadening the base of community investment in ROR is one way to ensure that our infrastructure is sustained and secured as a common good for us all.</p>
<p>We are grateful to SCOSS for recognizing ROR as essential open infrastructure, and we look forward to continuing our work to support broad adoption of open persistent identifiers and rich metadata across the research ecosystem.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We are thrilled to announce that ROR has been selected by SCOSS as essential open infrastructure and will be highlighted in the latest SCOSS campaign, which kicked off this week.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Strengthening ROR’s Sustainability Model</title><id>https://doi.org/10.71938/hbxz-1621</id><link rel="alternate" href="https://ror.org/blog/2022-10-10-strengthening-sustainability/"/><published>2022-10-10T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>ROR Leadership Team</name><uri>https://ror.org/about/team/#operations-team</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Since its launch in 2019, ROR has been charting a unique path toward sustainability. In this blog post, we want to provide an update for the community on where this path is taking us.</p>


<h2 id="journey-to-sustainability">Journey to sustainability 
</h2>
<p>In response to ROR&rsquo;s growing needs and in anticipation of future ones, we have devoted time over the past few years to developing a stronger sustainability model that can support ROR over the long term while also allowing ROR to remain a community-based, low-overhead operation focused  on providing infrastructure that is freely and openly available to all.</p>
<p>Today, we are excited to announce that ROR is continuing on its path toward sustainability by formalizing a <strong>resourcing model based on long-term support from its three operating organizations</strong>, as opposed to relying on outside revenue sources.</p>
<p>This is not a major change from the status quo, but it is a significant development in that it clarifies our strategy going forward and addresses some questions and uncertainties about the future.</p>
<p>In short, what this shift means is that <strong>ROR’s governing organizations are re-committing to ROR and have agreed to share full responsibility for covering ROR’s core operating costs</strong>. It also means that ROR is not currently planning to cover primary operating expenses by relying on revenue from future paid services or time-limited grant funds.</p>


<h2 id="how-we-got-here">How we got here 
</h2>
<p>ROR itself is not an organization and does not intend to become one. It launched, and continues to exist, as a collaborative initiative led by <a href="https://cdlib.org">California Digital Library</a>, <a href="https://www.crossref.org">Crossref</a>, and <a href="https://datacite.org">DataCite</a>. Our three organizations share responsibility for ROR’s governance and strategic decision-making, a responsibility we formalized in a Memorandum of Agreement signed in 2020.</p>
<p>As part of our shared commitment to ROR, our three organizations have provided limited in-kind resources to partially cover ROR’s operating expenses over the years. This in-kind support has been supplemented with grant funds (<a href="https://www.imls.gov/grants/awarded/lg-246305-ols-20">IMLS</a> and <a href="https://www.nsf.gov/awardsearch/showAward?AWD_ID=2031172">NSF</a>) and <a href="/supporters">contributions from community stakeholders</a>.</p>
<p>ROR&rsquo;s hybrid resourcing model has helped us grow rapidly and has allowed us to be nimble and adaptable in our first few years. However, while ROR has been and continues to be a relatively small operation, it still requires resources to be successful and sustainable, especially as more users depend upon it. Infrastructure costs are rising with more usage of the ROR API, and the ROR team is expanding to be able to support ROR’s technical development, user community, and curation activities.</p>
<p>As we worked toward finalizing a sustainability model, a few factors were key in this process.</p>
<p>First of all, in line with our commitment to the <a href="https://openscholarlyinfrastructure.org">Principles of Open Scholarly Infrastructure</a> (POSI), we did not want to be dependent on grants for long-term and/or core operating expenses. (We remain very  grateful to IMLS and NSF for supporting key development work over the past two years to support curation and adoption.) And in line with POSI, we were clear that any paid revenue would need to come from additional services to be designed and developed, not from ROR&rsquo;s core data.</p>
<p>Second, we acknowledged that such paid services would take time and resources to scope and develop, and would distract from supporting core functionality. Moreover, given ROR’s aim to provide a freely and openly available solution to the problem of identifying research affiliations, a paid service is not necessarily in line with such a mission. We did not want to invest time and resources in developing paid services just for the sake of doing so.</p>
<p>Lastly, while <a href="/supporters">community fundraising has been successful</a>, it has not proven to be sufficient for covering all expenses. Fundraising also requires a lot of overhead to administer. We have also experienced challenges in explaining the need to obtain contributions for something that is ostensibly “free”.</p>
<p>Other potential revenue-generation options that might be relevant in other situations were never on the table for ROR&ndash;i.e., ROR is not set up to become a separate membership organization and thus does not provide organizational memberships.</p>
<p>This experience led to a realization of how ROR is inextricably tied to our three organizations’ strategic goals, and vice versa. For instance:</p>
<ul>
<li>ROR will only be successful if there is widespread inclusion of ROR IDs in DataCite and Crossref metadata. Likewise, DataCite and Crossref benefit from having the clean and consistent affiliation data that ROR IDs provide.</li>
<li>ROR’s success also depends on institutions such as California Digital Library leveraging and advocating for ROR IDs in publishing workflows, publisher agreements, citation databases, and research information systems. California Digital Library likewise needs&ndash;and benefits from&ndash;the open availability of affiliation metadata that <a href="https://osc.universityofcalifornia.edu/2022/08/pathways-to-oa-open-infrastructure/">allows for University of California research outputs to be discoverable</a>.</li>
</ul>
<p>Therefore, we realized that the <strong>optimal way to position ROR for success and sustainability would be for our organizations to commit to supporting ROR independently</strong>. With this approach, we can set up ROR for long-term resourcing, free up time and focus by not pursuing additional revenue-generation options, and remove uncertainties about being able to fund staff roles or infrastructure costs in the future.</p>


<h2 id="looking-ahead-whats-next">Looking ahead: What’s next? 
</h2>
<p>While the mechanics of ROR’s sustainability model have evolved, ROR’s core activities have not. The ROR team will continue with its activities already underway and in consultation with the community.</p>
<p>To formalize our organizations’ commitment to ROR, we have updated the memorandum of agreement that guides our collaboration, and we have <a href="/documents/ROR-Memorandum-of-Agreement-2022.pdf">published this document on the ROR website</a>.</p>
<p>ROR will also continue to be a community-driven and community-supported initiative. A wide network of stakeholders around the world have helped to support ROR over the past years by building integrations, raising awareness, providing feedback on future directions, and contributing funds to help ROR grow. Investments from community stakeholders are a key part of signaling the value and importance of ROR as shared open infrastructure, and many organizations are motivated to support open initiatives such as ROR. While ROR’s operating organizations are committing to fund the registry’s core expenses, community stakeholders are welcome to continue <a href="/sustain">contributing to ROR</a> if they wish to do so, and these investments will be used to offset operating costs and support special projects and initiatives. ROR may still pursue grant funding in the future, and such funds would only be used for the same purpose.</p>
<p>Thank you to the many organizations and individuals that have helped ROR get to this point and that are continuing to join us on this journey!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Since its launch in 2019, ROR has been charting a unique path toward sustainability. In this blog post, we want to provide an update for the community on where this path is taking us.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">What's in a Name? Handling Name Metadata in ROR</title><id>https://doi.org/10.71938/7ac6-v827</id><link rel="alternate" href="https://ror.org/blog/2022-08-31-name-metadata-in-ror/"/><published>2022-08-31T00:00:00+00:00</published><updated>2025-11-05T16:17:42-08:00</updated><author><name>Adam Buttrick</name><uri>https://orcid.org/0000-0003-1507-1031</uri></author><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>The ROR registry includes unique IDs and associated metadata records for 100,000+ research organizations. Each one of these organizations has at least one name. Some of them have multiple names (one even has 15)! Organization names in ROR appear in 120 different languages and counting.</p>
<p>Names in ROR are a key piece of metadata that helps with discovery and disambiguation of ROR records. Because organization names frequently change over time and because they can be written in multiple ways and in multiple languages, maintaining name metadata in ROR is an important and complex part of the registry&rsquo;s curation process.</p>
<p>One of the most common questions we receive is about how we handle this process, so we thought it would be helpful to put together this blog post to share some observations and discuss our current practices with regard to organization names in ROR.</p>


<h2 id="background"><strong>Background</strong> 
</h2>
<p>ROR first launched in 2019 using seed data from GRID, and subsequent registry updates were coordinated through and <a href="https://ror.org/blog/2019-07-02-ror-development-update/">synced to GRID</a> while we worked to develop infrastructure and practices for maintaining the registry. ROR completed its multi-year transition to an independent community-based curation model in <a href="https://ror.org/blog/2022-03-17-first-independent-release/">March 2022</a>, officially diverging from GRID at this time.</p>
<p>Curation of ROR is an ongoing process and helps to ensure that the registry is as comprehensive as possible, that registry records are discoverable and usable, and that organization metadata is correct and reliable.</p>
<p>Now that the registry is fully independent, we are able to be more responsive to and proactive about community feedback about registry additions and updates. The registry is updated on a rolling basis, with new releases available approximately once a month, and the curation process is tracked publicly on <a href="https://github.com/ror-community/ror-updates#ror-updates">Github</a>.</p>


<h2 id="names-in-context-ror-aims-and-scope"><strong>Names in context: ROR aims and scope</strong> 
</h2>
<p>In order to understand how ROR handles organization names, it is important to understand the wider context of ROR&rsquo;s aims and scope.</p>
<p>ROR is a global, community-led registry of open persistent identifiers for research organizations. ROR was developed to provide an open and community-driven solution to the problem of identifying affiliations in research outputs. First and foremost, ROR is a <em>registry of persistent identifiers for research organizations</em>, and the registry&rsquo;s purpose is to <em>enable connections in scholarly infrastructure</em> between research organizations, research outputs, and researchers.</p>
<p>In this way, ROR differs from other types of databases about institutions, because it is uniquely focused on the use case of identifying affiliations. For example, ROR is not designed to be a registry of legal entity information about organizations, because an organization&rsquo;s legal name may not be the same as the organization name used as an affiliation — i.e., scholarship published by researchers from the University of California is not affiliated with &ldquo;Regents of the University of California&rdquo; (the university&rsquo;s legal name).</p>
<p>ROR is also not designed to promote or rank the value or quality of a given organization&rsquo;s research outputs. If an organization is included in ROR, it simply means that some research has been or will soon be associated with that organization. Inclusion in ROR does not imply anything about the quality or legitimacy of this research.</p>
<p>These nuances about ROR&rsquo;s aims and scope are important to understand because they influence ROR&rsquo;s approach to metadata curation. Let&rsquo;s turn to that topic next.</p>


<h2 id="the-role-of-metadata-in-ror"><strong>The role of metadata in ROR</strong> 
</h2>
<p>As noted above, ROR is a registry of persistent identifiers. However, an identifier string on its own is not very useful. This is where metadata comes in.</p>
<p>Every ROR ID includes descriptive metadata about the organization with which it is associated. This metadata includes the organization name, type, URL, and location, among other information. These details can be helpful in a couple of ways.</p>
<p>First, metadata is important for discovery. Let&rsquo;s say a user is trying to find the ROR ID for Harvard University. The ID can be found by searching ROR for the name &ldquo;Harvard,&rdquo; because this name appears in the descriptive metadata associated with the ID. Or let&rsquo;s say a user is interested in finding all of the ROR records that exist for organizations in Iceland. This information can be retrieved by querying the ROR API with a country filter that limits results to records that have &ldquo;Iceland&rdquo; in the country location field. Metadata is key to these discovery processes.</p>
<p>Metadata in ROR is also useful for disambiguation. Let&rsquo;s say we are looking for the ROR record for Northwestern University. If ROR only had basic name metadata, it would be difficult to identify the right record, because there are two records in ROR with the primary name &ldquo;Northwestern University.&rdquo; How do we know which one is which? Since ROR includes additional information about each organization, we can distinguish between the two records by looking at the URL and location information they include (one is in the United States and the other is in the Philippines, by the way).</p>
<p>Metadata is therefore an important, useful, and necessary part of ROR. An organization might change over time, but its ROR identifier will not. ROR metadata needs to be able to reflect these changes. ROR metadata also needs to be understood in the context of ROR&rsquo;s purpose as an open, community-driven registry of persistent identifiers. ROR metadata should not be interpreted as serving as the canonical authority on a given organization.</p>


<h2 id="how-names-are-represented-in-ror"><strong>How names are represented in ROR</strong> 
</h2>
<p>Now let&rsquo;s turn to the question of how names are used in ROR.</p>
<p>A given ROR record may have multiple names associated with it. ROR actually has four different <a href="https://github.com/ror-community/ror-updates/wiki/ROR-Metadata-Policies">metadata fields</a> that capture information about names:</p>
<ul>
<li>The primary form of the organization name when used as an affiliation (<code>name</code>)</li>
<li>Translations of the primary form of the affiliation name in other languages (<code>labels</code>)</li>
<li>Alternate versions or previous versions of the name (<code>aliases</code>)</li>
<li>Acronyms or initialisms in use for the name (<code>acronyms</code>)</li>
</ul>
<p><em>Example ROR record with multiple names</em>
<figure><img src="/img/name-metadata-example.png" width="55%"/>
</figure>
</p>
<p>All of these fields can be useful because collectively they support the goals of discovery and disambiguation discussed above. We believe that having robust metadata for names is key for this reason. When a record has less metadata, it can be harder for users to discover the ID they are looking for, or it can be challenging to disambiguate between two similar records.</p>
<p>When we are creating a new record in ROR, or updating metadata in an existing record, we make decisions about which names appear in which fields based on details provided by the organization (or the person submitting feedback about the organization&rsquo;s record), information available on the organization&rsquo;s website and other official channels, and evidence of research activities associated with the organization. We aim to make sure that the name metadata in the record reflects the current state of the organization as well as current affiliation usage.</p>
<p>For example, let&rsquo;s say we are working on the record for an organization whose official name is &ldquo;University of ROR,&rdquo; but we identify many published articles that have the affiliation written as &ldquo;ROR University.&rdquo; In order to make sure these articles can be associated with the correct ROR ID, we include this name variation — even though it is not official — as an alias in the record.</p>


<h2 id="common-themes-with-name-metadata"><strong>Common themes with name metadata</strong> 
</h2>
<p>As we curate the ROR registry, we spend a lot of time making sure that metadata for organization names is accurate, up to date, and comprehensive. A few common themes come up in this work. </p>


<h3 id="language">Language 
</h3>
<p>As mentioned above, ROR was built with seed data from the GRID database of institutions. GRID had a policy of defaulting to English-language names in the primary name field. Because ROR was initially synced to GRID, ROR inherited this metadata and the majority of organization records have an English-language name in the primary name field.</p>
<p>Now that ROR is being maintained independently, we have received feedback that this practice is not always appropriate in certain situations. We therefore have been publishing organization records with non-English primary names when this is requested. We are also performing a more comprehensive analysis of our records to identify other changes that should be made to existing metadata.</p>
<p>One exception to this practice is that the primary name field currently supports Latin characters only according to the ROR schema. We will be looking into changing this as part of a larger effort to evolve the ROR schema. Another challenge is that the primary name field does not currently include language tags. This is another area that we would like to improve in future schema changes. </p>


<h3 id="affiliation-names-vs-legal-names">Affiliation names vs. legal names 
</h3>
<p>Because ROR&rsquo;s focus is on identifying affiliations, the name used in an organization&rsquo;s ROR record might be different from that organization&rsquo;s legal name. When making decisions about the primary name, we look to published research outputs and scholarly databases to understand current usage of the organization name as an affiliation. If it is not clear that the legal form of the name is actively used as an affiliation, it may not be included in the metadata record. </p>


<h3 id="name-variations">Name variations 
</h3>
<p>Many ROR records include translations of the organization&rsquo;s name in multiple languages, or shortened or alternate versions of the primary name. We sometimes receive requests to remove these variations from the record. However, our general practice is to retain these variations. As discussed throughout this piece, the metadata in ROR serves to support discovery and disambiguation; it is not meant to serve as the canonical name authority for a given organization. If there is evidence of research associated with those versions of the name, the metadata in ROR reflects this. </p>


<h2 id="future-directions-for-name-metadata-in-ror"><strong>Future directions for name metadata in ROR</strong> 
</h2>
<p>As we continue to improve the quality and coverage of organization metadata in ROR, we plan to focus especially on name metadata. This will be a key component of future schema changes, which we will be exploring in the coming months with input from community stakeholders.</p>
<p>Another area of focus for the future is analyzing current name metadata across the registry to identify opportunities for metadata enrichment at scale. For example, we are looking at all records where the primary name is not in English and <a href="https://github.com/ror-community/ror-updates/issues/720">reviewing them for the completeness and quality of a corresponding English label</a>.</p>
<p>We also work to support those who are integrating ROR in their systems to help them develop implementations that can leverage the full breadth of name metadata (and additional information) in ROR. Implementation guidance is available on the <a href="https://ror.readme.io/docs/forms">ROR support site</a>. </p>


<h2 id="bringing-it-all-together"><strong>Bringing it all together</strong> 
</h2>
<p>To sum up these thoughts: in this discussion about handling name metadata in ROR, we aim to underscore a few key concepts:</p>
<ul>
<li>ROR&rsquo;s primary purpose is to be a <strong>registry of persistent identifiers</strong> for research affiliations</li>
<li>Metadata serves to <strong>support discovery and disambiguation</strong> of ROR records</li>
<li>Metadata in ROR is <strong>not the same thing as legal metadata</strong> about an organization</li>
<li>ROR records are more useful when they have <strong>more metadata</strong></li>
<li>Names in ROR are a key part of the metadata for each record, and it is <strong>useful to have multiple names</strong> associated with each record</li>
<li>ROR is a <strong>community-curated registry</strong> that is responsive to community feedback</li>
</ul>
<p>We hope this explanation has provided interesting insights into the work we do at ROR to support rich and useful metadata associated with every ROR ID.</p>
<p>Curation is an ongoing effort and we invite community users to become more involved in this work through participation in the curation advisory board and/or through community discussions.</p>
<p>If you have questions or feedback, or if you would like to get more involved in curating ROR records, please get in touch via <a href="mailto:support@ror.org">support@ror.org</a>.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The ROR registry includes unique IDs and associated metadata records for 100,000+ research organizations. Each one of these organizations has at least one name.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Welcome Amanda French, Technical Community Manager</title><id>https://doi.org/10.71938/cqcw-fk94</id><link rel="alternate" href="https://ror.org/blog/2022-06-13-welcome-amanda-french/"/><published>2022-06-13T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>We are thrilled to introduce a new member of the ROR pride: <a href="https://amandafrench.net">Amanda French</a> has joined ROR as our full-time Technical Community Manager.</p>
<p>Amanda is a well-known community manager and project director in the digital humanities and scholarly
communication sphere. Most recently, she served as Community Lead at <a href="https://covidtracking.com/">The COVID Tracking Project</a> at <em>The Atlantic</em>, where she helped build and nurture a community of more than 800 volunteers dedicated to collecting and publishing key COVID-19 data. Prior to that, she directed the Mellon-funded project <a href="https://mellon.org/grants/grants-database/grants/the-george-washington-university/11600667/">&ldquo;Resilient Networks for Inclusive Digital Humanities&rdquo;</a> at the George Washington University Libraries, directed the Digital Research Services unit at <a href="https://lib.vt.edu/">Virginia Tech University Libraries</a>, led the <a href="http://thatcamp.org">THATCamp unconference initiative</a> at George Mason University&rsquo;s Roy Rosenzweig Center for History and New Media, and was a member of the first cohort of Council on Library and Information Resources (CLIR) Postdoctoral Fellows. She often speaks and sometimes writes about openness in scholarly publishing, crowdsourcing, Agile, digital humanities, and related topics.</p>
<p>Amanda will be working with ROR&rsquo;s <a href="https://ror.org/governance">three operating organizations</a> and the broader ROR community to promote and support the adoption of ROR in systems used throughout research and scholarly communications workflows. This includes engaging with new and existing ROR adopters, as well as with other community stakeholders, to understand their workflows and systems and to guide their implementations and integrations.</p>
<p>We are excited to have Amanda join the team at such a critical time for ROR&rsquo;s growth and expansion!</p>
<p>As a reminder, there are a number of resources available to learn about integrating ROR and working with ROR IDs, and there are many ways to get involved in community activities and discussions about ROR integrations. These include:</p>
<ul>
<li><a href="https://ror.readme.io">Technical support documentation</a></li>
<li><a href="https://groups.google.com/a/ror.org/g/ror-tech">Technical support discussion forum</a></li>
<li>List of <a href="https://bit.ly/ror-integrations">current integrations</a> (fill out <a href="https://bit.ly/ror-integration-form">this form</a> if your integration is not listed)</li>
<li>Community advisory group and adoption working group (<a href="info@ror.org">email ROR</a> to request to join)</li>
</ul>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We are thrilled to introduce a new member of the ROR pride: Amanda French has joined ROR as our full-time Technical Community Manager.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR Welcomes Three New Steering Group Members</title><id>https://doi.org/10.71938/gn84-p107</id><link rel="alternate" href="https://ror.org/blog/2022-04-11-new-steering-group-members/"/><published>2022-04-11T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is excited to welcome three new Steering Group members: Lautaro Matas, Kathleen Shearer, and Chris Shillum.</p>
<p>Inaugurated in late 2019, the ROR Steering Group is an advisory body that supports the registry&rsquo;s strategic planning and decision-making activities. Steering group members provide feedback on ROR strategy and help drive awareness and adoption of ROR IDs through their respective networks and communities. The Steering Group comprises ROR&rsquo;s operating organizations (California Digital Library, Crossref, and DataCite), who serve in a permanent capacity, as well as members representing external organizations, who serve rotating terms of 2-4 years.</p>
<p>The new members of the ROR Steering Group join those who formed the original cohort <a href="https://ror.org/blog/2019-11-22-meet-the-ror-steering-group/">announced at the end of 2019</a>. One member of the original cohort, Ina Smith, has stepped down as of 2022 for personal reasons. The current list of members is as follows (new members in 2022 are identified with an asterisk):</p>
<ul>
<li><strong>Matt Buys</strong>, <a href="https://datacite.org">DataCite</a></li>
<li><strong>John Chodacki</strong>, <a href="https://cdlib.org">California Digital Library</a></li>
<li><strong>Daniel Hook</strong>, <a href="https://digital-science.com">Digital Science</a></li>
<li><strong>Clifford Lynch</strong>, <a href="https://cni.org">Coalition for Networked Information</a></li>
<li><strong>Ritsuko Nakajima</strong>, <a href="https://ror.org/00097mb19">Japan Science and Technology Agency</a></li>
<li><strong>Lautaro Matas</strong>*, <a href="https://www.lareferencia.info/">LA Referencia</a></li>
<li><strong>Ed Pentz</strong>, <a href="https://www.crossref.org">Crossref</a></li>
<li><strong>Judy Ruttenberg</strong>, <a href="https://arl.org">Association of Research Libraries</a></li>
<li><strong>Kathleen Shearer</strong>*, <a href="https://www.coar-repositories.org/">Confederation of Open Access Repositories</a></li>
<li><strong>Chris Shillum</strong>*, <a href="https://orcid.org">ORCID</a></li>
</ul>
<p>In addition to the Steering Group, ROR engages community input and participation through other groups and channels. A larger <a href="https://ror.org/community/">community advisory group</a>, which is open to anyone, meets bimonthly to discuss ROR activities, share implementations and integrations, and give feedback on upcoming work. ROR&rsquo;s <a href="https://ror.org/registry/#curation-advisory-board">curation advisory board</a> reviews proposed changes to ROR and maintains ROR&rsquo;s curation policies and workflows. A publisher adoption working group brings together those working on or planning ROR integrations in publishing systems.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is excited to welcome three new Steering Group members: Lautaro Matas, Kathleen Shearer, and Chris Shillum.
Inaugurated in late 2019, the ROR Steering Group is an advisory body that supports the registry&amp;rsquo;s strategic planning and decision-making activities.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR-ing On Our Own: Announcing Our First Independent Registry Update</title><id>https://doi.org/10.71938/1f0g-a647</id><link rel="alternate" href="https://ror.org/blog/2022-03-17-first-independent-release/"/><published>2022-03-17T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us"><![CDATA[

<h2 id="the-ror-registry-has-grown"><strong>The ROR registry has grown!</strong> 
</h2>
<p>We are excited to announce a new ROR registry release. With this update, the registry has grown to 102,559 records.</p>
<p>This update is really exciting for many reasons. But we know some of you may be eager to skip ahead and check out the update right away. If that&rsquo;s you, here are the quick links:</p>
<ul>
<li><a href="https://api.ror.org/organizations">ROR API</a></li>
<li><a href="https://ror.org/search">ROR search</a></li>
<li><a href="https://doi.org/10.5281/zenodo.6347575">ROR data dump</a></li>
<li><a href="https://github.com/ror-community/ror-updates/releases/tag/v1.0">Release notes</a></li>
<li><a href="https://ror.readme.io">Documentation</a></li>
</ul>


<h2 id="a-key-milestone-for-ror"><strong>A key milestone for ROR</strong> 
</h2>
<p>This week&rsquo;s update represents a significant milestone for ROR and for the broader community, for a few reasons. </p>


<h3 id="ror-is-now-officially-independent-from-grid">ROR is now officially independent from GRID 
</h3>
<p>This registry update is the first one that ROR has handled completely on its own. This means it is the first version of the registry in which some ROR records will not have an equivalent GRID ID.</p>
<p>Many of you know the backstory, but here is a recap of where we started and how we got to this point: ROR first launched in 2019 using seed data from GRID, which had been selected as the foundational data source for ROR to start with and then build upon independently and with community input. As ROR began building the technical infrastructure to support independent management of the registry, we coordinated updates via GRID and kept the two registries in sync while working toward a point of divergence.</p>
<p>Although ROR always planned to diverge from GRID as an independent, community-oriented registry, GRID&rsquo;s future plans were not clear until mid-2021, when it announced that it would <a href="https://ror.org/blog/2021-07-12-ror-grid-the-way-forward/">retire from the public space at the end of the year and officially pass the torch to ROR</a>. <em>Note: If you are mapping GRID IDs to ROR IDs and need guidance, <a href="https://ror.readme.io/docs/mapping">this is a good place to start</a>.</em></p>
<p>We are excited to have finally reached this long-awaited milestone of full independence!</p>


<h3 id="ror-has-fully-operationalized-its-unique-community-based-curation-model">ROR has fully operationalized its unique community-based curation model 
</h3>
<p>Launching with the GRID dataset as a seed file gave ROR a jump-start and meant that it did not have to build the registry from scratch. However, one aspect of ROR that was not part of the initial launch was a model for how the registry would be updated and maintained over time. This is because we wanted the community to start using ROR as quickly as possible and we wanted to make sure that we developed an approach to curation that was in line with community needs and community input.</p>
<p>After ROR launched, we held meetings with community stakeholders to discuss different approaches to curation and figure out what made the most sense for ROR. This led to the development of a community-based curation model that:</p>
<ul>
<li><strong>Decentralizes the feedback process</strong> (anyone can submit feedback about registry additions or updates)</li>
<li><strong>Centralizes the curation process</strong> through a community curation board (to ensure consistency and integrity of metadata across the registry)</li>
<li><strong>Makes the entire process open and transparent</strong> (documenting pending changes in a public Github repository)</li>
</ul>
<p>We began piloting this curation model while ROR was still coordinating updates through GRID. The registry update this week represents the first end-to-end completion of the new community-based process.</p>
<p>Read more about ROR&rsquo;s <a href="https://github.com/ror-community/ror-updates#readme">curation model and current workflow</a>. </p>


<h3 id="ror-is-now-in-a-position-to-continue-developing-the-registry-further">ROR is now in a position to continue developing the registry further 
</h3>
<p>While ROR was fortunate to be able to coordinate some registry updates through GRID in the beginning years after our launch, this arrangement was never intended to be a permanent solution. The dependency on GRID also limited our autonomy over the curation process and registry model. Now that we have our own independent infrastructure for maintaining ROR, we have the ability to explore further changes to the registry structure and functionality that community stakeholders have been interested in seeing.</p>


<h2 id="where-were-going-next"><strong>Where we&rsquo;re going next</strong> 
</h2>
<p>While this release marks the culmination of lots of hard work, it is also just the beginning. Now that the ROR registry is officially being updated independently, we are excited to explore how we can further improve the existing dataset and help people use it.</p>
<p>For starters, there is some core functionality that was not in scope for our MVP launch but needs to be implemented as soon as possible. This includes the ability for ROR to handle records that have become inactive. ROR needs to be able to merge and redirect records to reflect organizational changes, and it needs to be able to deprecate IDs when an organization shuts down. While ROR IDs will never be deleted, it&rsquo;s still important that the registry reflects changes to an organization&rsquo;s status.</p>
<p>It is also clear from community feedback that the ROR data model needs to evolve, which is possible to do now that it is not tied to GRID. For example, we frequently get questions about the various name fields that ROR supports, which express different versions of organizations&rsquo; names (versions in multiple languages, aliases and historical names, and acronyms). Name fields are important in ROR for representing organizational affiliations and for optimizing discovery and disambiguation. We think that ROR can and should improve how these fields are handled so that this metadata is as useful and as accurate as possible for our global community of users and integrators. We will be looking into this and other potential adjustments to the data model in consultation with the ROR curation advisory board and community members.</p>
<p>We also know that as ROR adoption continues to increase, ROR integrators need to be able to count on having comprehensive data and on being able to easily report issues or gaps in our records. We expect to see a greater need to support registry updates in bulk, and to establish automated ways of collecting feedback from system integrations. </p>


<h2 id="questions-you-might-have"><strong>Questions you might have</strong> 
</h2>
<p>We hope that you are excited as we are about this milestone. We also know you might have some questions about what it means. Here are some things you might be wondering about and a few things that might be helpful to keep in mind:</p>
<ul>
<li><strong>How can I check what is in this release?</strong> You can see a summary of the records added or updated in the <a href="https://github.com/ror-community/ror-updates/releases/tag/v1.0">release notes</a>. You can look up the IDs and metadata in the ROR production service via the <a href="https://ror.org/search">search UI</a> (<a href="https://ror.readme.io/docs/web-search">documentation</a>), <a href="https://api.ror.org/organizations">API</a> (<a href="https://ror.readme.io/docs/rest-api">documentation</a>), and <a href="https://doi.org/10.5281/zenodo.6347575">data dump</a> (<a href="https://ror.readme.io/docs/data-dump">documentation</a>).</li>
<li><strong>Has the data model changed from previous releases?</strong> This release preserves ROR&rsquo;s existing <a href="https://ror.readme.io/docs/fields">data structure</a>, which was inherited from GRID. For the new and updated records included in the release, there are some slight differences in how we handle certain values in the location data (within the &ldquo;addresses&rdquo; array) because we are now pulling directly from Geonames based on a given location ID, and this was not always the case for previous releases.</li>
<li><strong>I submitted a request - why isn&rsquo;t it in this release?</strong> This release (v1.0) includes addition and update requests that ROR received from late 2020 through the end of 2021, along with a few that were submitted in early 2022. If you submitted a request and it was not included in this release, this could be because it was submitted in the past couple of months. Another reason is that your request may have been determined to be out of scope for ROR. If so, the Github issue should explain why. Some requests were also not included in this release because we did not have the functionality to process them. For instance, since we do not yet have the ability to merge, redirect, and deprecate records, these types of requests are currently on hold.  </li>
<li><strong>Why did it take so long for this release to be completed?</strong> We know that some of you submitted requests a while ago and expected a faster turnaround. While we were able to initiate workflows for reviewing and approving requests shortly after ROR&rsquo;s launch, we needed to build brand-new infrastructure to support the technical process by which the registry is updated (which includes generating and validating metadata, generating new ROR IDs, linking related records to each other, indexing the data, and deploying to our production services). This work ended up taking longer than expected for our small, minimally resourced team. Now that we have reached the critical milestone of publishing our first release with the new infrastructure, we assure you that the next releases will not take as long to complete!</li>
<li><strong>So, when is the next release going to be?</strong> Going forward, we plan to publish new releases on a regular and predictable basis, approximately quarterly. This is similar to the frequency with which the public GRID data used to be updated (which ROR would sync to). We will communicate more details about the release schedule when these are firmed up.</li>
<li><strong>There&rsquo;s something wrong in the data - can you fix it?</strong> If you have feedback about a specific organization record (whether it was part of this release or not), the best way to let us know is to submit the form available <a href="https://ror.org/curation">here</a>. If you have questions or feedback about the registry data overall, please <a href="mailto:info@ror.org">get in touch</a> or post your question in the <a href="https://groups.google.com/a/ror.org/g/ror-api-users">technical support discussion forum</a>. Do keep in mind that ROR IDs do not change when a record is updated, so you can still use a ROR ID even if the metadata needs to be corrected.</li>
</ul>


<h2 id="join-us-in-growing-the-registry-even-further"><strong>Join us in growing the registry even further</strong> 
</h2>
<p>If you&rsquo;re excited about this milestone, or if you have opinions about the registry and what&rsquo;s in it, you can get involved in helping to make sure ROR is as useful, comprehensive, and up-to-date as possible. You can do this by:</p>
<ul>
<li>Submitting <a href="https://ror.org/curation">feedback about registry additions or updates</a> (please review the scope and criteria before submitting)</li>
<li>Getting involved as a member of the <a href="https://github.com/ror-community/ror-updates/wiki/ROR-Curation-Advisory-Board-Overview">ROR curation advisory board</a> (contact <a href="mailto:info@ror.org">info@ror.org</a> to nominate yourself or someone else)</li>
<li>Submitting <a href="https://github.com/ror-community/ror-roadmap/discussions">feedback on new features</a> you would like to see</li>
<li>Sign up your organization as a <a href="https://ror.org/sustain">sustaining supporter</a> to ensure ROR has the resources to maintain and improve the registry&rsquo;s open infrastructure over the long term</li>
</ul>


<h2 id="a-note-of-thanks"><strong>A note of thanks</strong> 
</h2>
<p>Reaching this milestone would not have been possible without the involvement and contributions of many individuals and organizations.</p>
<p>In the first place, huge thanks to the team that brought this release to fruition:</p>
<ul>
<li>ROR Technical Lead <strong>Esha Datta</strong> developed the infrastructure and workflow for generating new and updated metadata records, indexing them in our API, and deploying the changes to production. </li>
<li><strong>Liz Krznarich</strong> stepped into the Technical Lead role in February when Esha returned to her primary role at Crossref, and handled the final pieces of preparing the release and setting the foundation for future ones. </li>
<li>ROR&rsquo;s new Metadata Curation Lead, <strong>Adam Buttrick</strong>, joined the team in January and quickly developed tools and workflows for reviewing, preparing, and testing the updates in this release. </li>
<li>ROR&rsquo;s <strong><a href="https://ror.org/registry/#curation-advisory-board">curation advisory board</a></strong> has been working for the past year to review and approve changes and to develop policies and workflows to guide these decisions.</li>
</ul>
<p>Second, thanks are also due to <strong>Digital Science</strong>. ROR would not have such a comprehensive and useful dataset today if we had not been able to start with the seed file from GRID and coordinate updates through GRID for the past few years.</p>
<p>ROR was also fortunate to receive crucial assistance for this project through a <a href="https://www.imls.gov/grants/awarded/lg-246305-ols-20">grant from the Institute of Museum and Library Services</a> (IMLS). We thank <strong>IMLS</strong> for supporting ROR&rsquo;s community-based approach to curation!</p>
<p>Last but certainly not least, thank you to the community members who have been supporting ROR through this transition. Whether you have submitted requests for new or updated records or contributed to early discussions about ROR&rsquo;s approach to curation, we know you have been eager for ROR to reach this milestone. We acknowledge and appreciate your patience!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The ROR registry has grown! We are excited to announce a new ROR registry release. With this update, the registry has grown to 102,559 records.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Let's Fund Open Infrastructure Together</title><id>https://doi.org/10.71938/etgw-fp65</id><link rel="alternate" href="https://ror.org/blog/2022-02-28-help-sustain-ror/"/><published>2022-02-28T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us"><![CDATA[

<h2 id="ror-begins-a-new-round-of-community-fundraising"><strong>ROR begins a new round of community fundraising</strong> 
</h2>
<p>Since ROR launched in 2019, we have been charting a path to sustainability that leverages our broad community network and diversifies our funding sources. ROR is currently funded through a combination of in-kind support from its three operating organizations, project-based grant funds, and financial contributions from community members.</p>
<p>While ROR aims to minimize overhead and contain costs, it still requires resources to build and maintain the registry&rsquo;s infrastructure, especially as adoption continues to grow. ROR has been working to establish independent revenue streams that complement ROR&rsquo;s in-kind support, avoid dependence on grant funds, and ensure the registry data remains openly available.</p>
<p>This year, ROR is initiating a new round of community fundraising. Building on the <a href="/blog/2019-10-16-help-sustain-ror">community fundraising campaign</a> we ran during 2019-2021, we are renewing a call for organizations to commit to supporting ROR financially. We are launching a Sustaining Supporters program that opens up new ways for organizations to participate in the collective funding of ROR.</p>


<h2 id="ror-sustaining-supporters-program"><strong>ROR Sustaining Supporters program</strong> 
</h2>
<p>With the Sustaining Supporters program, organizations are encouraged to support ROR&rsquo;s operating expenses on a recurring annual basis. Any organization that signs up to support ROR through the end of 2022 will be recognized as a Founding Supporter and receive a supporter badge that can be displayed on their website.</p>
<p>We want to make the process of contributing to ROR as easy as possible. To ensure this is the case, organizations can support ROR at any amount that works for their budget and capacity. Also, to simplify the invoicing process, organizations that are already members of <a href="https://www.crossref.org">Crossref</a> or <a href="https://datacite.org">DataCite</a> can choose to receive an invoice directly from Crossref and DataCite for their ROR contributions. However, if organizations prefer, they can also be invoiced directly from ROR.</p>


<h2 id="why-support-ror"><strong>Why support ROR</strong> 
</h2>
<p>ROR aims to be an example of the power and potential of community-funded open infrastructure. ROR is committed to providing open, stakeholder-governed infrastructure for research organization identifiers and associated metadata. Implementation of ROR IDs in scholarly infrastructure and metadata enables more efficient discovery and tracking of research outputs across institutions and funding bodies.</p>
<p>The Sustaining Supporters program is the next step in ROR&rsquo;s sustainability journey. ROR is continuing to explore future potential paid service tiers designed for those organizations and companies that rely heavily on our infrastructure, which would complement the supporters program. However, rest assured that any paid services will not impact the availability of ROR data or our commitment to supporting our community, in line with our commitment to the <a href="https://openscholarlyinfrastructure.org/">Principles of Open Scholarly Infrastructure (POSI)</a>.</p>
<p>We&rsquo;ve all seen key infrastructure components disappear, be enclosed, or get acquired. We are also realistic about how much effort and cost is involved in sustaining key components of open infrastructure that the scholarly community depends on. And we are committed to doing this right. That means not just sustaining core infrastructures, but investing in them so that they can evolve alongside community needs.</p>
<p>ROR is a free resource for the research community. However, this shared infrastructure does require a collective funding approach that can sustain it as a common good.</p>


<h2 id="join-us"><strong>Join us!</strong> 
</h2>
<p>This is an exciting moment to be part of ROR&rsquo;s growth. Let&rsquo;s fund open infrastructure together!</p>
<p>If your organization is interested in supporting ROR and helping to fund open, community-led infrastructure, <a href="https://ror.org/sustain/">sign up here</a>.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR begins a new round of community fundraising Since ROR launched in 2019, we have been charting a path to sustainability that leverages our broad community network and diversifies our funding sources.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR Is Hiring a Technical Community Manager</title><id>https://doi.org/10.71938/rk8p-cy85</id><link rel="alternate" href="https://ror.org/blog/2022-02-18-hiring-technical-community-manager/"/><published>2022-02-18T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is hiring! We are looking for a full-time Technical Community Manager to expand the adoption and integration of ROR throughout the global scholarly communications ecosystem.</p>
<p><a href="https://ror.org">ROR</a> (Research Organization Registry) is a community-led initiative to develop an open, sustainable, usable, and unique identifier for every research organization in the world. It is jointly managed by <a href="https://datacite.org">DataCite</a>, <a href="https://www.crossref.org">Crossref</a>, and <a href="https://cdlib.org">California Digital Library</a>. Each of these operating organizations provides input on decisions and strategies that support the growth and sustainability of ROR. Our goal is to address the problem of tracking affiliations in research communications. The Technical Community Manager will be a key driver of that change.</p>
<p>The Technical Community Manager will work closely with the small and committed core ROR team, staff from the three operating organizations, and the broader ROR community, to promote and support the adoption of ROR in systems used throughout research and scholarly communications workflows. This includes engaging with new and existing ROR adopters and other community stakeholders to understand their workflows and systems, and to guide their implementations and integrations.</p>
<p>This position will be employed by Crossref as a full-time staff member and included in all Crossref staff activities. It is fully remote, so location and hours are flexible but overlap with the US Pacific timezone will be necessary. As pandemic circumstances allow, we expect to resume a small amount of international travel for meetings and events.</p>


<h2 id="responsibilities"><strong>Responsibilities</strong> 
</h2>


<h4 id="1-build-strategies-to-drive-adoption-and-technical-implementation"><strong>(1) Build strategies to drive adoption and technical implementation</strong> 
</h4>
<ul>
<li>Identify opportunities and challenges for adoption and integration of ROR in key research and scholarly communications systems and workflows</li>
<li>Develop and implement strategies to encourage and support the adoption and integration of ROR into scholarly communication systems and workflows, including showcasing exemplars and promoting best practices</li>
<li>Develop measures/benchmarks to assess and communicate adoption progress both internally and to the ROR community</li>
</ul>


<h4 id="2-lead-technical-community-engagement-efforts"><strong>(2) Lead technical community engagement efforts</strong> 
</h4>
<ul>
<li>Organize regular meetings, webinars, and other events for new and current integrators</li>
<li>Engage with the ROR community to develop and encourage integration best practices</li>
<li>Build and maintain tools and documentation that meet the needs of key communities</li>
<li>Engage with the ROR community to identify and build consensus around evolving needs</li>
<li>Provide first-line support and troubleshooting help to adopters</li>
</ul>


<h4 id="3-cultivate-and-manage-relationships-with-adopters"><strong>(3) Cultivate and manage relationships with adopters</strong> 
</h4>
<ul>
<li>Introduce ROR to potential adopters you have identified, such as specific publishers, funders, repositories, research institutions, and the service providers and developers that offer platforms to those organizations</li>
<li>Maintain ongoing communications with adopters through regular check-ins to ensure their integration work is well-supported</li>
<li>Consult with adopters to recommend ROR integration approaches for their particular system/use case, collaborating with other ROR team members as needed</li>
<li>Communicate feedback about adopter needs to the ROR team</li>
</ul>


<h4 id="4-contribute-to-the-development-and-implementation-of-overall-ror-strategies"><strong>(4) Contribute to the development and implementation of overall ROR strategies</strong> 
</h4>
<ul>
<li>Based on community needs, identify specifications for improvements and new features</li>
<li>Collaborate with ROR team on product development strategy</li>
<li>Work with ROR and Crossref teams to develop and implement strategies that support wider adoption</li>
</ul>


<h2 id="skills-and-experience"><strong>Skills and experience</strong> 
</h2>
<ul>
<li>Community management experience, particularly in an international environment.</li>
<li>An understanding that community management needs a mix of interpersonal, technical, program management, program development, and communication skills. The <a href="https://doi.org/10.5281/zenodo.4437294">CSCCE skills wheel</a> is a good resource to explore.</li>
<li>While we don&rsquo;t ask for a specific number of years&rsquo; experience, entry-level candidates are unlikely to be successful in the role.</li>
<li>Sufficient technical skills to advise adopters on integrations, including experience with making and troubleshooting requests to RESTful APIs and familiarity with XML and JSON data structures (or technical aptitude and a desire to learn!)</li>
<li>Deep knowledge of research and scholarly communications systems and workflows, and familiarity with the academic research environment </li>
<li>Familiarity with research infrastructure and the open science landscape </li>
<li>Familiarity with a not-for-profit environment and the transparency that entails</li>
<li>Ability to work remotely with small distributed teams across global time zones</li>
<li>Strong, compelling, and clear written, oral, and visual communication</li>
<li>Self-motivated to succeed, take initiative, and seek continuous improvement</li>
</ul>


<h2 id="working-at-ror--crossref"><strong>Working at ROR &amp; Crossref</strong> 
</h2>
<p>As a young start-up initiative, ROR is a dynamic place to work as we are growing quickly and laying the groundwork for long-term sustainability. We are a fun community (we do actually roar sometimes 🦁) but we also take our work seriously! ROR&rsquo;s three operating organizations work closely together and everyone contributing to ROR balances the needs of ROR with those of their home organization. We also work closely with ROR adopters and community stakeholders through working groups and advisory boards and aim for all of these activities to be open and transparent, in line with the <a href="https://principlesofopenscholarlyinfrastructure.org">Principles of Open Scholarly Infrastructure</a> (POSI).</p>
<p>ROR has a Project Lead based at CDL and a Metadata Curation Lead contracted with Crossref. Our previous Adoption Lead has moved over to become our new full-time Technical Lead based at DataCite. We’re now reshaping the previous adoption role on the ROR team as this Technical Community Manager. This is a full-time role and you will be employed at Crossref. This means you will need to balance being part of two teams: ROR; and Crossref.</p>
<p>Crossref is committed to supporting ongoing professional development opportunities and promoting self-learning for its <a href="https://www.crossref.org/people">40+ people</a>. Crossref&mdash;and ROR&mdash;are dedicated to an open and fair research ecosystem and that’s reflected in our ethos and staff culture.”</p>


<h2 id="thinking-of-applying"><strong>Thinking of applying?</strong> 
</h2>
<p>We especially encourage applications from people with backgrounds historically underrepresented in research and scholarly communications.</p>
<p>The role will be accountable to the <a href="https://ror.org/governance/">ROR operations team</a> and within Crossref will report to Ginny Hendricks who will review applications along with Project Lead Maria Gould and Technical Lead Liz Krznarich. Candidates who meet the qualifications will be invited to a 30-minute screening call. Those subsequently shortlisted will be invited to a 90-minute online interview which will include an exercise you&rsquo;ll have a chance to prepare for.</p>
<p>To apply, please send a CV and covering letter explaining how your skills match ROR&rsquo;s goals to <a href="mailto:jobs@ror.org">jobs@ror.org</a> by 16th March 2022. Interviews will take place in late March/early April. </p>


<h3 id="equal-opportunities-commitment"><strong>Equal opportunities commitment</strong> 
</h3>
<p>Crossref and ROR are committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants for employment without regard to race, colour, religion, sex, pregnancy or a condition related to pregnancy, sexual orientation, gender identity or expression, national origin, ancestry, age, physical or mental disability, genetic information, veteran status, uniform service member status, or any other protected class under applicable law. Crossref and ROR will make reasonable accommodations for qualified individuals with known disabilities in accordance with applicable law.</p>
<p><strong>Thanks for your interest and we look forward to hearing from you!</strong></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is hiring! We are looking for a full-time Technical Community Manager to expand the adoption and integration of ROR throughout the global scholarly communications ecosystem.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">A New Year at ROR</title><id>https://doi.org/10.71938/gcgc-n083</id><link rel="alternate" href="https://ror.org/blog/2022-02-14-new-year-at-ror/"/><published>2022-02-14T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>The end of January 2022 marked the third anniversary of <a href="/blog/2019-02-10-announcing-first-ror-prototype/">ROR&rsquo;s launch</a> at PIDapalooza 2019 in Dublin. Continuing with a tradition from the prior two years (<a href="/blog/2020-02-10-ror-ing-in-portugal/">2020</a> in Lisbon, and <a href="/blog/2021-02-03-ror-annual-meeting/">2021</a> on Zoom), we marked the occasion with a community celebration to reflect on ROR&rsquo;s progress and discuss the work that lies ahead.</p>
<p>The 2022 ROR annual meeting consisted of 2 virtual sessions with around 100 community members from 26 different countries. We reviewed developments from 2021, discussed upcoming milestones in 2022, <a href="https://www.youtube.com/watch?v=wBBMM_2M1X8&amp;list=PL4n_Cvd0PpoHfsM3_6VfhAovGIfL3Z79x">heard from ROR adopters</a> about how ROR IDs are being implemented in a variety of settings, and brainstormed ideas for the future.</p>
<p><em>Snapshot of community feedback on future directions for ROR</em>
<figure><img src="/img/ror-meeting-hopes.png" width="75%"/>
</figure>
</p>
<p>Read on for a recap of the meeting&rsquo;s highlights. Some of these will be covered in more detail in forthcoming blog posts, so stay tuned!</p>


<h2 id="changes-on-the-ror-team">Changes on the ROR team 
</h2>
<p>The new year for ROR also came with some changes on the ROR team. First of all, we are pleased to introduce <strong>Adam Buttrick</strong> as the newest member of the ROR pride. Adam has been hired as ROR&rsquo;s metadata curation lead to coordinate ongoing updates and improvements to the registry and work closely with ROR&rsquo;s <a href="/governance/#curation-advisory-board">curation advisory board</a> to develop and implement ROR&rsquo;s community-based curation model. A librarian and developer based in Los Angeles, USA, Adam previously worked as a data developer for the Getty Conservation Institute, as an implementation manager for OCLC&rsquo;s Metadata Services, and for the University of Michigan&rsquo;s Art, Architecture, and Engineering Library. We are excited and fortunate to have Adam working on the many aspects of ROR&rsquo;s curation!</p>
<p>ROR&rsquo;s technical team is undergoing a transition this month as our current Technical Lead, <strong>Esha Datta</strong>, is returning full-time to Crossref in March having been on loan to ROR for the past couple of years. Esha worked on the original ROR prototype back in 2018 and more recently developed ROR&rsquo;s new infrastructure to support independent curation of the registry. We are grateful for Esha&rsquo;s many contributions and sorry to see her go, but we know Crossref is eager to have her back.</p>
<p>ROR development work will remain in good hands as <strong>Liz Krznarich</strong> steps into the role of full-time Technical Lead. Based at DataCite, Liz joined the ROR team in late 2020 as our first Adoption Manager and has developed extensive <a href="/blog/2021-07-21-ror-resources-roundup">guidance</a> to support ROR users and <a href="/integrations">integrators</a>. With her keen understanding of how community stakeholders are relying on ROR, and her prior experience as Technical Lead at ORCID, Liz brings a broad skillset to the position. We will start recruiting soon to backfill Liz&rsquo;s adoption-focused position.</p>


<h2 id="updates-to-registry-data-coming-soon">Updates to registry data coming soon 
</h2>
<p>ROR is preparing to release a new update to the registry. This update will be ROR&rsquo;s first independent release following the <a href="/blog/2021-07-12-ror-grid-the-way-forward">planned separation from GRID</a>. We know that many organizations have been waiting patiently over the past year for new ROR IDs to be created or for existing records to be corrected or updated. The complex infrastructure work required to support independent updates to the registry has taken longer than expected and we appreciate everyone&rsquo;s patience. We will be communicating more details soon about the specific timing of the release and what it includes, as well as what to expect in future releases.</p>


<h2 id="rors-journey-to-sustainability">ROR&rsquo;s journey to sustainability 
</h2>
<p>As we have <a href="/blog/2020-08-20-ror-progress-update">discussed previously</a>, ROR has been charting a journey to sustainability that involves several components. ROR is <a href="/supporters">supported</a> through a combination of in-kind support from operating organizations, time-limited grant funds, and community contributions. In the coming years, ROR also plans to launch optional paid services. ROR&rsquo;s sustainability strategy is geared at leveraging diverse sources of funding and establishing a broad base of community support. In 2022, ROR is focused on expanding its community funding model by formalizing a sustaining supporters program for organizations to participate in a collaborative approach to supporting ROR&rsquo;s open infrastructure. More information about the program will be announced shortly. In the meantime, organizations wishing to contribute - or see which organizations have already signed on to support ROR - can do so <a href="/supporters">here</a>.</p>
<p>2021 was a busy year for ROR and 2022 is already off to a busy start! Thank you as always for your support.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The end of January 2022 marked the third anniversary of ROR&amp;rsquo;s launch at PIDapalooza 2019 in Dublin. Continuing with a tradition from the prior two years (2020 in Lisbon, and 2021 on Zoom), we marked the occasion with a community celebration to reflect on ROR&amp;rsquo;s progress and discuss the work that lies ahead.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Job Posting: ROR Technical Lead</title><id>https://doi.org/10.71938/jx9x-ka20</id><link rel="alternate" href="https://ror.org/blog/2021-12-08-hiring-technical-lead/"/><published>2021-12-08T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is looking for a Senior Software Engineer to join as our Technical Lead, based at DataCite. You will be leading, developing, and architecting ROR services and infrastructure. ROR is a collaborative effort with resourcing from multiple organizations. The ROR Technical Lead will work on the ROR project according to priorities agreed with the ROR Operations Team, and will be part of the DataCite development team, and report to DataCite&rsquo;s Engineering Director.</p>


<h1 id="about-ror">About ROR 
</h1>
<p>ROR is a community-led project to develop an open, sustainable, usable, and unique identifier for every research organization in the world.</p>
<p>ROR <a href="https://ror.org/2019-02-10-announcing-first-ror-prototype">launched in January 2019</a> for use by the research community, for the purposes of increasing the use of organization identifiers in the community and enabling connections between organization records in various systems. Implementation of ROR IDs in scholarly infrastructure and metadata enables more efficient discovery and tracking of research outputs across institutions and funding bodies.</p>
<p>ROR is run by a small group of steering organizations in collaboration with a broad network of community advisors and supporters. Read more about <a href="https://ror.org/governance">ROR&rsquo;s governance structure</a> and <a href="https://ror.org/supporters">community supporters</a>.</p>


<h1 id="about-datacite">About DataCite 
</h1>
<p>DataCite is a global community-led organization. Our mission is to be a world leading provider of persistent identifier services to help make research outputs and resources findable, citable, connected and reused globally.</p>


<h2 id="how-we-work">How we work 
</h2>
<p>We are a fully remote team, this means we do not have a physical office and we work where it is most comfortable. The team is distributed across Europe and North America, supporting a global community of organizational members.</p>
<p>All our work is publicly available on <a href="https://github.com/ror-community">https://github.com/ror-community</a> and <a href="https://github.com/datacite">https://github.com/datacite</a></p>


<h1 id="responsibilities-and-duties">Responsibilities and duties 
</h1>
<ul>
<li>Lead the ROR technical strategy in consultation with various stakeholders, including the ROR Operations Team</li>
<li>Collaborate closely with the ROR team on day to day activities, including the ROR Project Lead, Adoption Manager and Data Manager</li>
<li>Participate as an active member of the ROR Operations Team (including stakeholders from the governing organizations)</li>
<li>Collaborate with the ROR team to develop specifications and prioritize development tasks</li>
<li>Develop full-stack product features in an open source software environment, including user-facing front-end services and backend APIs</li>
<li>Refactor applications and infrastructure as needed  and to ensure services run smoothly and reliably</li>
<li>Test, deploy and monitor applications, and react to problems</li>
<li>Collaborate with the rest of the DataCite Engineering team and follow best practices and processes in place  to provide a quality and iterative environment</li>
<li>Support the DataCite development team in shared services</li>
<li>Advocate for product quality, security and performance</li>
<li>Follow industry best practices for highly scalable web applications and services, and work with the team to provide suggestions for improvement</li>
<li>Develop and maintain documentation for common operational tasks</li>
</ul>


<h1 id="requirements">Requirements 
</h1>
<ul>
<li>Experience architecting and leading global projects</li>
<li>Preferably based in a timezone that aligns with the Americas</li>
<li>Proven experience as a Full stack Developer</li>
<li>Proficiency with fundamental front-end technologies, HTML/CSS/Javascript</li>
<li>3+ years of experience with server-side languages such as Python or Ruby</li>
<li>Hands-on experience architecting, developing, deploying, and maintaining web applications, including system administration, monitoring, log management, and support</li>
<li>Familiarity with relational databases and search engines such as ElasticSearch</li>
<li>Familiarity with cloud computing and DevOps practices including Infrastructure as Code</li>
<li>Excellent communication skills, and the flexibility that is needed for working in a small, distributed team</li>
</ul>


<h1 id="nice-to-have">Nice to have 
</h1>
<ul>
<li>Experience with AWS, Terraform and Git/Github</li>
<li>Experience with Django or similar</li>
<li>Strong interest in open science and data sharing</li>
<li>Experience with performance investigation and optimization</li>
<li>Understanding of code review processes</li>
<li>Good technical documentation skills</li>
<li>Computer science education or equivalent experience</li>
<li>Interest to learn and use other programming languages</li>
<li>Familiarity with events driven architecture, distributed systems, serverless architecture and async processing</li>
</ul>
<p>Above all what we value are smart and engaged software engineers who care about the work they do and the quality of the end result, this makes up for any lack of experience you may have in any of the above requirements.</p>


<h1 id="equal-opportunities-commitment">Equal opportunities commitment 
</h1>
<p>DataCite is committed to a policy of non-discrimination and equal opportunity for all employees and qualified applicants for employment without regard to race, color, religion, sex, pregnancy or a condition related to pregnancy, sexual orientation, gender identity or expression, national origin, ancestry, age, physical or mental disability, genetic information, veteran status, uniform service member status, or any other protected class under applicable law. DataCite will make reasonable accommodations for qualified individuals with known disabilities in accordance with applicable law.</p>


<h1 id="why-work-for-us">Why work for us 
</h1>
<ul>
<li>Full-time, fully remote position</li>
<li>Healthcare and retirement benefits in alignment with local conventions</li>
<li>30 days vacation time annually and 1 day paid extra for volunteer work of your choice</li>
<li>Flexible working hours</li>
<li>Option to work in co-working with a paid contribution from us.</li>
<li>Opportunity to learn something new every day, we are an evolving community and team</li>
<li>Adaptable - we embrace change and uncertainty and work together to adapt to new situations</li>
</ul>


<h1 id="to-apply">To apply 
</h1>
<p>Please send a resume and statement of interest to <a href="mailto:sarala@datacite.org">sarala@datacite.org</a>. We only accept applications by individuals.</p>
<p>Interviews will be scheduled from the end of December 2021. We would like the candidate to start in this role on February 1, 2022. Please don&rsquo;t hesitate to contact us with any questions.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is looking for a Senior Software Engineer to join as our Technical Lead, based at DataCite. You will be leading, developing, and architecting ROR services and infrastructure.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ORCID Integrates ROR</title><id>https://doi.org/10.71938/jspc-9949</id><link rel="alternate" href="https://ror.org/blog/2021-10-22-orcid-integrates-ror/"/><link rel="related" href="https://info.orcid.org/add-research-institution-identifiers-with-ror/"/><published>2021-10-22T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Tom Demeranville</name><uri>https://orcid.org/0000-0003-0902-4386</uri></author><author><name>Paula Demain</name><uri>https://orcid.org/0000-0001-9389-7387</uri></author><author><name>Dan Dineen</name><uri>https://orcid.org/0000-0003-1281-5891</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us"><![CDATA[

<h2 id="add-research-institution-identifiers-with-ror">Add Research Institution Identifiers with ROR! 
</h2>
<p>As part of our ongoing commitment to increase value to our researchers and our members, we are pleased to announce that the <a href="https://ror.org">Research Organization Registry (ROR)</a> has been added to the ORCID Registry as a disambiguated Organization ID. ROR can now be used with the API and our Affiliation Manager to record research institution identifiers and metadata, thus making it easier to track the impact of institutional research. ORCID’s own staff affiliations have now been updated to include a ROR ID!</p>
<p><em>Example of ROR in an ORCID record</em>
<figure><img src="/img/ror-in-orcid-record.png"/>
</figure>
</p>
<p><em>Example API code for ROR</em>
<figure><img src="/img/ror-in-orcid-api.png"/>
</figure>
</p>


<h2 id="a-short-history-of-rors-inception">A short history of ROR’s inception 
</h2>
<p>The scholarly research community depends on a series of open identifier and metadata infrastructure systems to accurately identify and disambiguate researchers, their work, and the institutions that support that work.</p>
<p>To help complete a missing piece of the infrastructure puzzle, <a href="https://ror.org/about//history">ROR was launched in 2019</a>. It  is now a collaborative initiative run by <a href="https://cdlib.org">California Digital Library (CDL)</a>, <a href="https://www.crossref.org">Crossref</a>, and <a href="https://datacite.org">DataCite</a>.</p>


<h2 id="why-ror-why-now">Why ROR? Why now? 
</h2>
<p>There are numerous ways data is used within the persistent identifier (PID) ecosystem, and affiliation is the key. Knowing where research is coming from, accessing clean and consistent data, and being able to report on the output of institutional research is central to many research stakeholders.</p>
<p>Because ROR is completely open with their datasets and API, it is a natural fit with ORCID’s mission and vision and helps simplify a complex mix of metadata systems. With persistent IDs and metadata for more than 102,000 organizations—and with these IDs also supported in other systems like Crossref and DataCite—ROR further enables research outputs to be more effectively linked to institutions.</p>


<h2 id="a-look-ahead">A Look Ahead 
</h2>
<p>When thinking about the future, it becomes clear that the future is now. The PID ecosystem has been developing over the past decade, and the organizational metadata provided by ROR is a critical piece of the PID puzzle.</p>
<p>Having a core set of open, trusted, globally unique IDs makes research more discoverable and more useful to everyone within the ecosystem. Now that more pieces are in place, more tools can be built to extract meaningful information from all of the PID connections.</p>
<p>Here at ORCID, we continue with our <a href="https://trello.com/c/JEkqoTb5/67-epic-integrate-ror-research-organization-registry-and-rationalize-organization-ids">work to integrate ROR into the Registry</a>. With our focus now switched from the API to the UI, we know that choosing an organization from the current dropdown list in the record can be confusing for users due to duplicates or formatting issues. We plan to streamline this experience to include ROR data as the default when users are manually adding affiliations to their ORCID record. This will result in a simplified experience for the record holder and will also mean the majority of manually-added affiliations will be linked to a ROR identifier.</p>
<p>If you need assistance adopting ROR within your ORCID integration, please reach out to your Consortial Lead or your Engagement Lead.</p>
<p><em>Tom is ORCID&rsquo;s Product Director, he ensures our technology is responsive to our community and aligned with our mission.</em></p>
<p><em>Paula is ORCID&rsquo;s Product Manager, responsible for gathering, managing and refining the requirements of the ORCID user community, with a specific focus on members and integrators.</em></p>
<p><em>Dan is our UX designer. He works making sure we have usable, accessible products that are a joy to use.</em></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The Research Organization Registry (ROR) has been added to the ORCID Registry as a disambiguated Organization ID. ROR can now be used with the ORCID API and Affiliation Manager to record research institution identifiers and metadata, thus making it easier to track the impact of institutional research.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Now Hiring: Metadata Curation Lead</title><id>https://doi.org/10.71938/8k5f-tp29</id><link rel="alternate" href="https://ror.org/blog/2021-10-01-hiring-curation-lead/"/><published>2021-10-01T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Team Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is looking for a data manager to lead metadata curation activities for the registry, including coordinating registry updates and maintenance, working with ROR’s community curation advisors, and developing and implement long-term curation policies and practices. This is an exciting time to join the ROR team and support ROR&rsquo;s emerging independent, community-based curation model.</p>
<p>The job description and application instructions follow below. Please consider applying or help spread the word to your networks.</p>


<h1 id="ror-data-manager">ROR Data Manager 
</h1>
<p>The Research Organization Registry (ROR) is a community-led registry of open, sustainable, usable, and unique identifiers for every research organization in the world.</p>
<p>ROR is a cross-organization collaborative led by California Digital Library, Crossref, and DataCite. Each organization provides input on decisions and overall strategy that support the growth and sustainability of ROR by engaging in outreach to funders, potential adopters, and other stakeholders in their respective networks.</p>
<p>The ROR registry includes IDs and metadata for more than 100,000 research organizations. ROR receives regular input from its global community of users and stakeholders about records that should be added to or updated in the registry. ROR also seeks and incorporates input from the broader community about policies and processes for maintaining registry data. ROR has implemented a community-based curation workflow for processing registry feedback and implementing registry updates.</p>
<p>ROR is looking for a data manager to lead metadata curation activities for the registry, including coordinating registry updates and maintenance, working with ROR’s community curation advisors, and developing and implement long-term curation policies and practices.</p>


<h2 id="responsibilities">Responsibilities 
</h2>


<h3 id="1-coordinate-community-based-curation-processes">(1) Coordinate community-based curation processes 
</h3>
<ul>
<li>Triage incoming requests to prepare for community review</li>
<li>Oversee community review process to make sure requests are reviewed in a timely and accurate manner</li>
<li>Optimize curation workflow as needed to improve experience for requestors and community curators</li>
<li>Maintain guidance and documentation for community curators</li>
<li>Schedule and facilitate regular curator meetings</li>
<li>Onboard and offboard community curators</li>
</ul>


<h3 id="2-coordinate-registry-updates">(2) Coordinate registry updates 
</h3>
<ul>
<li>Maintain regular schedule of registry updates</li>
<li>Identify which updates will be in a given release</li>
<li>Prepare metadata for records being added to or updated in ROR</li>
<li>Work with curators handling metadata records</li>
<li>Ensure metadata records pass validation/QA</li>
<li>Review and test release candidates</li>
<li>Deploy changes to production via Github-based workflow</li>
<li>Publish release notes and announce releases to users</li>
<li>Make public data dump available on Zenodo</li>
<li>Gather data and generate reports on curation processes to track volume, turnaround times, types of requests, etc.</li>
</ul>


<h3 id="3-metadata-management-and-qa">(3) Metadata management and QA 
</h3>
<ul>
<li>Maintain schema documentation and metadata policies</li>
<li>Analyze current registry data to identify opportunities for metadata QA and future improvements</li>
<li>Work with ROR’s development team on schema updates and metadata clean-up</li>
</ul>


<h3 id="4-community-engagement">(4) Community engagement 
</h3>
<ul>
<li>Respond to support questions about curation policies and release timelines</li>
<li>Provide updates at community calls and webinars</li>
<li>Collect feedback from community to inform curation policies and process</li>
</ul>


<h2 id="required-skills-and-qualifications">Required skills and qualifications 
</h2>
<ul>
<li>University degree</li>
<li>Experience in metadata curation</li>
<li>Experience in data analysis</li>
<li>Experience in workflow development and optimization</li>
<li>Experience with Github and Markdown</li>
<li>Experience working with RESTful APIs and related web services/technologies</li>
<li>Ability to work with small distributed remote teams across time zones</li>
<li>Ability to lead small groups</li>
<li>Strong, compelling, and clear written, oral, and visual communication</li>
<li>Self-motivated to succeed and take initiative and seek continuous improvement</li>
</ul>


<h2 id="desired-skills">Desired skills 
</h2>
<ul>
<li>Outreach experience</li>
<li>Data science skills</li>
<li>Familiarity with scholarly research infrastructures and the open science landscape</li>
<li>Familiarity with product management</li>
<li>Experience working in an international environment</li>
</ul>


<h2 id="position-details">Position details 
</h2>
<p>This is a part-time contracted position (up to 20 hours a week) for two years. This position will be in contract with Crossref but will be accountable to the three organizations that operate ROR (Crossref, California Digital Library, and DataCite).</p>
<p>This is a remote position. The ROR team is distributed across Europe and North America. Some travel (1-3 times/year) may be expected depending on global public health guidelines.</p>


<h2 id="to-apply">To apply 
</h2>
<p>To apply, send a resume and statement of interest to <a href="mailto:jobs@ror.org">jobs@ror.org</a>. Applications will be accepted until the position is filled. Review of applications will begin October 25.</p>


<h2 id="equal-opportunity-statement">Equal opportunity statement 
</h2>
<p>ROR, and the collaborating organizations behind it, California Digital Library, Crossref, and DataCite, are committed to non-discrimination and making opportunities available to anyone without regard to race, color, religion, sex, pregnancy or a condition related to pregnancy, sexual orientation, gender identity or expression, national origin, ancestry, age, physical or mental disability, genetic information, veteran status, uniform service member status, or any other protected class under applicable law.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is looking for a data manager to lead metadata curation activities for the registry, including coordinating registry updates and maintenance, working with ROR’s community curation advisors, and developing and implement long-term curation policies and practices.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">New ROR Update and Final GRID Sync</title><id>https://doi.org/10.71938/a5fg-7e94</id><link rel="alternate" href="https://ror.org/blog/2021-09-29-ror-grid-final-sync/"/><published>2021-09-29T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>In July, ROR and GRID <a href="/blog/2021-07-12-ror-grid-the-way-forward">announced</a> that the two registries would begin to diverge in Q4 2021 following GRID&rsquo;s final public release.</p>
<p>GRID has now completed its latest and final public release (<a href="https://grid.ac/downloads">available here</a>). ROR has incorporated the updates in this release into its own data. With these two releases, there is a 1:1 correspondence of GRID IDs to ROR IDs, and vice versa.</p>
<p>The new ROR dataset includes IDs and metadata for 102,392 organizations.</p>
<ul>
<li>Access the <a href="https://doi.org/10.5281/zenodo.5534442.">ROR data dump</a></li>
<li>Query the <a href="https://api.ror.org/organizations">ROR API</a></li>
<li>Search the <a href="https://ror.org/search">ROR registry</a></li>
</ul>
<p>ROR has always planned to become independent of GRID. Since the July announcement, ROR has been working with community members to support their needs and use cases for transitioning from GRID to ROR.</p>


<h2 id="how-can-i-switch-from-grid-to-ror">How can I switch from GRID to ROR? 
</h2>
<p>The following resources are a good starting point for those transitioning from GRID to ROR. If you have questions, please post in the <a href="https://groups.google.com/a/ror.org/g/ror-api-users">ROR Tech Support discussion group</a> so other users can be part of the conversation.</p>
<ul>
<li><a href="https://ror.readme.io/docs/grid">ROR/GRID transition FAQ</a></li>
<li><a href="https://ror.readme.io/docs/rest-api">Technical docs</a> about ROR tools and services</li>
<li><a href="https://ror.readme.io/docs/mapping">Implementation guides</a> for mapping GRID as well as other identifier types to ROR</li>
<li><a href="https://github.com/ror-community/ror-utilities">Utility scripts</a> for working with the ROR API</li>
</ul>


<h2 id="when-will-rors-first-independent-release-happen">When will ROR&rsquo;s first independent release happen? 
</h2>
<p>ROR is developing its infrastructure to support independent updates to the registry. The timeframe for ROR&rsquo;s first independent release is not yet finalized but we will continue to share information with the community about the projected schedule.</p>
<p>Anyone can submit requests to ROR to add or update records in the registry. ROR is implementing a community-based curation model to review and process requests. You can read more about the curation model and follow specific requests on <a href="https://github.com/ror-community/ror-updates/issues">Github</a>.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In July, ROR and GRID announced that the two registries would begin to diverge in Q4 2021 following GRID&amp;rsquo;s final public release.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR Resources Roundup</title><id>https://doi.org/10.71938/4beg-0319</id><link rel="alternate" href="https://ror.org/blog/2021-07-21-ror-resources-roundup/"/><published>2021-07-27T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Liz Krznarich</name><uri>https://orcid.org/0000-0001-6622-4910</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR adoption is ramping up, and we’ve been hard at work during the past few months creating resources (or should we say <em>ROR</em>-sources?!) to support those of you integrating ROR into your systems. We’re excited to share the following new resources with you:</p>


<h2 id="support-site--code-examples">Support site &amp; code examples 
</h2>
<p>ROR is all grown up and has its very own <a href="https://ror.readme.io">support site</a>!</p>
<p>On this site you’ll find:</p>
<ul>
<li><a href="https://ror.readme.io/docs/rest-api">Technical docs</a> about ROR tools and services</li>
<li><a href="https://ror.readme.io/docs/mapping">Implementation guides</a> with tips for using these tools/services to address specific use cases</li>
<li>Links to relevant code examples in the new <a href="https://github.com/ror-community/ror-utilities">ror-utilities Github repository</a> and out in the world</li>
</ul>
<p>If you can’t find what you need or get stuck along the way, please post on the <a href="https://groups.google.com/a/ror.org/g/ror-api-users">ROR Tech Support Google Group</a> (formerly ROR API Users - we recently changed the name to make it clear that we welcome all sorts of tech questions/discussion).</p>
<p>This site is a work in progress, and we’ll be adding to it in the coming months. We welcome your feedback! If you notice an issue on a specific page, please use the Suggest Edits button to let us know. If you have more general suggestions, contact us at <a href="mailto:info@ror.org">info@ror.org</a></p>


<h2 id="display-guidelines">Display guidelines 
</h2>
<p>ROR IDs are intended to be primarily internal-facing, but earlier this year we heard from a few adopters in the community who were eager to feature ROR IDs in their user interfaces and asked us for recommendations. After several rounds of review and community input, we’re now happy to share the final version of the <a href="/display-guidelines">ROR Display Guidelines</a>.</p>
<p>An important point to note is that ROR IDs are still considered primarily internal-facing, and displaying ROR IDs to end users is not ideal for many applications (and certainly not required). <strong>We recommend that you carefully consider whether displaying ROR IDs publicly adds value for your community. If so, then these new guidelines are for you!</strong></p>


<h2 id="interactive-integrations-list">Interactive integrations list 
</h2>
<p>Curious about who’s using ROR and how they’re using it? Check out our <a href="/integrations">updated integrations list</a>, which includes more details than our previous list as well as options to sort and filter by integration type, status and other facets.</p>
<ul>
<li><strong>Is your integration missing?</strong> Tell us about it via the <a href="https://bit.ly/ror-integration-form">integration info form</a></li>
<li><strong>Is your integration on the list, but in need of updates?</strong> Let us know at <a href="mailto:info@ror.org">info@ror.org</a></li>
</ul>
<p><a href="/integrations"><figure><img src="/img/2021-07-ror-integrations.gif"/>
</figure>
</a></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR adoption is ramping up, and we’ve been hard at work during the past few months creating resources (or should we say ROR-sources?</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR and GRID: The Way Forward</title><id>https://doi.org/10.71938/4jrg-4e16</id><link rel="alternate" href="https://ror.org/blog/2021-07-12-ror-grid-the-way-forward/"/><published>2021-07-12T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>ROR Leadership Team</name><uri>https://ror.org/about/team/#operations-team</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Earlier today, <a href="https://www.digital-science.com/press-release/grid-passes-torch-to-ror/">GRID announced that it will discontinue its schedule of public releases in Q4 2021</a>. This decision marks an important and exciting milestone in the evolution of both organization registries.</p>
<p>ROR&rsquo;s core mission is to be a <em>community-led registry of open organization identifiers</em>. While GRID has maintained an open registry of organization identifiers available CC0 to the community since 2015, it did not intend to serve as a community-driven initiative. Therefore, it was a natural arrangement to jump-start ROR with seed data from GRID, and accept ongoing updates from GRID while developing ROR to ultimately function independently as the community registry of record. The plan has always been that ROR would inevitably need to be able to diverge from GRID in order to more fully address the requirements and use cases that come with maintaining a community-based initiative. GRID&rsquo;s recent decision aligns perfectly with the progress ROR has already made towards this goal.</p>
<p>Since its launch in 2019, ROR&rsquo;s <a href="https://ror.org/governance">operating organizations</a> have been working to shore up resourcing and infrastructure. ROR has installed a community-based <a href="https://ror.org/community/#community-advisory-group">advisory group</a> and <a href="https://ror.org/community/#steering-group">steering group</a>, secured <a href="https://ror.org/blog/2020-08-20-ror-progress-update/">grant funding and community donations</a>, created a <a href="https://ror.org/about">governance structure and sustainability model</a>, implemented basic <a href="https://github.com/ror-community/ror-updates#readme">community-based curation workflows</a>, and began building the necessary infrastructure to be able to deploy registry updates independently. During this time, ROR has also seen <a href="https://ror.org/integrations">wide adoption across the scholarly infrastructure landscape</a> and recognition as a core persistent identifier for organizations by publishers and repositories as well as by national research offices around the world (e.g., <a href="https://repository.jisc.ac.uk/id/eprint/7840">UK</a>, <a href="https://www.arl.org/news/new-report-provides-recommendations-for-effective-data-practices-based-on-national-science-foundation-research-enterprise-convening/">US</a>, <a href="https://www.nistep.go.jp/en/?p=4916">Japan</a>, <a href="https://www.nwo.nl/en/news/nwo-persistent-identifier-strategy-will-lead-increased-efficiency-and-insight">Netherlands</a>, <a href="https://www.ouvrirlascience.fr/deuxieme-plan-national-pour-la-science-ouverte/">France</a>, <a href="http://doi.org/10.5281/zenodo.5039512">Australia</a>). ROR has been working with community members to support them in <a href="https://ror.readme.io/docs/mapping">mapping ROR to other organizational identifiers</a>, including GRID IDs, as part of broader efforts to drive ROR adoption. ROR has also recently started a process to <a href="https://tinyurl.com/ror-request">receive requests from the community</a> to add and update ROR records.</p>
<p>Below we are cross-posting a joint statement from Digital Science and ROR about this transition. We are excited about the milestone that this represents for both registries.</p>
<hr>


<h2 id="grid-passes-the-torch-to-ror">GRID Passes the Torch to ROR 
</h2>
<p>In 2015 Digital Science first released the Global Research Identifier Database (GRID), an open database of unique research-related organisation identifiers they had developed in-house over several years, for public use by the research community. In 2019 ROR, the Research Organization Registry, was founded as a community-driven initiative, mirroring the GRID database. With ROR coming of age and becoming independent from GRID, Digital Science has decided to pass on the torch to ROR and retire GRID from the public space, with a last public release in Q4 of 2021.</p>
<p>This might come as a surprise, as GRID and ROR have been co-existing and collaborating for quite some time now. GRID was initially created to fill a void, as no open organisation identifier was available for the open research space. As a community-driven initiative has now built upon GRID&rsquo;s first initiative, two open organisation identifiers could be perceived as competing against each other. Digital Science has therefore decided to formally hand the torch over to ROR as the leading open organisation identifier. Digital Science will continue to use GRID internally- but focused on the Digital Science products and their users and clients.</p>
<p>In Q4 of 2021, Digital Science&rsquo;s GRID will discontinue its schedule of public releases in order to provide clarity and space for the Research Organization Registry (ROR) to develop further.  ROR was originally seeded by GRID data and supported by Digital Science as one of the founding partners.  It was always envisaged that ROR, being community-owned and driven, would eventually become the principal identity for institutions and now, after 3 years of &ldquo;dual-running&rdquo;, the moment has come for ROR to take on that mantle.  The inclusion of ROR in the DataCite, Crossref and ORCID datasets, along with the level of development of the ROR infrastructure means that ROR has reached a level of community adoption that will ensure its long-term place in the persistent identifier and data infrastructure environment.</p>
<p>Digital Science has supported the ROR initiative as a founding member ever since a chat with other stakeholders in Girona in 2018. When ROR launched, GRID provided the seed data of over 100,000 records. The GRID team has also been working to regularly update the data for the past three years to help jump-start ROR. For the last three years, significant efforts have taken place both within ROR and Digital Science to keep the two identifiers synchronised.  This next step in the evolution of ROR will allow it to diverge from GRID where needed. We are excited that we have reached this important milestone together!</p>
<p>ROR and Digital Science are committed to making the transition as smooth as possible for anyone who may wish to make the switch from GRID to ROR.</p>
<p>You can find resources about transitioning from GRID to ROR on the <a href="https://ror.readme.io/docs/mapping">ROR support site</a>, and more resources will be added soon. Please reach out directly to the <a href="mailto:contact@grid.ac">GRID Team</a> and the <a href="https://pidforum.org/c/ror-chat-room/16">ROR Team via the ROR PID Forum</a> Chat Room in case you have any questions, or see the FAQ section.</p>
<p>Digital Science and ROR are extremely excited to see how well the project has developed, and look forward to seeing how ROR is used in the future. The GRID and ROR teams would like to thank the academic community for their engagement and involvement.</p>
<p><a href="https://www.digital-science.com/blog/2021/07/grid-passes-torch-to-ror/">See additional FAQs about this transition here</a>.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Earlier today, GRID announced that it will discontinue its schedule of public releases in Q4 2021. This decision marks an important and exciting milestone in the evolution of both organization registries.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR’s 2020 Annual Report: A ROR-port on the Past Year and a Look to What’s Next</title><id>https://doi.org/10.71938/w58p-n751</id><link rel="alternate" href="https://ror.org/blog/2021-03-23-ror-annual-report/"/><published>2021-03-23T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Year in Review"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is a specific type of identifier and a specific type of initiative that does not always fit neatly into pre-defined notions and categories. The registry’s focus on providing an open, noncommercial, and community-driven solution to the problem of identifying research organization affiliations sets it apart from similar types of organization identifiers. The collaborative nature of ROR’s approach also sets it apart from other types of identifier providers and infrastructure organizations.</p>
<p>As an initiative, ROR is an effort <a href="/governance/">led by a group of trusted and stable organizations</a> in the research infrastructure community. This means that ROR is not an independent organization by design. ROR is forging a <a href="/blog/2020-08-20-ror-progress-update/">unique path to sustainability</a> that is not dependent on incorporating into a structure that would create unnecessary overhead or require a paid membership model that limits who can participate.</p>
<p>While it is fair to say that ROR strives to be different, it is also true that in order to achieve its goals, ROR should aim to hold itself accountable to a core set of qualities and standards common to many organizations and initiatives in the open research infrastructure space. One way in which ROR has sought to align itself with these common frameworks is by <a href="/blog/2020-12-16-aligning-ror-with-posi/">signing on to the Principles of Open Scholarly Infrastructure</a> (POSI).</p>
<p>Another way ROR seeks to frame and present its activities is by preparing our first-ever Annual Report (a “ROR-port,” if you will). Adopting the model of annual reports distributed by nonprofit organizations, this document aims to provide an overview of ROR’s progress in the past year and a preview of work to come. In this first ROR-port, we reflect back on 2020 and look ahead to 2021.</p>
<p><a href="https://doi.org/10.5281/zenodo.4620044">View the 2020 Annual Report</a></p>
<p>The 2020 report also includes the results of the 2020 ROR Community Survey, which was initiated in November and is expected to run on an annual basis going forward. The 2020 survey received 179 responses from all continents except for Antarctica but primarily Europe and North America.</p>
<figure><img src="/img/ror-survey-excerpts.png"/>
</figure>

<p>The results highlight the range of stakeholders interested in supporting and integrating ROR, as well as opportunities and challenges for continued engagement and adoption. This input is valuable for ROR as it continues to grow in the coming years.</p>
<p>Community is at the heart of ROR, and this means accountability, active participation, and consulting with each of you. There are many ways to be involved in ROR. Please get in touch!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is releasing our first-ever Annual Report. Adopting the model of annual reports distributed by nonprofit organizations, this document aims to provide an overview of ROR’s progress in 2020 and a preview of work to come in 2021.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR-ing Around the Clock at PIDapalooza21</title><id>https://doi.org/10.71938/3v41-7p13</id><link rel="alternate" href="https://ror.org/blog/2021-02-12-ror-pidapalooza21/"/><published>2021-02-12T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>In the same week that <a href="/blog/2021-02-03-ror-annual-meeting">ROR celebrated its third birthday</a>, <a href="https://pidapalooza.org">PIDapalooza</a> celebrated the fifth festival of persistent identifiers, also as a virtual event. Across three tracks, seven languages, and twenty-hour hours, PIDapalooza21 highlighted the latest updates from the wide world of persistent identifiers, with a focus how open infrastructure and rich metadata are key to harnessing the power of PIDs. As a prime example of open identifier infrastructure that is designed to enrich scholarly metadata, ROR featured prominently in many PIDapalooza21 sessions. Here are some highlights, with links to the recordings:</p>
<ul>
<li>A series of &ldquo;PIDs for Beginners&rdquo; sessions held in multiple languages included information about ROR in <a href="https://youtu.be/666QvvB6qJ0">Japanese</a>, <a href="https://youtu.be/c8b4aM_98cA">Chinese</a>, French, <a href="https://youtu.be/JoFs5CZyYg8">Portuguese</a>, <a href="https://youtu.be/r25OYsl6WNE">German</a>, and <a href="https://youtu.be/jav---epwkQ">English</a>.</li>
<li>Hindawi intern Alessandra Audino&rsquo;s presentation, &ldquo;<a href="https://youtu.be/6AwNY2VoK58">Connecting PIDs and Open Infrastructure to publisher workflows: Hindawi as a use case</a>,&rdquo; outlines the many use cases for publishers to incorporate organization identifiers into their workflows and systems, using ROR as a model.</li>
<li>In &ldquo;<a href="https://youtu.be/R4KBcLhnyVA">Extending ROR: Wag the Lion</a>,&rdquo; ROR community members Carolyn Grant, Quinn Hart, and Arthur Smith, shared their project to build an extension for ROR to identify departmental units within organizations</li>
<li>Natasha Simons and Fiona Murphy discussed <a href="https://youtu.be/gI3B7GNyTXs">RAiD</a>, the initiative to develop an identifier and infrastructure for research project identifiers, and how RAiD can use ROR to identify institutions.</li>
<li>In the session &ldquo;<a href="https://youtu.be/Vr8axfTCTDI">Positive people present their persistent pursuit of practically perfect priority PID partnerships</a>,&rdquo; Chris Brown and Josh Brown discussed efforts to develop a unified national approach to PIDs in the UK, including developing a list of priority PIDs for research infrastructure, including ROR.</li>
<li>Last but not least, Juan Pablo Alperin&rsquo;s <a href="https://youtu.be/1C9Ainp1Hu8">plenary session</a> took the form of an epic poem about PKP&rsquo;s approach to integrating PIDs, including ROR.</li>
</ul>
<p>ROR also presented &ldquo;The ABCs of ROR,&rdquo; a general orientation to the basics of ROR in PIDapalooza&rsquo;s PIDs 101 track geared at introducing newcomers to different identifiers. This session was run twice, once in <a href="https://youtu.be/MruefxeOwR0">English</a> and once in <a href="https://youtu.be/GSoN5gvbawc">Spanish</a>.</p>
<p>All of the PIDapalooza videos can be accessed on <a href="https://www.youtube.com/channel/UCna9Pn-eSt_CGSnZjS7eBXg">YouTube</a>.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In the same week that ROR celebrated its third birthday, PIDapalooza celebrated the fifth festival of persistent identifiers, also as a virtual event.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Unmute to ROAR! ROR's Third Annual (and First Virtual) Community Meeting</title><id>https://doi.org/10.71938/a9m0-z816</id><link rel="alternate" href="https://ror.org/blog/2021-02-03-ror-annual-meeting/"/><published>2021-02-03T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR had a birthday last week and marked the occasion just like anyone else celebrating a birthday during the pandemic: with a virtual party! More than eighty attendees from around the world came together for the now third annual ROR community meeting, held this year on Zoom across two sessions to reach ROR&rsquo;s global community in as many timezones as possible.</p>
<p>At the meeting, we reflected on milestones and highlights from the past year and discussed the goals, opportunities, and challenges that lie ahead. The two main areas of focus for ROR in 2021 are adoption and curation.</p>
<p>Integrating ROR IDs across our scholarly infrastructure is crucial for realizing the power and potential of ROR&rsquo;s open registry of identifiers for research organizations. Those interested in adopting ROR and/or working with ROR data need support and guidance. Please get in touch with ROR if you are working on any integrations or have any suggestions or feedback about how to support wider adoption of ROR, or if you are interested in joining a working group for ROR adopters or API users.</p>
<p>For those integrating ROR, guidelines for how (or even if) to display ROR IDs are also needed. We have developed draft guidance on best practices that is available for community feedback through 28 February. <a href="https://docs.google.com/document/d/1z8ZvIx_UkgzvnXBikX_Y7BmW4ozWsVnsU1WBvRSoDfc">Review the draft guidelines</a>.</p>
<p>The ability to maintain the data in ROR and curate it over time is also critical. ROR needs to become independent from GRID, which provided the seed data for the registry, and implement its own infrastructure and workflows for adding and updating records. ROR is developing community-based processes and policies for reviewing changes. Please get in touch with ROR if this is something you are interested in helping with.</p>
<p>While we wished we could have seen everyone in person for the 2021 annual meeting, the virtual format made it possible to bring even more people together who would not have ordinarily been able to travel to attend the meeting in person. Fortunately, we even figured out a way to ROAR together on Zoom!</p>
<figure><img src="/img/roring-on-zoom.png"/>
</figure>

<p>Thank you for being part of the ROR Community&mdash;whether you&rsquo;ve been part of the journey since the very beginning or are just now joining the pride. We&rsquo;re happy you&rsquo;re here.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR had a birthday last week and marked the occasion just like anyone else celebrating a birthday during the pandemic: with a virtual party!</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Aligning ROR with the Principles of Open Scholarly Infrastructure</title><id>https://doi.org/10.71938/n0kg-4k60</id><link rel="alternate" href="https://ror.org/blog/2020-12-16-aligning-ror-with-posi/"/><published>2020-12-16T00:00:00+00:00</published><updated>2026-02-26T10:44:17-05:00</updated><author><name>ROR Leadership Team</name><uri>https://ror.org/about/team/#operations-team</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>The scholarly community depends on a network of open identifier and metadata infrastructure. Content identifiers and contributor identifiers are foundational components of this network. But an additional component has long been missing from this picture: open, stakeholder-governed infrastructure for research organization identifiers and their associated metadata.</p>
<p>ROR launched in January 2019 with the specific aim of filling this gap. Our work is the culmination of several years of planning and collaboration across multiple organizations to develop a shared vision for an open registry of research organization identifiers.</p>
<p>ROR is now fully operational as a community-driven registry of open, sustainable, usable, and unique identifiers for every research organization in the world. ROR is a cross-organizational and multi-stakeholder initiative run by a small steering group in collaboration with a broad network of community advisors and supporters. The governing organizations are California Digital Library, Crossref, and DataCite. During ROR&rsquo;s startup phase, these organizations have worked closely with additional Steering Committee members from around the globe, including Digital Science, who were instrumental in kickstarting the registry with seed data from their GRID database.</p>
<p>Reflecting and reinforcing core principles of openness, transparency, and sustainability for research infrastructure has been a central aim and focus for ROR since the very beginning. As we approach the two-year anniversary of ROR&rsquo;s official launch, and as we are seeing greater adoption of ROR and implementation of open affiliation identifiers across the scholarly landscape, this is a key moment to evaluate how ROR has remained aligned to these principles and where it might need to focus and improve in the years ahead. The <a href="https://openscholarlyinfrastructure.org/">Principles of Open Scholarly Infrastructure</a> (POSI) provide a useful framework for this evaluation.</p>
<p>POSI&rsquo;s set of sixteen principles was proposed in 2015 with the aim of encouraging the scholarly community to more critically assess the infrastructure they rely on and hold each other to account. POSI has since been built upon and discussed by others in the community; last month, the principles were codified and now live on an independent website: <a href="https://openscholarlyinfrastructure.org">https://openscholarlyinfrastructure.org</a>.</p>
<p>In addition to codifying the principles, the website also aims to guide and encourage infrastructure organizations and initiatives to make a commitment to POSI. In recent weeks, both <a href="https://www.crossref.org/blog/crossrefs-board-votes-to-adopt-the-principles-of-open-scholarly-infrastructure/">Crossref</a> and <a href="https://blog.datadryad.org/2020/12/08/dryads-commitment-to-the-principles-of-open-scholarly-infrastructure/">Dryad</a> have responded to this by posting self-assessments and a public statement of intent to comply with POSI. ROR is following their lead in preparing its own evaluation, which appears below.</p>


<h2 id="understanding-the-principles">Understanding the principles 
</h2>
<p>The POSI framework focuses on three key areas for open infrastructure organizations and initiatives to garner the trust of the broader scholarly community: accountability (governance), funding (sustainability), and protection of community interests (insurance). As part of the self-assessment process, activities in these three areas are categorized as green (meets goal), yellow (partially meets goal, and/or effort to meet goal is in process), and red (does not meet goal) depending on how closely they align with POSI. To understand what this means in the context of ROR&rsquo;s evaluation, it is important to bear a few considerations in mind:</p>
<ol>
<li>
<p>If an item is marked as green, this doesn&rsquo;t mean ROR does this perfectly. It simply means that we have internal processes that focus on this commitment and we have evidence that these processes have been working thus far. It also does not mean that ROR&rsquo;s work in this area is done.</p>
</li>
<li>
<p>The principles must be considered as a whole; one is not more important than the other. It would be counterproductive to focus specifically on reaching one item&rsquo;s &ldquo;green&rdquo; status if it would have a detrimental impact on another commitment.</p>
</li>
<li>
<p>Adherence to the principles is an ongoing process. Since ROR has taken an innovative approach of offering open infrastructure through a multi-organization collaboration instead of a standalone organization, some of these principles are less applicable than others. Furthermore, it is newly implemented infrastructure and it will take time to meet all of the principles in an ideal way. This is to be expected.</p>
</li>
</ol>


<h2 id="governance-principles">Governance Principles 
</h2>
<figure><img src="/img/ror-posi-governance.png"/>
</figure>

<blockquote>
<p><em>Coverage across the research enterprise &ndash; it is increasingly clear that research transcends disciplines, geography, institutions and stakeholders. The infrastructure that supports it needs to do the same.</em></p>
</blockquote>
<p>As a global registry, ROR includes identifiers and metadata for close to 99,000 organizations in 217 countries. ROR covers many types of research organizations, including universities, government bodies, research facilities, medical facilities, funders, publishers, and research-producing companies. In this way, ROR IDs can be used to identify affiliations for many types of research activities.</p>
<p>ROR stakeholders also reflect a diversity of institutions and geographies. The organizations in ROR&rsquo;s Steering Group represent five different countries (United States, United Kingdom, Germany, South Africa, and Japan) and a range of organization types, including academic library, infrastructure provider, research policy, and government funder. The ROR Community Advisory Group includes members from 16 countries and a similarly heterogeneous mix of organization types, including libraries, research institutes, infrastructure providers, funders, and government agencies.</p>
<blockquote>
<p><em>Stakeholder Governed &ndash; a board-governed organisation drawn from the stakeholder community builds more confidence that the organisation will take decisions driven by community consensus and consideration of different interests.</em></p>
</blockquote>
<p>As a multi-organization collaboration, ROR is by design not a standalone organization and consequently has no official mechanism for board governance. However, the ROR Steering Group, which advises the strategic direction of ROR, is drawn from the wider stakeholder community. This group will continue to evolve as adoption and uptake of ROR grows as well.</p>
<blockquote>
<p><em>Non-discriminatory membership &ndash; we see the best option as an &ldquo;opt-in&rdquo; approach with a principle of non-discrimination where any stakeholder group may express an interest and should be welcome. The process of representation in day to day governance must also be inclusive with governance that reflects the demographics of the membership.</em></p>
</blockquote>
<p>As an open, community-based initiative, ROR has no membership model by design but rather operates on the basis of self-selected participation. Anyone is welcome to join the ROR Community Advisory Group and take part in meetings and discussions about ROR&rsquo;s progress and future directions. Participation is not determined by an individual or organization&rsquo;s location, resources, industry, or beliefs.</p>
<blockquote>
<p><em>Transparent operations &ndash; achieving trust in the selection of representatives to governance groups will be best achieved through transparent processes and operations in general (within the constraints of privacy laws).</em></p>
</blockquote>
<p>ROR&rsquo;s day-to-day operations are overseen by its governing organizations: California Digital Library, Crossref, and DataCite. These organizations have signed a memorandum of agreement that states their long-term resource commitments to ROR. This agreement has been discussed and approved by the leadership at each governing organization. As ROR approaches the start of its third year, we will continue to develop a framework to reflect the unique nature of ROR as a collaborative initiative, not an organization.</p>
<p>This is therefore an area for improvement. As ROR solidifies its operations, it plans to formalize and publicize operational documentation, such as financial and governance documents.</p>
<blockquote>
<p><em>Cannot lobby &ndash; the community, not infrastructure organisations, should collectively drive regulatory change. An infrastructure organisation&rsquo;s role is to provide a base for others to work on and should depend on its community to support the creation of a legislative environment that affects it.</em></p>
</blockquote>
<p>ROR began as a collaborative effort across multiple organizations in the scholarly research and research infrastructure community and continues to be run as a multi-stakeholder collaboration. ROR&rsquo;s community advisory groups meet regularly and provide a central focus and channel for ROR&rsquo;s strategic directions. As a registry of open identifiers for affiliations, ROR provides infrastructure that can be implemented and iterated upon in a variety of settings. ROR does not lobby nor does ROR include regulatory change as part of its remit. ROR&rsquo;s sole purpose is to offer data and tools to help its community of users solve problems and achieve their goals.</p>
<blockquote>
<p><em>Living will &ndash; a powerful way to create trust is to publicly describe a plan addressing the condition under which an organisation would be wound down, how this would happen, and how any ongoing assets could be archived and preserved when passed to a successor organisation. Any such organisation would need to honour this same set of principles.</em></p>
</blockquote>
<p>A signed agreement between ROR&rsquo;s governing organizations describes how ROR would be shut down and how responsibilities for coordinating ROR could be transferred to successors, if applicable. This plan includes how the ROR team would handle remaining assets to ensure long-term preservation, access, and stewardship of the registry. These assets would include code repositories, public data dumps, cash funds, and technical infrastructure. ROR data will always be open and available under a CC0 waiver. Transfer of responsibilities for management of the registry will not impact the openness and availability of ROR data.</p>
<blockquote>
<p><em>Formal incentives to fulfil mission &amp; wind-down &ndash; infrastructures exist for a specific purpose and that purpose can be radically simplified or even rendered unnecessary by technological or social change. If it is possible the organisation (and staff) should have direct incentives to deliver on the mission and wind down.</em></p>
</blockquote>
<p>As a relatively small and focused effort, ROR is already lean by design. By deliberately operating as an initiative and not forming an organization, ROR is more nimble and won&rsquo;t face a scenario in which an organization has to be dissolved. ROR identifiers will continue to be needed for as long as affiliation data is a core aspect of research infrastructure. The mission of ROR will only go away when this need goes away.</p>


<h2 id="sustainability-principles">Sustainability Principles 
</h2>
<figure><img src="/img/ror-posi-sustain.png"/>
</figure>

<blockquote>
<p><em>Time-limited funds are used only for time-limited activities – day to day operations should be supported by day to day sustainable revenue sources. Grant dependency for funding operations makes them fragile and more easily distracted from building core infrastructure.</em></p>
</blockquote>
<p>ROR is designed to be operated and sustained through a mix of funding sources. These include:</p>
<ul>
<li>In-kind contributions by ROR’s governing organizations</li>
<li>Contributory investments by community stakeholders</li>
<li>Grant funding for discrete project work</li>
<li>Optional paid service fees for organizations that require additional technical support, (any such fee would be based on services and not on access to ROR data, described in more detail below)</li>
</ul>
<p>ROR has a planned dependency on community fundraising and grants for two years, through the end of 2022. It is our plan to have this window to to shore up plans and technical infrastructure to support an optional service fee model geared towards power users of the registry.</p>
<blockquote>
<p><em>Goal to generate surplus – organisations which define sustainability based merely on recovering costs are brittle and stagnant. It is not enough to merely survive, it has to be able to adapt and change. To weather economic, social and technological volatility, they need financial resources beyond immediate operating costs.</em></p>
</blockquote>
<p>ROR is still in its startup phase and prides itself on running operations that are as lean as possible. During the startup phase, ROR’s governing organizations contribute in-kind support to maintain the resources needed to keep ROR running. Since its inception the banking and funds management of the ROR project has remained in sequestered financial reporting structures. The ROR budget is managed by the leadership of each governing organization and all accounts are easily auditable.</p>
<p>ROR understands that to maintain community trust, we must innovate and iterate over time. We also understand that innovation requires investment of additional resources. With financial reporting structures in place, ROR is well positioned to build a firm financial footing. ROR has explored and will continue to explore ways of building a surplus to maintain stability and ensure continuous innovation.</p>
<blockquote>
<p><em>Goal to create contingency fund to support operations for 12 months – a high priority should be generating a contingency fund that can support a complete, orderly wind down (12 months in most cases). This fund should be separate from those allocated to covering operating risk and investment in development.</em></p>
</blockquote>
<p>ROR’s aim is to provide a centralized curated registry for research organizations that is updated regularly and made openly available to the public on a regular basis so that if and when the ROR team is no longer able to maintain its current workflow the community will be able to make uses of the data because public data dumps will be available. Additional infrastructure overlaid on top of the data is handled by ROR’s governing organizations, who are bound to a memorandum of agreement that commits them to contributing in-kind resources to ROR. This agreement has also outlined how winding down and/or handing off would be coordinated within a specified timeframe of 120 days. ROR will look into further securing this contingency with further contractual obligations coordinated in the future.</p>
<blockquote>
<p><em>Mission-consistent revenue generation – potential revenue sources should be considered for consistency with the organisational mission and not run counter to the aims of the organisation. For instance…</em></p>
</blockquote>
<p>ROR currently plans to implement an optional paid service tier in 2022 geared towards power users and/or those that require unique technical support. This will be developed with the commitment that the core ROR dataset will remain freely and openly available.</p>
<p>Ensuring the full availability of the ROR dataset over the long term is a core value. Any revenue sources that ROR explores in the future will not interfere with this aim.</p>
<blockquote>
<p><em>Revenue based on services, not data – data related to the running of the research enterprise should be a community property. Appropriate revenue sources might include value-added services, consulting, API Service Level Agreements or membership fees.
The fees associated with any future paid ROR tier will be focused on optional value-added services, such as a separate API Service Level Agreement or custom data dumps.</em></p>
</blockquote>
<p>ROR data will remain open and free in the spirit of making it available as a community property.</p>


<h2 id="insurance-principles">Insurance Principles 
</h2>
<figure><img src="/img/ror-posi-insurance.png"/>
</figure>

<blockquote>
<p><em>Open source – All software required to run the infrastructure should be available under an open source license. This does not include other software that may be involved with running the organisation.</em></p>
</blockquote>
<p>ROR is a community project as well as a software project. All of ROR’s code and software processes are openly managed on Github and the development roadmap and prioritization are discussed in regular communications and community calls. Our goal is to maintain openness throughout the technical product management process. As part of this, all code is published openly on Github under a fully permissible MIT License. Whenever possible we leverage open source components and we work hard to ensure that our documentation allows other projects to leverage our CC0 data file, open API, and other tools.</p>
<blockquote>
<p><em>Open data (within constraints of privacy laws) – For an infrastructure to be forked it will be necessary to replicate all relevant data. The CC0 waiver is best practice in making data legally available. Privacy and data protection laws will limit the extent to which this is possible</em></p>
</blockquote>
<p>At the most fundamental level, ROR’s core purpose is to deliver a fully open, CC0 registry of identifiers and metadata for research organizations. We maintain the data file on Github and in a public data dump with a citable DOI.</p>
<blockquote>
<p><em>Available data (within constraints of privacy laws) – It is not enough that the data be made “open” if there is not a practical way to actually obtain it. Underlying data should be made easily available via periodic data dumps.</em></p>
</blockquote>
<p>As stated above, ROR code is freely and openly available on Github under the terms of a MIT License. The ROR dataset can be accessed on Github as well as via a public data dump that ROR releases upon every update to the registry (at present, these updates occur approximately every 3-4 months). The ROR API is publicly and openly available at <a href="https://api.ror.org/organizations">https://api.ror.org/organizations</a>.</p>
<blockquote>
<p><em>Patent non-assertion – The organisation should commit to a patent non-assertion covenant. The organisation may obtain patents to protect its own operations, but not use them to prevent the community from replicating the infrastructure.</em></p>
</blockquote>
<p>We value ourselves as a fully open and public registry of factual information about research organizations. The metadata and curatorial value that we contribute to the ROR registry are not considered the intellectual property of any ROR-related entity. As facts, information stored in the registry, by its nature, cannot be patented. In addition, ROR fully asserts no ownership of the information by making it available to the public under a CC0 1.0 public domain dedication.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The scholarly community depends on a network of open identifier and metadata infrastructure. Content identifiers and contributor identifiers are foundational components of this network.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">The Path to ROR Adoption in Scholarly Publishing and Beyond</title><id>https://doi.org/10.71938/sj7z-4037</id><link rel="alternate" href="https://ror.org/blog/2020-12-11-the-path-to-ror-adoption/"/><published>2020-12-11T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR offers an open and community-driven solution for tracking research outputs by institutions. ROR identifiers for research organizations are not meant to exist on their own. Their potential will be fully realized with wide adoption of ROR IDs in scholarly infrastructure and metadata.</p>
<p>Although ROR is still relatively new, ROR IDs are already being integrated and used in various ways. In a previous post, we shared the story of <a href="https://ror.org/blog/2019-07-10-ror-ing-together-with-dryad/">how Dryad relies on ROR</a> to capture affiliation data for its 30,000 datasets and counting. Dryad sends metadata to DataCite, which supports ROR IDs in its schema. This along with metadata provided by other DataCite members allows for <a href="https://blog.datacite.org/affiliation-facet-new-in-datacite-search/">searching research affiliated with specific institutions</a> and makes it possible to connect this research to other identifiers for scholarship and present these connections in <a href="https://blog.datacite.org/datacite-commons-at-your-service/">discovery services</a>.</p>
<p>In this post, we highlight ROR integration work focused on scholarly publishing, an area that depends heavily on the identification of institutional affiliations and one that can benefit from ROR’s open approach to this challenge.</p>
<p>Affiliations are a central component of publishing practices in a number of ways. They enable more complete identification of author details, which is important both during the peer review process (e.g., to minimize conflicts of interest by excluding editors or reviewers from the same institution as the author) and after publication (supporting the proper assignment of credit as well as easier tracking of research by institution). Using an affiliation identifier rather than a free-text field makes these processes more efficient and accurate.</p>
<p>Affiliations are also an increasingly crucial data point for open access publishing agreements. Publishers need to know if authors are covered under any agreements with their institutions or funders. Libraries negotiating agreements need to know how much publishing volume to expect by authors at their institutions.</p>
<p>The <a href="https://www.oaswitchboard.org/">OA Switchboard</a> and cOAlitionS <a href="https://journalcheckertool.org/">Journal Checker Tool</a> are two examples of how affiliation data based on ROR can be used by publishers, institutions, authors, and publishing stakeholders to help navigate this landscape. There are additional implementations taking place in the context of manuscript submission systems: Rockefeller University Press has built a ROR integration into its system, a team is developing a <a href="https://github.com/withanage/ror#installation">ROR plugin</a> for PKP’s Open Journal Systems (OJS) platform, and we expect to see additional integrations in place in early 2021.</p>
<p>With the <a href="https://www.crossref.org/blog/publishers-are-you-ready-to-ror/">forthcoming arrival of the new Crossref metadata schema</a>, ROR IDs can be included in the metadata that Crossref members deposit and will be available for others to use via Crossref’s API. This opens up a world of possibilities for discovering and linking research by institutions.</p>
<p>The value proposition of ROR truly depends on wide adoption and focusing on the potential to leverage affiliations across scholarly metadata is a core theme for the coming year. To that end, we are thrilled to <a href="https://blog.datacite.org/working-at-the-intersection-of-people-and-technology/">welcome Liz Krznarich</a> to the team as ROR’s new (and first) Adoption Manager. Based at DataCite (one of the organizations behind ROR), where she will also be supporting adoption efforts more generally, Liz’s work with ROR will be focused on driving wide adoption of ROR IDs, including support and guidance for the various ways in which ROR can be implemented. Liz’s background as a librarian and programmer, and her prior experience at ORCID, make her an ideal fit for this position. Stay tuned for more news from Liz as adoption projects get underway!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In this post, we highlight ROR integration work focused on scholarly publishing, an area that depends heavily on the identification of institutional affiliations, and we welcome Liz Krznarich as ROR's new Adoption Manager.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">What Does it Mean to Be in ROR?</title><id>https://doi.org/10.71938/wyy1-ec95</id><link rel="alternate" href="https://ror.org/blog/2020-11-18-what-does-it-mean-to-be-in-ror/"/><published>2020-11-16T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Some of the most frequent questions ROR receives are about what it means when an organization is in ROR, and how organizations end up in the registry in the first place. Many of you are understandably curious about how ROR records are added and updated. So, we thought this would be a good time to talk about how the registry is being maintained and how this process evolving.</p>


<h2 id="what-does-it-mean-if-an-organization-is-in-ror">What does it mean if an organization is in ROR? 
</h2>
<p>ROR is a registry of identifiers and metadata about research organizations&mdash;that is, organizations that are involved in research in some way. This can mean producing, facilitating, disseminating, funding, employing, and educating research(ers). The purpose of a ROR identifier is to be able to uniquely and unambiguously identify the organizations affiliated with research. These identifiers can be used in scholarly infrastructure to more efficiently discover and track research.</p>
<p>If an organization has a ROR ID, it means that it has been designated as a research organization and that the organization has been or will be linked to research outputs and activities. Inclusion in ROR does not imply anything about an organization&rsquo;s reputation or ranking or the quality of the research it is associated with. Inclusion in ROR does indicate that the organization is involved in research and that its ROR ID is or will be used to identify the organization in research infrastructure and metadata.</p>


<h2 id="is-my-organization-already-in-ror">Is my organization already in ROR? 
</h2>
<p>ROR already includes records for close to 99,000 organizations. You can look up organizations on the <a href="https://ror.org/search">ROR Search page</a> and via the <a href="https://api.ror.org/organizations">ROR API</a>.</p>


<h2 id="how-are-organizations-added-to-ror">How are organizations added to ROR?  
</h2>
<p>ROR was initially launched using seed data from GRID. The contents of ROR currently mirror those in GRID. ROR has all of the organizations that are in GRID as well as a subset of the GRID metadata.</p>
<p>Anyone can submit a request to add or update a record in ROR. These requests can be submitted to ROR via a <a href="https://ror.org/curation">public form</a>. For the time being, ROR coordinates these requests with GRID. The changes are made in GRID and then ROR syncs the changes. This happens every few months.</p>
<p>This situation is going to change in the future. Read on for more details!</p>


<h2 id="how-will-ror-be-maintained-over-time">How will ROR be maintained over time? 
</h2>
<p>The work of maintaining ROR over time, which we refer to as curation, involves infrastructure and workflows to support:</p>
<ul>
<li>Receiving, reviewing, and approving requests to add, update, merge, and deprecate records</li>
<li>Consistent and transparent policies around how records are added and maintained</li>
<li>A ROR metadata schema that is appropriate for ROR&rsquo;s global stakeholders</li>
<li>Tracking the history of registry changes over time</li>
<li>A regular schedule of public releases</li>
</ul>
<p>As mentioned above, ROR has already been doing some of this curation work, in coordination with GRID. The plan has always been for ROR to do all of its curation independently. This is what ROR is working toward now. ROR will continue to include GRID IDs in its metadata, and we expect that GRID will continue to map to ROR. There may be some inevitable points of divergence between the registries given that ROR is being designed as a community-based project (more on this below), while GRID is primarily designed to provide metadata for Digital Science&rsquo;s product portfolio.</p>


<h2 id="a-vision-for-community-based-curation">A vision for community-based curation 
</h2>
<p>ROR is a collaborative effort supported and adopted by stakeholders around the globe. <a href="https://ror.org/about/history">Community participation shaped the early vision for ROR</a> and <a href="https://ror.org/blog/2020-11-11-who-is-the-ror-community">continues to drive the project today</a> and distinguish it from similar initiatives.</p>
<p>From the very beginning, community involvement in the curation of ROR has been central to this vision. ROR&rsquo;s users and stakeholders have wide-ranging experiences working with persistent identifiers and scholarly infrastructure, whether in the context of libraries, research administration, platforms and repositories, and beyond. It is important that they be involved in decisions about how the registry is maintained.</p>
<p>As part of a <a href="https://www.imls.gov/grants/awarded/lg-246305-ols-20">funded project with IMLS</a> starting this fall, ROR is establishing a curation advisory board comprising community members with the expertise and the ethos to help develop policies and processes to ensure the integrity and longevity of registry data. This approach will allow ROR to: </p>
<ul>
<li>Provide open channels for community members interested in suggesting additions and updates</li>
<li>Centralize and scale curation processes through a trusted board of advisors</li>
<li>Make curation decisions that are appropriate for ROR&rsquo;s global stakeholders and that also can be applied consistently and transparently across the dataset</li>
</ul>
<p>This model will evolve over time as ROR grows and is more widely adopted. We are focused on reaching key milestones while continuing to make adjustments as we go along. We will share more about ROR&rsquo;s plans with curation in forthcoming updates. Please <a href="mailto:info@ror.org">get in touch with ROR</a> if you have any questions or suggestions at this time.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Some of the most frequent questions ROR receives are about what it means when an organization is in ROR, and how organizations end up in the registry in the first place.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">What (and Who) Is the ROR Community?</title><id>https://doi.org/10.71938/7b4z-vs35</id><link rel="alternate" href="https://ror.org/blog/2020-11-11-who-is-the-ror-community/"/><published>2020-11-11T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>The Research Organization Registry is a cross-organizational and multi-stakeholder initiative. ROR is run by a small group of steering organizations in collaboration with a broad network of community advisors and supporters.</p>
<p>This approach to operating ROR means that it does not fit neatly into existing notions of organizations in the scholarly communications and open infrastructure space:</p>
<ul>
<li>ROR is <strong>not an organization</strong> (fun fact: this means that ROR does not have its own ROR ID!)</li>
<li>ROR does <strong>not belong to any one organization</strong></li>
<li>ROR is <strong>not structured around a membership model</strong></li>
</ul>
<p>Anyone can access and use ROR data and anyone is welcome to get involved in ROR meetings and working groups.</p>
<p>Especially as ROR is still relatively new, keeping things relatively lightweight and nimble means that ROR can be flexible and responsive as the community continues to grow and evolve around it. Involving community stakeholders in all aspects of ROR&rsquo;s activities is central to the vision of ROR as a community-led project.</p>
<p>This blog post highlights three key aspects of ROR&rsquo;s community focus:</p>
<ul>
<li>ROR is a <strong>collaborative, global, multi-stakeholder</strong> community</li>
<li>There are a <strong>number of ways to get involved</strong> in ROR</li>
<li>It is important to have mechanisms for &ldquo;taking the temperature&rdquo; of the ROR community. So we are collecting data and input in the <strong>first-ever <a href="https://tinyurl.com/ror-survey-2020">ROR Community Survey</a></strong>, which we hope to run on a regular basis going forward.</li>
</ul>


<h2 id="who-runs-ror">Who runs ROR? 
</h2>
<p>CDL, Crossref, and DataCite are the operational team leading ROR. They are the <a href="https://ror.org/governance/">governing organizations</a> that share the overall responsibility for coordinating ROR activities and making decisions that support the growth and sustainability of ROR. These three organizations also contribute in-kind resources in the form of dedicated and ad hoc personnel for the ROR project team, which works on product development, outreach, and adoption, as well as infrastructural resources such as server hosting.</p>
<p>The <a href="https://ror.org/community/#steering-group">ROR Steering Group</a> consists of representatives of additional organizations, including Digital Science, which provided the seed data from GRID that was used to launch ROR. This group provides input on decisions and overall strategy, and support for ROR&rsquo;s growth and sustainability through outreach to funders, potential adopters, and other stakeholders in their respective networks.</p>


<h2 id="who-is-the-ror-community">Who is the ROR Community? 
</h2>
<p>The ROR Community Advisory Group is a voluntary group of about fifty users and stakeholders interested in adopting and championing ROR in their work and networks. Members participate in regular meetings to discuss project updates and provide feedback.</p>
<p>Group members are located around the world, reinforcing ROR&rsquo;s global reach. It would be great to see even greater coverage in the future.</p>
<figure><img src="/img/ror-community-map.png"/>
</figure>

<p>Group members also represent a range of organizations, reinforcing the range of stakeholders with an interest in ROR.</p>
<figure><img src="/img/ror-community-orgtypes.png"/>
</figure>

<p>ROR also coordinates smaller working groups in specific domains, one focused on publisher adoption and the other focused on curation. Some members of these groups also participate in the activities of the broader advisory group.</p>
<p><a href="https://ror.org/supporters/">ROR Supporters</a> are community members who have invested financially in ROR and/or signed on to a <a href="https://ror.org/supporters/#ror-signatories">public statement of support for ROR</a>.</p>


<h2 id="how-to-get-involved-in-ror">How to get involved in ROR 
</h2>
<p>Get in touch with ROR via <a href="mailto:info@ror.org">info@ror.org</a> if you would like to join the Community Advisory Group and/or get involved in ROR adoption or curation.</p>
<p>You can also support ROR and raise awareness about the project by integrating ROR in your system, or asking service providers to integrate ROR. There are also some community efforts outside of ROR, such as a <a href="https://github.com/pkp/pkp-lib/issues/5912">project to build a ROR plugin in OJS</a>. If you are active in the Wiki community, you could get involved in working with <a href="https://www.wikidata.org/wiki/Property:P6782">ROR in Wikidata</a>, or perhaps you&rsquo;d like to suggest a new Wikipedia page for ROR?</p>


<h2 id="take-the-ror-community-survey">Take the ROR Community Survey! 
</h2>
<p>As ROR grows and as the community grows around it, it will be increasingly important to have a good sense for who is working with ROR and what needs and interests are out there. We are collecting this data and input in the first-ever ROR Community Survey, which we hope to run on a regular basis going forward. <a href="https://tinyurl.com/ror-survey-2020">Take the survey now!</a></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">The Research Organization Registry is a cross-organizational and multi-stakeholder initiative. ROR is run by a small group of steering organizations in collaboration with a broad network of community advisors and supporters.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Sustaining and Strengthening ROR</title><id>https://doi.org/10.71938/zzgq-n571</id><link rel="alternate" href="https://ror.org/blog/2020-08-20-ror-progress-update/"/><published>2020-08-20T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>We&rsquo;re more than halfway through 2020, and it has already been a year like no other. In the midst of global upheaval and uncertainty, work on the Research Organization Registry continues. Building and sustaining community and connections through open scholarly infrastructure seems more important than ever.</p>
<p>Thanks to your support, engagement, and hard work, ROR has been making great progress toward our key goals for this year: building out ROR&rsquo;s infrastructure, driving adoption and integration of ROR IDs, and setting ROR up for long-term sustainability. We&rsquo;re excited to share news and updates on these and other efforts below.</p>


<h2 id="sustainability">Sustainability 
</h2>
<p>Sustainability planning is a key activity for ROR this year as the project matures. ROR&rsquo;s sustainability strategy has been developed according to and motivated by the following goals and principles.</p>
<ul>
<li><strong>Focus on providing free and open data</strong>: ROR data is CC0. It can never be owned by anyone, and future paid services won&rsquo;t impact access to the core data.</li>
<li><strong>Keep the scope manageable</strong>: ROR is focused exclusively on providing core high-quality open data and infrastructure that is specific to the affiliation use case, can be easily implemented and integrated, and can be leveraged and built upon by external services and projects.</li>
<li><strong>Cultivate early and sustained community engagement and governance</strong>: Community is key to ROR&rsquo;s success and longevity. Since Day 1, ROR has been focused on establishing a core base of community users invested in ROR&rsquo;s long-term success, driving adoption across the research landscape, and ensuring that ROR is attentive and adaptable to community needs.</li>
<li><strong>Maintain low overhead</strong>: ROR does not need to be its own organization, but can succeed as a joint collaboration by steering organizations and wider community. In this way, costs and other overhead can be contained.</li>
<li><strong>Diversify resourcing streams</strong>: ROR is not reliant on a single source of funding but rather strategically resourced through a combination of in-kind funds from steering organizations, grant funding, community contributions, and (in future) optional add-on service fees.</li>
</ul>


<h2 id="fundraising">Fundraising 
</h2>
<p>Last October, ROR kicked off a <a href="https://ror.org/blog/2019-10-16-help-sustain-ror/">fundraising campaign</a> to shore up resourcing for crucial development work and adoption efforts through a combination of grant funding and community investments, supplementing in-kind contributions from ROR&rsquo;s partner organizations. A number of you stepped up and pitched in to pledge support, providing an early and crucial boost to the campaign.</p>
<p>We are thrilled to announce that ROR has been awarded funding from the Institute of Museum and Library Services (IMLS) and the National Science Foundation (NSF) to implement key projects over the next two years. This funding will support development of a comprehensive community-based curation model in ROR (<a href="https://www.imls.gov/grants/awarded/lg-246305-ols-20">IMLS project</a>), and ongoing efforts to drive wider adoption of ROR IDs and promote the use of open affiliation identifiers in research infrastructure (<a href="https://www.nsf.gov/awardsearch/showAward?AWD_ID=2031172">NSF project</a>).</p>
<p>With this funding along with the community contributions received to date, ROR is now in a stronger position to be able to scale up its development and adoption efforts this year and in the years to come.</p>
<p>It is tremendously meaningful for ROR to have this support to count on from community partners as well as funders. Thank you!</p>


<h2 id="development">Development 
</h2>
<p>ROR development work is continuing to set up the registry&rsquo;s infrastructure to support independent management and curation of registry data. This is the top development priority for 2020. There are some key aspects to this effort:</p>
<ul>
<li>Create a <strong>ROR schema</strong></li>
<li>Add <strong>additional metadata fields</strong> to ROR (based on the original GRID seed file)</li>
<li>Identify <strong>records by status</strong> (e.g., active or inactive) and support redirects and tombstone pages for merged and deprecated records</li>
<li>Build an <strong>input mechanism for users to submit requests</strong> to make changes to the registry</li>
<li>Build a mechanism to <strong>make changes directly in ROR</strong></li>
</ul>
<p>As part of this development work, ROR is also building out workflows and policies to support long-term curation of the registry. We have been piloting these workflows with a small group of community group members and will be expanding this community-based curation model later this year.</p>


<h2 id="ror-adoption-and-integrations">ROR adoption and integrations 
</h2>
<p>A number of systems have already integrated or are working on integrating with ROR to collect and display clean and consistent affiliation data. The ROR website is keeping a running list of <a href="https://ror.org/integrations">current and pending adopters</a> that we know about. <a href="mailto:info@ror.org">Contact ROR</a> to add other examples to the list.</p>
<p>ROR has also convened an adoption working group for publishers and publishing systems working on or planning ROR integrations. If you are working on an integration and wish to be part of this group, please get in touch.</p>


<h2 id="outreach-and-events">Outreach and events 
</h2>
<p>ROR outreach and events are continuing this year and we appreciate the ability to stay connected virtually.</p>
<p>Catch the recordings of some recent conferences where ROR presented:</p>
<ul>
<li><a href="https://doi.org/10.5446/48011">VIVO Conference 2020</a></li>
<li><a href="https://youtu.be/eQlG7BQfAGs">Coalition for Networked Information (CNI) Spring 2020 meeting</a></li>
<li><a href="https://bit.ly/3cTES97">UKSG Annual Conference</a> (note: registration required)</li>
</ul>
<p>We started holding an &ldquo;Introduction to ROR&rdquo; webinar for those interested in getting a basic overview of the project. The inaugural webinar was held in April (watch the recording <a href="https://youtu.be/W61JMsC3Dho">here</a>) and the next one will take place in October (date to be confirmed soon).</p>
<p>ROR also has a brand-new <a href="https://www.youtube.com/channel/UCQBOpOpW-JEKoVCUlmCK1Eg/featured?view_as=subscriber">YouTube channel</a> where we will be sharing recordings of conference presentations as well as demos and tutorials for working with the registry.</p>


<h2 id="stay-connected">Stay connected 
</h2>
<p>ROR is a community-driven effort and we encourage anyone who is interested to get involved.</p>
<p>You can follow project updates on <a href="https://twitter.com/ResearchOrgs">Twitter</a> and here on the ROR Blog, and participate in discussions in the ROR Slack. If you&rsquo;re working on a ROR integration, check out the <a href="https://github.com/ror-community/ror-api">API documentation on Github</a>.</p>
<p>The ROR Community Advisory Group holds community calls every other month. Contact <a href="mailto:info@ror.org">info@ror.org</a> to join this group.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">We&amp;rsquo;re more than halfway through 2020, and it has already been a year like no other. In the midst of global upheaval and uncertainty, work on the Research Organization Registry continues.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Early Adopters of Organization Identifiers in DataCite</title><id>https://doi.org/10.71938/ffx0-y253</id><link rel="alternate" href="https://ror.org/blog/2020-03-10-early-adopters-of-organization-ids/"/><link rel="related" href="https://doi.org/10.59350/g96gh-x2361"/><published>2020-03-10T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Ted Habermann</name><uri>https://orcid.org/0000-0003-3585-6733</uri></author><category term="Explainers"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Version 4.3 of the DataCite Metadata Schema released during August, 2019 included (among other things), the capability to provide persistent identifiers for affiliated organizations in the metadata (Dasler and deSmaele, <a href="https://blog.datacite.org/identify-your-affiliation-with-metadata-schema-4-3/">Identify your affiliation with Metadata Schema 4.3</a>, 2019). This capability builds on the work and enthusiasm generated by the ROR Community that has championed the concept of open organization identifiers for several years (Gould, <a href="https://ror.org/blog/2019-12-17-year-in-review/">A Reflection on ROR&rsquo;s First Year</a>, 2019). This is a critical step towards consistent integration of organizations into the growing web of connections across the scholarly communications landscape and research communities in all disciplines.</p>
<p>The first organizational identifiers in DataCite metadata were funderIdentifiers introduced in Version 4.0 during late 2016. These identifiers might now be considered mature in DataCite metadata and will not be considered.  Here we focus on organizational identifiers introduced last August and are integrated into the DataCite schema as affiliationIdentifiers associated with either creators or contributors. In this blog I identify the early adopters that have already started to build these important connections into their metadata.</p>


<h2 id="finding-early-adopters">Finding Early Adopters 
</h2>
<p>The DataCite API makes it easy to find repositories that are adopting these new identifiers with two requests and a small bit of python (both functional but certainly could be improved):</p>
<pre tabindex="0"><code>  Creators:
import requests
import json
#
# Query DataCite for records that contain affiliationIdentifiers for creators
#
URL = &#34;&#34;&#34;https://api.datacite.org/dois?\
query=creators.affiliation.affiliationIdentifier:*\
&amp;affiliation=true&amp;page%5Bsize%5D=1&#34;&#34;&#34;
r = requests.get(URL)
for i in r.json()[&#39;meta&#39;][&#39;clients&#39;]:
    print(i[&#39;title&#39;], i[&#39;id&#39;], i[&#39;count&#39;])

Contributors:
import requests
import json
#
# Query DataCite for records that contain affiliationIdentifiers for contributors
#
URL = &#34;&#34;&#34;https://api.datacite.org/dois?\
query=contributors.affiliation.affiliationIdentifier:*\
&amp;affiliation=true&amp;page%5Bsize%5D=1&#34;&#34;&#34;
r = requests.get(URL)
for i in r.json()[&#39;meta&#39;][&#39;clients&#39;]:
    print(i[&#39;title&#39;], i[&#39;id&#39;], i[&#39;count&#39;])
</code></pre><p>Note that both of these queries list only the fifteen repositories with the largest number of identifiers.</p>
<p>The results of these queries (as of 2020-02-17), shown in the Table below, indicate that identifiers for creators, in 15+ repositories, are currently more common than identifiers for contributors, in 8 repositories, and that three repositories (bl.imperial, bl.nerc, and tib.wdcc) currently have some identifiers of both types. The counts in this Table are numbers of records with affiliations, not the number of affiliations.</p>
<figure><img src="/img/ror-adoption-table.png"/>
</figure>

<p>The numbers in this Table allow us to identify the early adopters of organizational identifiers, both DataCite members and repositories. These members of the community are the “guinea pigs” in this effort that are committed enough to the benefits of organizational identifiers in metadata to take action and serve as good examples for the community. All should be recognized as pioneers!</p>
<p>In an earlier blog (Habermann, <a href="https://metadatagamechangers.com/blog/2019/11/10/how-many-rors-do-we-need">How Many ROR’s Do We Need?</a>, 2019) I made the somewhat surprising observation that many DataCite repositories contained only a small number of unique affiliations and, therefore, those repositories could uniquely identify all organizations in their metadata with a small number of organizational identifiers. These earlier data are shown in this Figure. It indicates that 108 repositories only need one ROR and that 235 need five or less. This is great news for the adoption of RORs.</p>
<figure><img src="/img/ror-adoption-chart.png"/>
</figure>

<p>The early adopters identified above include examples where many metadata records contain a small number of affiliations – low hanging fruit along the path to adoption. Others face much more difficult tasks because of large numbers of affiliations spread across many records. We look forward to learning lessons as all of these groups forge forward.</p>
<p><em>Ted Habermann is a member of the <a href="/supporters#ror-community-advisors">ROR Community Advisory Group</a>. He is currently the founder and CEO of <a href="https://metadatagamechangers.com">Metadata Game Changers</a>, working with communities to understand and improve how they use metadata to share data and knowledge. This has been re-posted from the <a href="https://metadatagamechangers.com/blog/2020/3/5/early-adopters-of-organizational-identifiers">Metadata Game Changers blog</a>.</em></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Version 4.3 of the DataCite Metadata Schema released during August, 2019 included (among other things), the capability to provide persistent identifiers for affiliated organizations in the metadata (Dasler and deSmaele, Identify your affiliation with Metadata Schema 4.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR-ing Together in Portugal: A Community Celebration</title><id>https://doi.org/10.71938/bfsw-x908</id><link rel="alternate" href="https://ror.org/blog/2020-02-10-ror-ing-in-portugal/"/><published>2020-02-10T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR had a party in Portugal last month! Sixty friends - some new, some old - came together in Lisbon on the eve of <a href="https://www.pidapalooza.org">PIDapalooza 2020</a> to celebrate ROR’s unofficial first birthday, marking one year since the registry debuted at a community meeting in Dublin in January 2019. This year’s gathering was a chance to review the milestones that ROR has passed in the last twelve months, highlight early implementations of ROR, and discuss the work that lies ahead for the next year and beyond with ROR’s active community.</p>
<figure><img src="/img/lions-in-lisbon.jpg"/>
</figure>

<p>The review of ROR&rsquo;s first year covered many highlights, including:</p>
<ul>
<li><a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/">Launch of the registry</a> (with seed data from GRID) and assignment of ROR IDs (January 2019)</li>
<li>Formation of the <a href="/supporters">ROR Community group</a> (March 2019)</li>
<li>Addition of ROR IDs in <a href="https://www.wikidata.org/wiki/Property:P6782">Wikidata</a> (May 2019)</li>
<li><a href="/blog/2019-07-02-ror-development-update/">Updated codebase</a>, auto-sync to GRID, and [public data dump] (<a href="https://figshare.com/authors/ROR_Community/6832769">https://figshare.com/authors/ROR_Community/6832769</a>) (June/July 2019)</li>
<li>Support for ROR IDs in <a href="https://blog.datacite.org/identify-your-affiliation-with-metadata-schema-4-3/">DataCite metadata</a> (August 2019)</li>
<li>Implementation in <a href="/blog/2019-07-10-ror-ing-together-with-dryad/">Dryad</a> (September 2019)</li>
<li>Announcement of <a href="/blog/2019-10-16-help-sustain-ror/">ROR sustainability campaign</a> (October 2019)</li>
<li>Expansion of the <a href="/blog/2019-11-22-meet-the-ror-steering-group/">ROR Steering Group</a> (November 2019)</li>
</ul>
<p>These developments have been key to ROR&rsquo;s growth, adoption, and sustainability, resulting in:</p>
<ul>
<li>ROR IDs and metadata for close to <strong>98,000 organizations</strong></li>
<li>A set of <strong>tools for interacting with registry data</strong>, including the ability to match free-text affiliation strings to ROR IDs</li>
<li><strong>$100,000 in community investments</strong> to date, positioning ROR to be able to hire a technical lead to coordinate and accelerate development work in 2020 and beyond</li>
<li><strong>Active community engagement</strong> in bimonthly community calls, a formal statement of support for ROR, and community working groups on data curation and ROR integrations</li>
</ul>
<p>The meeting also showcased a variety of ways in which ROR is being used and implemented across the community:</p>
<ul>
<li>Simon Porter (Digital Science) showed how <strong>ROR IDs are being collected in submission workflows</strong> for the <a href="https://journal.physiomeproject.org/">Physiome Journal</a></li>
<li>Jamie McKee (Altum) shared how <strong>ROR IDs are being used in institutional profiles</strong> in the <a href="https://www.altum.com/grant-making-management-system-software-tools">ProposalCentral</a> grants management platform</li>
<li>Martin Fenner (DataCite) presented on how <strong>ROR IDs are supported in DataCite metadata</strong> and how <a href="https://blog.datacite.org/affiliation-facet-new-in-datacite-search/">DataCite&rsquo;s search faceting uses ROR</a> to identify research outputs by a given affiliation</li>
<li>Christine Ferguson (EMBL-EBI) discussed recent <a href="https://www.project-freya.eu/en/deliverables/freya_d4-4.pdf">work by the FREYA project</a> to analyze opportunities to integrate organization IDs in PID services and scholarly communication systems, with <strong>ROR IDs as a predominant option</strong></li>
<li>Ted Habermann (Metadata Game Changers) examined the <strong>relationship between affiliation &ldquo;heterogeneity&rdquo; and ROR adoption</strong> among DataCite repositories</li>
<li>Richard Wynne (Rescognito) demoed a <strong>researcher recognition platform that uses ROR IDs</strong> to identify organizations</li>
<li>Carolyn Grant (Harvard Center for Astrophysics) previewed work to <strong>link ROR IDs to local department-level identifiers</strong> in the <a href="https://ui.adsabs.harvard.edu/">Astrophysics Data System</a></li>
</ul>
<blockquote class="twitter-tweet"><p lang="en" dir="ltr">Interesting use case of PIDs implementation in the Physiome Journal, presented by <a href="https://twitter.com/sjcporter?ref_src=twsrc%5Etfw">@sjcporter</a> (<a href="https://twitter.com/digitalsci?ref_src=twsrc%5Etfw">@digitalsci</a>) at <a href="https://twitter.com/ResearchOrgs?ref_src=twsrc%5Etfw">@ResearchOrgs</a> community meeting <a href="https://twitter.com/hashtag/letsRORtogether?src=hash&amp;ref_src=twsrc%5Etfw">#letsRORtogether</a> <a href="https://t.co/MDIiVIRpgH">pic.twitter.com/MDIiVIRpgH</a></p>&mdash; Gabriela Mejias ⭐️⭐️⭐️ @gabioshka@mastodon.social (@gabioshka) <a href="https://twitter.com/gabioshka/status/1222183096126267394?ref_src=twsrc%5Etfw">January 28, 2020</a></blockquote>
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>


<p>In the second part of the meeting, the focus was on what comes next for ROR. The key themes for 2020 will be on sustainability, curation, and adoption.</p>
<ul>
<li><strong>Sustainability</strong>: ROR&rsquo;s fundraising campaign will continue this year, aiming to reach $400,000 by the end of the 2021 through a combination of community investments and grant funding. The success of the fundraising campaign so far will allow ROR to secure additional resourcing for technical development.</li>
<li><strong>Curation</strong>: Curation is a top priority for ROR development. This will entail technical work to manage ROR data independently, and policy-based work to curate the registry with input and involvement from community members.</li>
<li><strong>Adoption</strong>: Another area of focus will be on driving and supporting greater adoption of ROR IDs in various systems, especially by publishers.</li>
</ul>
<blockquote class="twitter-tweet"><p lang="en" dir="ltr">.<a href="https://twitter.com/ResearchOrgs?ref_src=twsrc%5Etfw">@ResearchOrgs</a> focusing on sustainability, not creating a new org but utilizing partner support and appreciating the community so early in its life is a good model.  <a href="https://twitter.com/hashtag/pidaplooza2020?src=hash&amp;ref_src=twsrc%5Etfw">#pidaplooza2020</a> <a href="https://t.co/KRi4BiHDG6">pic.twitter.com/KRi4BiHDG6</a></p>&mdash; Erin Robinson - @erinmr@mas.to (@connector_erin) <a href="https://twitter.com/connector_erin/status/1222183630287646721?ref_src=twsrc%5Etfw">January 28, 2020</a></blockquote>
<script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>


<p>Meeting attendees spent time working in breakout groups to discuss priorities for ROR associated with specific stakeholder groups: Funders, Publishers, Repository Managers, System Providers, and Data Wranglers. Common themes across these groups were interoperability, end user support, and connectivity between ROR and other identifiers. These discussions will help to inform future developments with ROR.</p>
<figure><img src="/img/ror-meeting-breakouts.jpg"/>
</figure>

<p>At the end of the meeting, we all roared together!</p>
<figure><img src="/img/ror-ing-in-lisbon.jpg"/>
</figure>

<p>It was a fun party, and now it&rsquo;s time to get back to work! 2020 will be a big year for ROR - thank you for being part of this journey.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR had a party in Portugal last month! Sixty friends - some new, some old - came together in Lisbon on the eve of PIDapalooza 2020 to celebrate ROR’s unofficial first birthday, marking one year since the registry debuted at a community meeting in Dublin in January 2019.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">A Reflection on ROR's First Year</title><id>https://doi.org/10.71938/1j4z-y533</id><link rel="alternate" href="https://ror.org/blog/2019-12-17-year-in-review/"/><published>2019-12-17T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Year in Review"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p><em>Flashback to one year ago, December 2018</em>:</p>
<p>The ROR project team was putting the final pieces in place to launch the ROR MVR (minimum viable registry) in January. The ROR ID format was under discussion. A website was under construction. The purchase of the ror.org domain was being negotiated. We were getting ready to ROAR! But we weren&rsquo;t sure at that time who else would be listening, and who might be ready to roar alongside us.</p>
<p>What a year it has been!</p>
<p>Today, in December of 2019, the ROR registry now includes <a href="https://ror.org/search">IDs and metadata for more than 97,000 organizations</a>. Users are working with the <a href="https://github.com/ror-community/ror-api">ROR API</a> and <a href="https://github.com/ror-community/ror-reconciler">OpenRefine reconciler</a> to <a href="https://github.com/ror-community/ror-api#affiliation-matching">match free-text affiliation strings to ROR IDs</a> and clean up messy lists of affiliations. The <a href="https://figshare.com/authors/ROR_Community/6832769">public ROR data dump</a> has been downloaded more than 60 times. ROR IDs have been added to <a href="https://www.wikidata.org/wiki/Property:P6782">Wikidata</a> and to <a href="https://grid.ac/institutes">GRID</a>. Datasets in <a href="https://datadryad.org//">Dryad</a> now have <a href="https://ror.org/blog/2019-07-10-ror-ing-together-with-dryad/">affiliations (with ROR IDs)</a> for the very first time, and thousands of research objects in DataCite can be <a href="https://blog.datacite.org/affiliation-facet-new-in-datacite-search/">searched by affiliations</a> now that ROR IDs are <a href="https://blog.datacite.org/identify-your-affiliation-with-metadata-schema-4-3/">supported in DataCite metadata</a>. ROR is being integrated into platforms and services across the scholarly communication landscape, including grant application systems (<a href="https://www.altum.com/grant-making-management-system-software-tools">Altum&rsquo;s ProposalCentral</a>), citation trackers (<a href="https://cobaltmetrics.com/">Cobaltmetrics</a>), researcher recognition (<a href="https://rescognito.com/">Rescognito</a>), and Crossref metadata (<a href="https://www.crossref.org/blog/proposed-schema-changes-have-your-say/">coming soon!</a>). Other implementations are in the works in publisher systems, institutional repositories, researcher profiles, and more&mdash;all enabling more powerful and efficient discovery and tracking of research outputs by the organizations where they are affiliated.</p>
<p><em>Phew, what a year!</em></p>
<p><strong>But that&rsquo;s not all</strong>: The ROR Community came to life in 2019, energized by the promise and vision of truly community-owned infrastructure for organization identifiers. <a href="https://ror.org/supporters/">Community stakeholders and supporters</a> are providing feedback on ROR development, participating in project discussions, spreading the word about ROR at conferences and events, and signing on as ROR supporters. ROR also welcomed <a href="https://ror.org/blog/2019-11-22-meet-the-ror-steering-group/">new members to its Steering Group</a> to help guide the registry&rsquo;s further development and the community engagement around it.</p>
<p><em>What a year, indeed.</em></p>
<p>**But there is more to do, and more yet to come. **</p>
<p>We have big dreams of <strong>growing the registry</strong>, of <strong>building out more and better tools</strong> for interacting with and implementing ROR data, of making the <strong>ROR website more user-friendly</strong>, of supporting <strong>wider adoption and implementation of ROR IDs</strong>, and at long last setting up ROR&rsquo;s technical systems so the <strong>registry can be curated independently</strong> from GRID.</p>
<p>This work takes time. It also requires more resources beyond the existing grassroots effort fueled by in-kind contributions from the ROR steering organizations. While these organizations are committed to ROR for the long term, <strong>it is time to have dedicated support</strong> to strengthen and sustain ROR&rsquo;s existing capacity.</p>
<p>The <strong>ROR sustainability campaign <a href="https://ror.org/blog/2019-10-16-help-sustain-ror/">launched in October</a></strong> to secure community investment in the registry for the future. Thanks to the generosity of the community, this campaign has <strong>raised $48,000 to date</strong>, confirming that there is a clear consensus that this type of initiative is <a href="https://ror.org/blog/2019-12-02-note-of-thanks/">valuable and is worth supporting</a>.</p>
<p><strong>Can you help ROR to do even more in the year(s) to come?</strong> If you have not yet contributed to the fundraising campaign, consider <a href="https://ror.org/supporters/">chipping in</a> to bring ROR closer to its sustainability goal (email <a href="mailto:mailto:donate@ror.org">donate@ror.org</a> to make a pledge). You can also help spread the word to other organizations that might be able to contribute. Your support will go directly to ROR&rsquo;s development and will help to maintain ROR as trusted, open, and community-led infrastructure.</p>
<p>It has been thrilling to see so much enthusiasm for ROR in the past year, and so much trust in the initiative. We take this responsibility seriously&mdash;and know you do as well.</p>
<p>Thank you for supporting ROR, for energizing the project, and for roaring along with us every step of the way.</p>
<p><strong>#LetsRORtogether</strong></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">What a year it has been! Here's what happened in ROR's first year and what we plan for the future.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">A Note of Thanks from ROR</title><id>https://doi.org/10.71938/86x3-1r77</id><link rel="alternate" href="https://ror.org/blog/2019-12-02-note-of-thanks/"/><published>2019-12-02T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>As we announced previously, ROR <a href="https://ror.org/blog/2019-10-16-help-sustain-ror/">launched a fundraising campaign in October</a> to ensure the registry&rsquo;s long-term sustainability. We are grateful for the community supporters who have already contributed to this campaign. The first round of supporters was <a href="https://ror.org/blog/2019-11-13-ror-fundraising-update/">announced a couple of weeks ago</a>. We are excited in this post to acknowledge the following additional contributors:</p>
<ul>
<li><a href="https://www.aps.org/">American Physical Society</a></li>
<li><a href="https://www.apa.org/">American Psychological Association</a></li>
<li><a href="https://www.acm.org/">Association for Computing Machinery</a></li>
</ul>
<p>These new contributions supplement those previously received from:</p>
<ul>
<li><a href="https://www.altum.com/">Altum</a></li>
<li><a href="https://www.cwts.nl/">Center for Science and Technology Studies (CWTS), Leiden University</a></li>
<li><a href="https://www.curtin.edu.au/">Curtin University</a></li>
<li><a href="https://datadryad.org/">Dryad</a></li>
<li><a href="https://elifesciences.org/">eLife</a></li>
<li><a href="https://www.tib.eu/en/">German National Library of Science and Technology (TIB)</a></li>
</ul>
<p>The ROR campaign has now raised $36,000 toward our 2019 target of $75,000. As we approach the end of the year, we are calling on our community to help us reach our goal. Contributions in any amount are welcome, and will go directly to support the registry&rsquo;s growth and development. To start your contribution, email <a href="mailto:donate@ror.org">donate@ror.org</a> to make a pledge and request an invoice.</p>
<p>If you have already made a contribution, or if you are not able to support the campaign financially, please consider spreading the word to your networks and sharing your views about how and why open and community-led infrastructure for organization identifiers matters to you. You can reference the <a href="https://ror.org/blog/2019-10-16-help-sustain-ror/">blog post</a> announcing the campaign, and mention <a href="https://twitter.com/ResearchOrgs">@ResearchOrgs</a> on Twitter (<a href="https://twitter.com/ResearchOrgs/status/1184879636267913216">here is the campaign kick-off Tweet</a>).</p>
<p>In this season of giving thanks, the ROR team is especially grateful for your support and engagement. #LetsRORtogether!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">As we announced previously, ROR launched a fundraising campaign in October to ensure the registry&amp;rsquo;s long-term sustainability. We are grateful for the community supporters who have already contributed to this campaign.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Untangling Affiliation Strings with the ROR API</title><id>https://doi.org/10.71938/36jw-rs79</id><link rel="alternate" href="https://ror.org/blog/2019-11-25-untangling-affiliation-strings/"/><published>2019-11-25T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Dominika Tkaczyk</name><uri>https://orcid.org/0000-0001-5055-7876</uri></author><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR <a href="/blog/2019-02-10-announcing-first-ror-prototype">launched in January 2019</a> with records for nearly 100,000 research organizations, all with unique IDs and associated metadata. ROR data is useful for a variety of reasons and for a variety of users, including both humans and machines. It is essential for ROR to have robust mechanisms for searching, retrieving, and filtering.</p>
<p>Since launching the registry, we have been making improvements to the codebase to strengthen and enhance these mechanisms. To start off, we <a href="/blog/2019-07-02-ror-development-update">rewrote the API in Python</a>. Another change that we made was to deprecate the parameters <em>query.name</em> and <em>query.names</em> and redirect these to the <em>query</em> parameter to <a href="https://github.com/ror-community/ror-api#querying">support finding organizations using specific terms</a>.</p>
<p>A third improvement that we have recently implemented is affiliation matching to support searching for organizations in a full affiliation string. Previously, the API only supported specific types of tasks, like querying based on a few important terms, or filtering on specific categories like country or organization type. We heard feedback from users that they wanted the API to be able to handle unstructured affiliation strings, including those containing multiple affiliations. A number of ROR stakeholders are working on projects to explore assigning ROR IDs to existing research outputs that have free-text affiliation strings. This type of matching functionality is potentially very useful for such projects.</p>
<p>In the rest of this post we go into detail about what affiliation matching means, why it is useful, and how it works in practice. </p>


<h2 id="what-is-affiliation-matching---and-why-is-it-needed">What is affiliation matching - and why is it needed? 
</h2>
<p>Affiliation matching refers to detecting organizations mentioned in raw affiliation strings. For example, given an affiliation string:</p>
<p><em>Department of Civil and Industrial Engineering, University of Pisa, Largo Lucio Lazzarino 2, Pisa 56126, Italy</em></p>
<p>we expect to detect the organization University of Pisa with ROR ID <a href="https://ror.org/03ad39j10">https://ror.org/03ad39j10</a></p>
<p>Previously, the only way to find an organization&rsquo;s ROR ID based on some version of its name was the ROR API&rsquo;s search functionality. This search functionality can be accessed through API&rsquo;s <em>query</em> parameter. When a user queries the API providing a number of search words, the internal search engine scores ROR organizations according to the relevance to the search words. All ROR organizations with a non-zero relevance are then sorted according to the relevance (from most to least relevant). The sorted list is finally returned to the user.</p>
<p>In theory, this standard search functionality can be also used to find the ROR ID of the organization mentioned in an affiliation string. Indeed, we could simply use the affiliation string as the search query words and accept the top result as the mentioned organization. There are, however, a number of problems with this approach.</p>
<p>Firstly, traditional search assumes that all given query words describe the item we are looking for. Unfortunately, this is not the case when the input is a raw affiliation string. A typical affiliation string contains some form of the name of the organization, but also less relevant fragments such as the name of the department, address, author&rsquo;s name, or author&rsquo;s email address.</p>
<p>Secondly, traditional search assumes that the query words describe only one item. Again, this is not the case in affiliation matching. One affiliation string quite often contains mentions of several organizations. It would be good to be able to detect all of them.</p>
<p>Finally, traditional search returns a list of results sorted by relevance, but does not make any decision as to which (if any) of the returned organizations are in fact mentioned in the input affiliation string.</p>
<p>For these reasons, using the traditional search for affiliation matching will make the results much less precise. It will also be difficult to integrate such an approach into an automated affiliation processing workflow.</p>


<h2 id="how-affiliation-matching-works">How affiliation matching works 
</h2>
<p>To solve these issues, we have added a separate affiliation matching functionality to the ROR API. It can be accessed through API&rsquo;s <em>affiliation</em> parameter, for example:</p>
<p><a href="https://api.ror.org/organizations?affiliation=Department+of+Civil+and+Industrial+Engineering,+University+of+Pisa,+Largo+Lucio+Lazzarino+2,+Pisa+56126,+Italy">https://api.ror.org/organizations?affiliation=Department+of+Civil+and+Industrial+Engineering,+University+of+Pisa,+Largo+Lucio+Lazzarino+2,+Pisa+56126,+Italy</a></p>
<p>Here&rsquo;s an example of how it works:
<a href="https://gyazo.com/2af8f34a5e09d5d92c7ff5e4df4edb40"><img src="https://i.gyazo.com/2af8f34a5e09d5d92c7ff5e4df4edb40.gif" alt="Image from Gyazo"></a></p>
<p>Similarly as in the case of the traditional search, the output of affiliation matching also contains a list of organizations sorted by relevance. However, there are important differences:</p>
<ul>
<li>Each organization in the output has been matched to a substring of the input rather than the entire input.</li>
<li>Each organization in the output has a normalized similarity score ranging from 0 to 1.</li>
<li>Each organization in the output has a binary field indicating whether it should be considered positively matched to a given affiliation substring. The output may include multiple such organizations positively matched to different substrings.</li>
</ul>
<p>More information about the output fields can be found <a href="https://github.com/ror-community/ror-api#affiliation-matching">here</a>.</p>
<p>How does the matching algorithm work? First, candidate substrings are extracted from the input affiliation string. This is done based on separators such as commas or semicolons. Next, each candidate substring is used separately to query the ROR API in a more traditional way. The resulting organizations are scored based on the similarity with the candidate substring. Then, the results for all candidate substrings are combined together into a final list. Finally, organizations are marked as successfully matched if the following conditions are met:</p>
<ul>
<li>The similarity score is at least 0.9.</li>
<li>If the affiliation string contains the name of the country, it must be the same as the matched organization&rsquo;s country.</li>
<li>The substring cannot overlap with another substring successfully matched to an organization.</li>
</ul>
<p>You can access the affiliation matching code <a href="https://github.com/ror-community/ror-api/blob/master/rorapi/matching.py">here</a> (along with other helpful API documentation). We hope you give it a try and let us know what you think.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR launched in January 2019 with records for nearly 100,000 research organizations, all with unique IDs and associated metadata. ROR data is useful for a variety of reasons and for a variety of users, including both humans and machines.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Meet the ROR Steering Group</title><id>https://doi.org/10.71938/a2dz-qa63</id><link rel="alternate" href="https://ror.org/blog/2019-11-22-meet-the-ror-steering-group/"/><published>2019-11-22T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is thrilled to announce that we are welcoming new members to the <a href="https://ror.org/about/#steering-group">ROR Steering Group</a>. The group now consists of the following members:</p>
<ul>
<li><strong>Matt Buys</strong>, <a href="https://ror.org/04wxnsj81">DataCite</a></li>
<li><strong>John Chodacki</strong>, <a href="https://ror.org/03yrm5c26">California Digital Library</a></li>
<li><strong>Daniel Hook</strong>, <a href="https://ror.org/02ktfc112">Digital Science</a></li>
<li><strong>Clifford Lynch</strong>, <a href="https://ror.org/043fjtb89">Coalition for Networked Information</a></li>
<li><strong>Ritsuko Nakajima</strong>, <a href="https://ror.org/00097mb19">Japan Science and Technology Agency</a></li>
<li><strong>Ed Pentz</strong>, <a href="https://ror.org/02twcfp32">Crossref</a></li>
<li><strong>Judy Ruttenberg</strong>, <a href="https://ror.org/053mpbz30">Association of Research Libraries</a></li>
<li><strong>Ina Smith</strong>, <a href="https://ror.org/02qsf1r97">Academy of Science of South Africa</a></li>
</ul>
<figure><img src="/img/ror-steering-group.png" width="75%"/>
</figure>

<p>Approximately one year ago, a ROR project team formed as the outcome of the prior <a href="https://ror.org/about//history">OrgID initiative</a> to initiate a <a href="https://ror.org/blog/2018-12-02-the-ror-of-the-crowd/">startup effort to develop the first iteration of the ROR registry</a>. This team, a collaboration between California Digital Library, Crossref, DataCite, and Digital Science, <a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/">launched the registry</a> in January 2019.</p>
<p>Expanding the ROR Steering Group beyond the four organizations that launched ROR is key to driving adoption of ROR IDs, reinforcing community engagement, and supporting long-term sustainability. The Steering Group will help to guide ROR&rsquo;s strategic directions and champion the use of open identifiers for research organizations across the scholarly communication landscape.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is thrilled to announce that we are welcoming new members to the ROR Steering Group.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Pride in Community: ROR Fundraising Update</title><id>https://doi.org/10.71938/yjg5-we91</id><link rel="alternate" href="https://ror.org/blog/2019-11-13-ror-fundraising-update/"/><published>2019-11-13T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR <a href="https://ror.org/blog/2019-10-16-help-sustain-ror/">launched a fundraising campaign in October</a> to call on community stakeholders to pitch in toward supporting ROR’s long-term sustainability. While the overall goal of this campaign is to raise $175,000 from community supporters over the next two years, we set an initial target of $75,000 by the end of 2019. In our fourth full week of the campaign, we are excited to announce that we have so far raised $13,500 in contributions from six different supporting organizations. We are grateful to these early supporters for getting us started off on the right foot!</p>
<ul>
<li><a href="https://www.altum.com/">Altum</a></li>
<li><a href="https://www.cwts.nl/">Center for Science and Technology Studies (CWTS), Leiden University</a></li>
<li><a href="https://www.curtin.edu.au/">Curtin University</a></li>
<li><a href="https://datadryad.org/">Dryad</a></li>
<li><a href="https://elifesciences.org/">eLife</a></li>
<li><a href="https://www.tib.eu/en/">German National Library of Science and Technology (TIB)</a></li>
</ul>
<p>ROR’s growing community of supporters speaks to the importance of building and sustaining open infrastructure for scholarly communications.</p>
<p>Steve Pinchotti, CEO of Altum—which has integrated ROR IDs into 26,000 institution profiles in its ProposalCentral grants platform—stated: “ROR is a critical component of a connected research data landscape. As a software company focused on the advancement of research, Altum recognizes our responsibility to financially support and sustain the key research infrastructure initiatives like ORCID and ROR that enable open science and open global identifiers for research outputs, research contributors, and research institutions.”</p>
<p>Melissa Harrison, eLife Head of Production Operations, adds: “The distribution of high-quality metadata using various persistent identifiers is a great tool for advancing connections and the interlinking of scholarly content with other aspects of the ecosystem. We are delighted to support this community-led initiative for an open persistent identifier for research organizations to complement those we at eLife already use for content, peer review, data, people and funding.&quot;</p>
<p>We look forward to crossing other milestones soon in the coming weeks as we approach our year-end goal of $75,000. But we can’t do this alone! How can you help? By spreading the word far and wide—about the campaign, about the project, and about why open community-driven infrastructure for organization IDs matters to you. You can reference the <a href="https://ror.org/blog/2019-10-16-help-sustain-ror/">blog post</a> introducing the campaign, and mention <a href="https://twitter.com/ResearchOrgs">@ResearchOrgs</a> on Twitter (<a href="https://twitter.com/ResearchOrgs/status/1184879636267913216">here is the campaign kick-off Tweet</a>). And how can you donate? Simply email <a href="mailto:donate@ror.org">donate@ror.org</a> to make a pledge and we’ll prepare an invoice.</p>
<p>Thank you for supporting ROR!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR launched a fundraising campaign in October to call on community stakeholders to pitch in toward supporting ROR’s long-term sustainability. While the overall goal of this campaign is to raise $175,000 from community supporters over the next two years, we set an initial target of $75,000 by the end of 2019.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Help Build Open Infrastructure for Organization Identifiers: A Call to Support and Sustain ROR</title><id>https://doi.org/10.71938/c110-cc85</id><link rel="alternate" href="https://ror.org/blog/2019-10-16-help-sustain-ror/"/><published>2019-10-16T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is the <a href="https://ror.org">Research Organization Registry</a>, a community-led project to develop an open, sustainable, usable, and unique identifier for every research organization in the world. </p>
<p>ROR emerged to fill a crucial gap in the scholarly communication landscape: while we already had an open network of identifiers for research outputs (DOIs for publications and data) and research contributors (ORCID IDs), open infrastructure for research organizations was a missing piece. </p>
<p>ROR is a robust and stable registry of identifiers for close to 100K organizations (and counting!), and provides open tools for using and implementing ROR IDs, including a <a href="https://ror.org/search">front-end search interface</a>, an <a href="https://github.com/ror-community/ror-api">API</a>, a <a href="https://github.com/ror-community/ror-reconciler">reconciler</a> that works with OpenRefine, and a <a href="https://doi.org/10.6084/m9.figshare.9172553">public data dump</a>. Everything is openly available on <a href="https://github.com/ror-community">Github</a>.</p>
<p>The promise of ROR lies in connecting identifiers throughout the scholarly communication landscape. ROR IDs are being captured in systems and platforms where researcher affiliations are collected, and will be supported in Crossref and <a href="https://blog.datacite.org/identify-your-affiliation-with-metadata-schema-4-3/">DataCite metadata</a> to enable discovery and tracking of research outputs across institutions and funding bodies.</p>
<p>As we grow the registry, we will be building curation tools for maintaining ROR records over time, establishing a community curation board, and developing more support for system integrations and for downstream uses of registry data.</p>
<p>ROR is committed to being a community-driven effort. We are a <a href="https://ror.org/about">collaborative project led by academic and nonprofit organizations</a> with deep expertise in this space. We have <a href="https://ror.org/supporters">supporters and advisors</a> from across industries and around the world.</p>


<h2 id="supporting-and-sustaining-ror">Supporting and sustaining ROR 
</h2>
<p>All of ROR&rsquo;s work so far has been completed through in-kind donations from ROR steering organizations. In the coming years, we want to further develop ROR to enable greater adoption and downstream uses. </p>
<p>Our organizations are committed to ROR for the long-term but we can&rsquo;t move forward without dedicated community support. We are <strong>launching a fundraising campaign in order to be able to scale up our operations, hire dedicated staff, and develop and deliver new features</strong>, with a plan to launch a paid service tier in 2022 to recover costs while keeping the registry&rsquo;s data open and free for all.</p>
<p><strong>ROR aims to raise $175,000 in donations over the next two years</strong>. As a supporter, you&rsquo;ll have an opportunity to be part of this exciting community effort from the beginning and to ensure its long-term growth and success. Our first fundraising target is <strong>$75,000 by the end of 2019</strong> in order to secure enough funds to hire a technical lead and to organize an in-person ROR Community planning meeting at <a href="https://www.pidapalooza.org">PIDapalooza</a> in January 2020. </p>


<h2 id="supporter-opportunities">Supporter opportunities 
</h2>
<p>Donations of all shapes and sizes are welcome. Suggested amounts are as follows:</p>
<ul>
<li>Lioness: $10000 </li>
<li>Tiger: $7000</li>
<li>Jaguar: $3500 </li>
<li>Leopard: $1000</li>
</ul>
<p><em>Why these animals?</em> They are the only four big cats in the world that have the ability to ROAR!
<figure><img src="/img/RoaringCats.png"/>
</figure>
</p>
<p>Donors in 2019 and 2020 will be listed as founding supporters on the ROR website. </p>
<p>You can also support ROR by spreading the word and by actively implementing ROR in systems and workflows to help our communities fully realize the benefits that ROR IDs help us to unlock. If you have engineering experience and want to get involved in the registry&rsquo;s technical development, please let us know. </p>


<h2 id="ready-to-contribute">Ready to contribute?  
</h2>
<p>Please email <a href="mailto:donate@ror.org">donate@ror.org</a>  to indicate your interest in making a donation. As part of its in-kind contributions to the ROR project, Crossref will process donations on ROR&rsquo;s behalf. Crossref will report donation details and updates to the ROR Steering Group. The donations will be used exclusively for ROR activities and spending of the funds will be approved by the ROR Steering Group.  </p>


<h2 id="thank-you-for-supporting-ror-and-the-ror-community">Thank you for supporting ROR and the ROR Community 
</h2>
<ul>
<li><strong>John Chodacki</strong>, Director of University of California Curation Center (UC3), <a href="https://ror.org/03yrm5c26">California Digital Library</a></li>
<li><strong>Ed Pentz</strong>, Executive Director, <a href="https://ror.org/02twcfp32">Crossref</a></li>
<li><strong>Matt Buys</strong>, Executive Director, <a href="https://ror.org/04wxnsj81">DataCite</a></li>
</ul>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is the Research Organization Registry, a community-led project to develop an open, sustainable, usable, and unique identifier for every research organization in the world.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR-ing Together: Implementing Organization IDs in Dryad</title><id>https://doi.org/10.71938/gftz-ac63</id><link rel="alternate" href="https://ror.org/blog/2019-07-10-ror-ing-together-with-dryad/"/><published>2019-07-10T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><author><name>Daniella Lowenberg</name><uri>https://orcid.org/0000-0003-2255-1869</uri></author><category term="Adoption News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>How many datasets have been published in Dryad from researchers at the University of California? This question is surprisingly complicated. A short answer might be, <em>we don&rsquo;t know</em>! A better answer could be, <em>coming soon - stay tuned</em>!</p>
<p>And a more complete and detailed answer might go something like this:</p>
<p>It&rsquo;s not easy to determine how many datasets in Dryad are affiliated with the University of California - or any other institution, for that matter. This is the result of two main factors: (1) Dryad historically did not collect affiliation information from authors submitting datasets; and (2) even if Dryad had collected this information, it likely would have done so in a free-text field that allowed authors to write their affiliation in any number of ways (think &ldquo;UC Berkeley,&rdquo; &ldquo;University of California-Berkeley,&rdquo; or &ldquo;Berkeley,&rdquo; for example). Why? Because until recently, the scholarly research and publishing community did not have an easy and open option to rely on a standard set of affiliation names and related IDs to identify and disambiguate institutions.</p>
<p>This changed a few months ago with the launch of <a href="https://ror.org">ROR</a> - the Research Organization Registry. ROR is a community-led project to develop an open, sustainable, usable, and unique identifier for every research organization in the world. The ROR MVR (minimum viable registry) <a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/">launched in January 2019</a> and began assigning unique ROR IDs to more than 91,000 organizations.</p>
<p>At its core, ROR is focused on filling a very specific and crucially important gap in scholarly research and publishing infrastructure: information about the organizations affiliated with researchers and research outputs. The rise of DOIs to identify datasets and publications and ORCID IDs to identify researchers and contributors has facilitated more efficient discovery and tracking of research outputs. But without being able to identify where these outputs and authors are affiliated, this discovery and tracking can only go so far. At best, an immense amount of additional and manual work is involved in extracting this information to fill the gap. At worst? The gap never gets filled in. With ROR IDs, the idea is that both of these scenarios no longer happen. ROR is intended for use by the research community, for the purposes of increasing the use of organization identifiers in the community and enabling connections between organization records in various systems.</p>
<p>ROR and <a href="https://www.datadryad.org">Dryad</a> joined forces this spring to tackle two different yet related challenges. Following the launch of the MVR, ROR was interested in finding a partner to pilot a simple yet effective implementation of the ROR API. Dryad was interested in implementing a solution to the problem of missing affiliation data. As a <a href="https://blog.datadryad.org/2018/05/30/dryad-partnering-with-cdl-to-accelerate-data-publishing/">longstanding community partner</a> in data publishing and open infrastructure projects, the Dryad team was eager to be an early adopter of ROR and blaze the trail toward wider implementation and collection of ROR IDs across multiple systems and platforms.</p>
<p>Dryad&rsquo;s developers working on the new Dryad platform (launching later this summer) quickly got to work creating an affiliation field in the dataset submission form that calls the ROR API. When an author starts typing an affiliation, the field lookup searches for a matching name in ROR and shows the author a dropdown list of possible matches to choose from.</p>
<p><a href="https://gyazo.com/9a7093ec2d13a10b6d8b53853e02167e"><img src="https://i.gyazo.com/9a7093ec2d13a10b6d8b53853e02167e.gif" alt="Image from Gyazo"></a></p>
<p>This will work regardless of whether the author starts entering a known abbreviation or the full name of the organization, as shown below.</p>
<p><a href="https://gyazo.com/ccd2c12e07263f66929a7a555d562a9a"><img src="https://i.gyazo.com/ccd2c12e07263f66929a7a555d562a9a.gif" alt="Image from Gyazo"></a></p>
<p>The author chooses the match and proceeds with the submission. The ROR ID is stored in the database - the author doesn&rsquo;t even have to know it exists!</p>
<p>At this point you are probably curious about a few things: Can users override the matching and type whatever they want? What happens if a user&rsquo;s affiliation is not found in the lookup? And how easy is it to implement this super-cool functionality in my platform?</p>
<p>We&rsquo;ll address these questions in order:</p>
<p>Can users override the matching? Yes, the system will not prevent them from typing in an affiliation instead of choosing from the list. This is necessary to ensure a smooth submission process and also to allow for rare cases in which the user&rsquo;s affiliation is not easily found in the lookup. In both of these situations, this is where Dryad&rsquo;s curation workflow comes into play. A team of curators who go through each data submission will note if the affiliation is not a ROR ID, alter it if there is an existing one, or flag it for the ROR team to investigate and add to their corpus.</p>
<p>Now, how easy is it to implement this functionality in other systems? You can do it right now! Dryad&rsquo;s code base is open-source and the team is happy to walk you through the implementation of ROR look up and autofill. To discuss the implementation you can <a href="mailto:dlowenberg@datadryad.org">get in touch here</a>.</p>
<p>DataCite&rsquo;s DOI registration system, known as Fabrica, <a href="https://blog.datacite.org/doi-fabrica-form-gets-a-facelift/">already includes a similar lookup</a> so this is a useful implementation to reference as an example as well.</p>
<p>With the ROR affiliation lookup implemented in Dryad, the future looks bright when it comes to the challenge of identifying research outputs by institution, as every new dataset submitted to Dryad will be associated with a ROR ID. But what about the datasets that are already in Dryad? As you&rsquo;ll recall from the beginning of our story, affiliation details were not previously collected in Dryad at the time these datasets were submitted. This gap represents the work of approximately 90,000 researchers over the past ten years. The Dryad team wanted to ensure that these datasets had ROR IDs as well, so they teamed up with <a href="https://metadatagamechangers.com/">Ted Habermann</a> (Metadata Game Changers) to identify those missing affiliations. By pulling from open APIs (Crossref, PLOS, Unpaywall, etc) and manually looking up affiliations from related articles, Ted is transforming a corpus of raw affiliations into standardized ROR IDs. Though it is a cumbersome project, this will ultimately allow for Dryad to have an entire database of ROR IDs for all past <em>and</em> future authors publishing their data.</p>
<p>The Dryad-ROR collaboration shows the promise and power of implementing organization IDs in publishing platforms to enable better tracking and discovery of research outputs by institution. We&rsquo;re excited about this use of ROR and eager to see other platform providers pursue similar implementations in the coming months. Feel free to get in touch with your ideas and questions!</p>
<p><em>Maria Gould and Daniella Lowenberg are Research Data Specialists at the University of California Curation Center, California Digital Library. Maria is project lead for ROR and Daniella is product manager for Dryad.</em></p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">How many datasets have been published in Dryad from researchers at the University of California? This question is surprisingly complicated. A short answer might be, we don&amp;rsquo;t know!</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">ROR Development Update: Building and Growing an Open Registry of Organizations</title><id>https://doi.org/10.71938/60v1-bk26</id><link rel="alternate" href="https://ror.org/blog/2019-07-02-ror-development-update/"/><published>2019-07-02T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>ROR is an open registry for every research organization in the world, aiming to solve the problem of identifying which organizations are affiliated with which research outputs. <a href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/">When the ROR MVR (minimum viable registry) launched in January</a>, the registry included records for 91,000+ organizations, each with a unique ROR ID.</p>
<p>Since getting the MVR up and running, ROR development updates have been focused on enhancing the technical implementation of the registry. We have now released a shiny new codebase completely rewritten in Python. As before, the code is completely open source and available via <a href="http://www.github.com/ror-community">Github</a>.</p>
<p>A key aspect to this recent update is a new mechanism to synchronize ROR with the GRID database, which provided the seed data for the 91,000+ records that populated the original ROR MVR. This means that we can incorporate updated record metadata from GRID without overwriting existing ROR IDs. For new entries added to GRID, we can automatically create a corresponding record in ROR and assign a new ROR ID. ROR records will continue to include crosswalks to GRID IDs, and GRID will implement a similar mapping process in the coming months. GRID and ROR will ultimately diverge, but these crosswalks will ensure ongoing interoperability.</p>
<p>With this new sync process in place, the ROR registry has now grown to 96,000+ organizations! This reflects the latest version of the GRID database <a href="https://grid.ac/downloads">released in May</a>. We&rsquo;re storing the output in a Github repo, and we&rsquo;ll be making a public data dump available soon.</p>
<p>We have also implemented some other technical adjustments to make ROR data more usable and useful, such as refining search functionality, making error messages consistent, and fixing bugs related to paging and filtering. There&rsquo;s a lot of other work we&rsquo;re excited to tackle next, such as adding additional metadata to ROR records and matching affiliation strings to ROR IDs.</p>
<p>ROR is a community project, and we welcome all sorts of collaborators. We encourage anyone and everyone to use and contribute to the code. And let us know what you think!</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">ROR is an open registry for every research organization in the world, aiming to solve the problem of identifying which organizations are affiliated with which research outputs.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">A ROR-ing Recap from PIDapalooza</title><id>https://doi.org/10.71938/r8s7-tf04</id><link rel="alternate" href="https://ror.org/blog/2019-02-12-ror-recap-from-pidapalooza/"/><published>2019-02-12T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Event Recaps"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>In the days following the <a href="/blog/2019-02-10-announcing-first-ror-prototype">ROR community meeting in Dublin</a>, we had a chance to spread the word about ROR in presentations at <a href="https://pidapalooza.org/">PIDapalooza</a>, the annual festival for persistent identifiers. Members of the ROR project team led an <a href="https://pidapalooza19.sched.com/event/JaVO/lets-ror-together">interactive session</a> that included an affiliation-matching game to demonstrate the messiness of identifying and aligning metadata about an institution, and discuss how ROR IDs can address these challenges. The session culminated with participants looking up the ROR IDs for their own institutions and drawing them on their very own lion masks, reflecting the twin themes of uniqueness and community that are so central to ROR&rsquo;s aims.
<figure><img src="/img/banners/ror-matching-game-collage.png"/>
</figure>
</p>
<p>In another session, California Digital Library&rsquo;s Maria Gould and Daniella Lowenberg spoke about <a href="https://zenodo.org/record/2548914#.XFiNCM9KjOQ">integrating ROR IDs into data publishing platforms</a> like Dryad to capture affiliation information for datasets and fill a crucial missing piece in the tracking and reporting of institutional research outputs.</p>
<p>We also heard ROR referenced in several other sessions during the course of the conference, from Carolyn Grant&rsquo;s &ldquo;<a href="https://zenodo.org/record/2548887#.XFiSNM9KjOQ">Implementation of Organizational IDs in NASA&rsquo;s ADS Abstract Service</a>&rdquo; to Arthur Smith&rsquo;s &ldquo;<a href="https://zenodo.org/record/2547997#.XFn1Ic9KjOQ">Who Moved My Institution: A PID Integration Story</a>&rdquo; to the British Library&rsquo;s &ldquo;<a href="https://zenodo.org/record/2549228#.XFiSJM9KjOQ">From Standard to Community Resource: A View on ISNIs and ORG IDs</a>.&rdquo;</p>
<p>Mentions of ROR at PIDapalooza extended to the virtual sphere as well, with a number of enthusiastic reactions on Twitter:</p>
<figure class="blog-figure"><img src="/img/leigh-dodds-tweet-2019.png"
         alt="Leigh Dodds tweet reading Great to see ResearchOrgs poster at PIDapalooza19. I helped do some of the early technical exploration so very pleased to see it moving forward. Collaboratively maintained data infrastructure #opendata - Leigh Dodds (@dodds) January 23, 2019"/>
</figure>

<figure class="blog-figure"><img src="/img/nicole-kearney-tweet-2019.png"
         alt="Nicole Kearney tweet reading As someone who registers DOIs via Crossref for Museums Victoria, I&#39;m so very excited about this! Three cheers for open persistent identifiers!"/>
</figure>

<p>A common theme in PIDapalooza sessions and in surrounding discussions was the power and value of interconnecting PIDs and the importance of ensuring that new identifiers like ROR map to other identifiers in use across systems and platforms. To which we say, we concur! The PIDapalooza community is working on many important and inspiring projects and we&rsquo;re excited to be in this space together.</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">In the days following the ROR community meeting in Dublin, we had a chance to spread the word about ROR in presentations at PIDapalooza, the annual festival for persistent identifiers.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Hear us ROR! Announcing our First Prototype and Next Steps</title><id>https://doi.org/10.71938/0eh7-xg96</id><link rel="alternate" href="https://ror.org/blog/2019-02-10-announcing-first-ror-prototype/"/><published>2019-02-10T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>Maria Gould</name><uri>https://orcid.org/0000-0002-2916-3423</uri></author><category term="Technical News"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>What has hundreds of heads, 91,000 affiliations, and roars like a lion? If you guessed the Research Organization Registry community, you&rsquo;d be absolutely right!</p>
<p>Last month was a big and busy one for the ROR project team: we released a working API and search interface for the registry, we held our first ROR community meeting, and we showcased the initial prototypes at PIDapalooza in Dublin.</p>
<p>We&rsquo;re energized by the positive reception and response we&rsquo;ve received and we wanted to take a moment to share information with the community. Here are the links to our latest work, a recap of everything that happened in Dublin, some of the next steps for the project, and how the community can continue to be involved.</p>


<h2 id="-ta-da-the-first-ror-prototype">🎉 Ta da! The first ROR prototype 
</h2>
<p>The Research Organization Registry (ROR) is finally here! We&rsquo;re thrilled to officially announce the launch of our ROR MVR (minimum viable registry). The MVR consists of the following components, which are ready for anyone to use right now.</p>
<ul>
<li>
<p><strong>ROR IDs:</strong> Starting with seed data from <a href="https://www.grid.ac/">GRID</a>, ROR has begun assigning unique identifiers to approximately 91,000 organizations in its registry. ROR IDs include a random, unique, and opaque 9-character string and are expressed as URLs that resolve to the organization&rsquo;s record. For instance, here is the ROR ID for California Digital Library: <a href="https://ror.org/03yrm5c26">https://ror.org/03yrm5c26</a></p>
</li>
<li>
<p><strong>Search:</strong> We also built a search interface to look up organizations in the registry: <a href="https://ror.org/search">https://ror.org/search</a>.</p>
</li>
</ul>
<p><img src="https://lh5.googleusercontent.com/UQfE-D0oO6TNUdWPapf3LT-hj6v5l9NdD4LzGDR_A_ZPSKjvTKOlS9LsiTSVEgh_ia--yAbVWBukOHVmucYEymzxPvpAhp15zv1R0bYcQy_OArLAeiasDaIlPXaunVhPbU_Ebrg8" alt=""></p>
<ul>
<li><strong>ROR records:</strong> ROR IDs are stored with additional metadata about the organization, such as alternate names/abbreviations, external URLs (e.g., an organization&rsquo;s official website), and other identifiers, such as Wikidata, ISNI, and the Open Funder Registry. This metadata will allow ROR to be interoperable with other identifiers and across different systems. The current schema is based on GRID&rsquo;s dataset and we plan to incorporate other metadata fields over time and according to community needs.</li>
</ul>
<p><img src="https://lh6.googleusercontent.com/0e54ZDo4MMbXFcwFCjFR27ZC7c1EmqAiybwEV12a4wLSvQNbIIyMeIdKyBJNk2SQLYPXNsLXMmDoUozf4fHSF7Qjlhvq1UtnP_poFPPkdavmd9YQaTN5JvJ9zL_9lVPdVyU83l1M" alt=""></p>
<ul>
<li>
<p><strong>API:</strong> The ROR API is now public. You can access the JSON files at <a href="https://api.ror.org/organizations">https://api.ror.org/organizations</a>.</p>
</li>
<li>
<p><strong>OpenRefine reconciler:</strong> We&rsquo;ve released an OpenRefine reconciler that can map your internal identifiers to ROR identifiers: <a href="https://github.com/ror-community/ror-reconciler">https://github.com/ror-community/ror-reconciler</a>.</p>
</li>
<li>
<p><strong>Documentation:</strong> We have begun storing documentation on Github and will be adding more as we go along. Please feel free to follow and contribute:  <a href="https://github.com/ror-community/ror-reconciler">https://github.com/ror-community</a>.</p>
</li>
</ul>


<h2 id="community-meeting-recap">Community meeting recap 
</h2>
<p>On January 22, 60+ representatives from across the research and publishing community gathered in Dublin to see what the ROR project team has been up to, demo the first prototypes in action, and discuss where we want to go next - and, of course, to practice ROR-ing together.</p>
<p><img src="https://lh4.googleusercontent.com/vjpqcKxVChphTaTD6bryQ9HJ68hZp5tmLJhctsW9qOznAZfAODAlI85CmEGATyU7A9srq-QPPaYxOxBzgHAtiuSYvTISFiHcxuZqK5URrhCyH7nEQnCFqhXYyll5e6joYJJTJw0A" alt=""></p>
<p>In the second half of the meeting, attendees split into discussion groups to identify specific aspirations for ROR and brainstorm concrete actions needed to achieve these goals, focusing on the main use case of exposing and capturing all research outputs of a given institution. The proposed ideas covered a spectrum of possibilities for ROR, highlighting the following themes:</p>


<h3 id="ror-as-seamlessly-integrated-and-sometimes-invisible-infrastructure">ROR as seamlessly-integrated and sometimes invisible infrastructure 
</h3>
<ul>
<li>
<p>Integration between and within existing systems (and in new ones!)</p>
</li>
<li>
<p>Auto-detection of ROR IDs for example in manuscript tracking and funding application platforms</p>
</li>
<li>
<p>As such, researchers don&rsquo;t ever have to be responsible for knowing what a ROR is and using it appropriately - the systems they use will do this for them.</p>
</li>
</ul>


<h3 id="ror-as-a-critical-piece-of-funder-workflows-and-infrastructure">ROR as a critical piece of funder workflows and infrastructure 
</h3>
<ul>
<li>
<p>Demonstrate to funders how ROR can help them analyze impact of research they fund</p>
</li>
<li>
<p>Conduct outreach with key international funders, especially those interested in open infrastructure</p>
</li>
<li>
<p>Make funders aware of ROR and encourage them to adopt and mandate use of ROR IDs - involve funders at the beginning to collaborate on technology</p>
</li>
<li>
<p>Integrate ROR with existing systems and identifiers already in use by funders and other stakeholders</p>
</li>
</ul>


<h3 id="ror-as-a-trusted-registry-collaborative-partner-and-responsible-steward">ROR as a trusted registry, collaborative partner, and responsible steward 
</h3>
<ul>
<li>
<p>Culturally sensitive, inclusive, and respectful of what countries are already doing with regard to organizational identifiers, partnering with national bodies working on this and mapping ROR IDs to locally used identifiers.</p>
</li>
<li>
<p>Involve the institutions listed in the registry early on as well as CRIS systems</p>
</li>
<li>
<p>Interoperability with existing communities and governance bodies</p>
</li>
<li>
<p>Workflows to support trust and responsible management of organizational metadata, with policies and procedures for long-term curation and maintenance of records</p>
</li>
</ul>


<h2 id="what-were-hearing">What we&rsquo;re hearing 
</h2>
<p>Now that the ROR MVR is here, we&rsquo;re hearing some really good questions about the data we&rsquo;re capturing, how it can be used, and how we&rsquo;ll be maintaining the registry going forward. We wanted to take a moment to respond to some of these questions.</p>


<h3 id="what-is-the-criteria-for-being-listed-in-ror-what-is-a-research-organization">What is the criteria for being listed in ROR? What is a &ldquo;research organization&rdquo;? 
</h3>
<p>We define the notion of &ldquo;research organization&rdquo; quite broadly as any organization that conducts, produces, manages, or touches research. This is in line with ROR&rsquo;s stated scope, which is to address the affiliation use case and be able to identify which organizations are associated with which research outputs. We use &ldquo;affiliation&rdquo; to describe any formal relationship between a researcher and an organization associated with researchers, including but not limited to their employer, educator, funder, or scholarly society.</p>


<h3 id="will-ror-map-organizational-hierarchies">Will ROR map organizational hierarchies? 
</h3>
<p>No - ROR is focused on being a top-level registry of organizations so we can address the fundamental affiliation use case, and provide a critical source of metadata that can interoperate with other institutional identifiers.</p>


<h3 id="ror-ids-are-cool---what-can-i-do-with-them">ROR IDs are cool - what can I do with them? 
</h3>
<p>Now that we have built our MVR, we will be working to incorporate ROR IDs into relevant pieces of the scholarly communication infrastructure. If you are a publisher, funder, metadata provider, research office, or anyone else interested in capturing affiliations, please get in touch with us to discuss how we might coordinate. If you are a developer, you are welcome to start playing around with the API: <a href="https://api.ror.org/organizations">https://api.ror.org/organizations</a>.</p>


<h3 id="theres-an-error-in-my-organizations-ror-record-----can-you-fix-it">There&rsquo;s an error in my organization&rsquo;s ROR record &mdash; can you fix it? 
</h3>
<p>For the time being, please email <a href="mailto:info@ror.org">info@ror.org</a> to request an update to an existing record in ROR or request that a new record be added. We will formalize our data management policies and procedures in the next stage of the project.</p>


<h3 id="what-is-rors-relationship-to-other-organizational-identifiers">What is ROR&rsquo;s relationship to other organizational identifiers? 
</h3>
<p>For ROR to be useful, it needs to augment the current offerings in a way that is open, trusted, complementary, and collaborative, and not intentionally competitive. We are committed to providing a service that the community finds helpful and not duplicative, and enables as many connections as possible between organization records across systems.</p>


<h3 id="i-have-my-own-dataset-of-institutional-affiliations-----can-i-give-it-to-ror">I have my own dataset of institutional affiliations &mdash; can I give it to ROR? 
</h3>
<p>We are always happy to hear about other efforts to capture affiliation data. Please get in touch with us to discuss how we might coordinate.</p>


<h3 id="can-ror-support-multiple-languages-and-character-sets">Can ROR support multiple languages and character sets? 
</h3>
<p>GRID already supports multiple languages and character sets, so by extension ROR will have this enabled as well. Here is one example: <a href="https://ror.org/01k4yrm29">https://ror.org/01k4yrm29</a>.</p>


<h3 id="how-will-ror-handle-curation-ie-updating-records-if-an-organization-changes-its-name-or-ceases-to-exist">How will ROR handle curation, i.e., updating records if an organization changes its name or ceases to exist? 
</h3>
<p>The curation and long-term management of records will be a cornerstone of our efforts in 2019 and we hope to release a working set of policies and procedures soon.</p>


<h2 id="whats-next-for-ror">What&rsquo;s next for ROR 
</h2>
<p>Now that we have our MVR, what happens next for ROR? We&rsquo;re eager to sustain the momentum from January&rsquo;s stakeholder meeting at the same time we know there are some longer-term plans to put in place, and so we&rsquo;re looking at both some immediate tasks as well as bigger-picture questions.</p>


<h3 id="product-development">Product development 
</h3>
<p>We have a few to-do items on our list following the launch of the MVR to keep everything running smoothly while we develop a comprehensive long-term product roadmap.</p>
<ul>
<li>
<p>Rewrite some of the code for both the API and the OpenRefine reconciler</p>
</li>
<li>
<p>Address a few bugs in our repos</p>
</li>
<li>
<p>Provide guidance for troubleshooting issues</p>
</li>
<li>
<p>Communicate our processes for users to request changes, report bugs, and suggest features</p>
</li>
</ul>
<p>As a reminder, you can access the existing code in Github: <a href="https://github.com/ror-community">https://github.com/ror-community</a></p>


<h3 id="policy-development">Policy development 
</h3>
<p>We&rsquo;ve been emphasizing here and in community conversations that our primary focus now turns to formulating policies and procedures to ensure the successful management of ROR data over the long term. This is something we can&rsquo;t (and shouldn&rsquo;t) do on our own &mdash; we want to work with community stakeholders to develop the right solutions and establish the right frameworks. We understand the urgency of firming up these policies, but we are also aware that something this important can take time to complete and is not something to rush into lightly.</p>


<h3 id="community-development">Community development 
</h3>
<p>To help guide the next stages of the project, we are putting out an open call for participation in the ROR community advisory group. Advisory group members will be involved in giving input on data management, testing out new features, giving feedback on the product roadmap, and discussing ideas for events and outreach. We plan to convene this advisory group through bimonthly calls and asynchronous communication channels through the end of the year. We hope you will consider joining us! Please email <a href="mailto:info@ror.org">info@ror.org</a> if you are interested.</p>
<p>For those who want to stay informed about the project but not necessarily be part of the advisory group, you have other options!</p>
<ul>
<li>
<p>Sign up for our mailing list (via the footer at <a href="https://ror.org">ror.org</a>)</p>
</li>
<li>
<p>Join our community on Slack (<a href="http://www.tinyurl.com/ror-community">www.tinyurl.com/ror-community</a>),</p>
</li>
<li>
<p>Follow us on Twitter (<a href="https://twitter.com/ResearchOrgs">@ResearchOrgs</a>).</p>
</li>
</ul>
<p>You can also always drop us a line at <a href="mailto:info@ror.org">info@ror.org</a>, and let us know if you&rsquo;d ever like to set up a meeting or conference call to talk about the project in more detail.</p>


<h2 id="final-thoughts">Final thoughts 
</h2>
<p>Community engagement has been vital to ROR&rsquo;s beginnings and will likewise be critically important for the next steps that we take. As both a registry of identifiers and a community of stakeholders involved in building open scholarly infrastructure, ROR depends on guidance and involvement at multiple levels. Thank you for being part of the journey thus far, and for joining us on the road that lies ahead. 🦁</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">What has hundreds of heads, 91,000 affiliations, and roars like a lion? If you guessed the Research Organization Registry community, you&amp;rsquo;d be absolutely right!</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">The ROR of the Crowd: Get Involved!</title><id>https://doi.org/10.71938/sna1-zc49</id><link rel="alternate" href="https://ror.org/blog/2018-12-02-the-ror-of-the-crowd/"/><published>2018-12-02T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>California Digital Library</name><uri>https://ror.org/03yrm5c26</uri></author><author><name>DataCite</name><uri>https://ror.org/04wxnsj81</uri></author><author><name>Crossref</name><uri>https://ror.org/02twcfp32</uri></author><author><name>Digital Science</name><uri>https://ror.org/02ktfc112</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Earlier this year, the Org ID Working Group <a href="/blog/2018-08-02-working-group-recap">wrapped up</a> their work. There was a lot of talk about governance, with options discussed for creating an entirely new independent organization; and/or having a looser group of stakeholders. After several months of discussion following the January stakeholder meeting in Girona, Spain, there was still no easy answer to the governance question. It&rsquo;s especially tough when getting down to details like timelines, hosting, staffing, and of course, funding. Our organizations each had to question whether we are truly capable of taking a leading role and really commit, having the will and agility to implement the outcomes of the Working Group.</p>
<p>Fortunately, the Product recommendations were resoundingly conclusive: there is a need for a single top-level registry of truly open identifiers for organizations that conduct research. No department level, no ambiguity. This will allow the more granular options out there to interoperate, and fulfill the core use case long missing from research communications:</p>
<blockquote>
<p>What organizations are affiliated with what research outputs?</p>
</blockquote>
<p>Therefore, more recently, four of the original stakeholders decided to take a practical approach, to grasp the mantle and get going with a pilot. We set up this website, calling &lsquo;Org ID&rsquo; the Research Organization Registry (ROR), and are now discussing technical plans to start building something useful to pilot. We haven’t given up on the governance question. We&mdash;or rather, you, the community&mdash;can help evolve the governance models as needed. But, for now, we will start to execute on our shared vision!</p>


<h3 id="an-agile-startup-phase">An agile startup phase 
</h3>
<p>California Digital Library (CDL), Crossref, DataCite, and Digital Science have agreed on the following approach:</p>
<ul>
<li>The four lead organizations will put in resources for a &ldquo;start-up phase&rdquo;</li>
<li>We will leverage the PID-focused Product Manager at CDL <a href="https://www.cdlib.org/cdlinfo/2018/11/14/maria-gould-joins-cdl-as-uc3-product-manager">Maria Gould</a> to act as a central ROR Project Coordinator and Product Manager</li>
<li>Each head of the four organizations will act as a small and nimble steering group with an aim to expand to better reflect the community over time</li>
<li>We will use the GRID data&mdash;which is licensed CC0&mdash;as a basis for a ROR &lsquo;minimal viable registry&rsquo;.</li>
<li>We will see ourselves as custodians for now; ensuring we are committed to continued community involvement (see below)!</li>
</ul>


<h3 id="are-you-ready-to-ror">Are you ready to ROR? 
</h3>
<p>Did you know that lions roar to communicate between members of the same pride or coalition? What is not often appreciated, is how unbelievably accurately lions can pinpoint exactly where another roar is coming from, and identify exactly what type of lion is calling on the other end.</p>
<p>There were 17 organizations previously in the working group and now we are calling for any and all stakeholders to sign up via this website, to get involved with ROR in some capacity. We are planning to set up a community advisory group to input on a ROR minimal viable registry, and feed into developing a more appropriate governance structure. While our four organizations have stepped up to &lsquo;steer&rsquo;, we know that there are numerous universities, funding bodies, and others, who are keen to put some time in to see this exciting vision become a reality.</p>


<h3 id="see-you-in-dublin">See you in Dublin? 
</h3>
<p>We’d like to <a href="https://ror-dublin.eventbrite.com">invite you to our first ROR stakeholder meeting</a>, which will take place in Dublin on the 22nd of January&mdash;prior to PIDapalooza&mdash;from 3-5pm.</p>
<p>Let’s ROR together and make an open organization identifier a reality! 🦁</p>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">Earlier this year, the Org ID Working Group wrapped up their work. There was a lot of talk about governance, with options discussed for creating an entirely new independent organization; and/or having a looser group of stakeholders.</summary></entry><entry xml:lang="en-us"><title xml:lang="en-us">Org ID: A Recap and a Hint of Things to Come</title><id>https://doi.org/10.71938/ftt9-w955</id><link rel="alternate" href="https://ror.org/blog/2018-08-02-working-group-recap/"/><link rel="related" href="https://doi.org/10.5438/67sj-4y05"/><published>2018-08-02T00:00:00+00:00</published><updated>2025-10-19T18:10:13-04:00</updated><author><name>John Chodacki</name><uri>https://orcid.org/0000-0002-7378-2408</uri></author><category term="General Updates"/><content type="html" xml:base="https://ror.org/" xml:lang="en-us">
<![CDATA[<p>Over the past couple of years, a group of organizations with a shared purpose&mdash;California Digital Library, Crossref, DataCite, and ORCID&mdash;invested our time and energy into launching the Org ID initiative, with the goal of defining requirements for an open, community-led organization identifier registry.  The goal of our initiative has been to offer a transparent, accessible process that builds a better system for all of our communities. As the working group chair, I wanted to provide an update on this initiative and let you know where our efforts are headed.</p>


<h3 id="community-led-effort">Community-led effort 
</h3>
<p>First, I would like to summarize all of the work that has gone into this project, a truly community-driven initiative, over the last two years:</p>
<ul>
<li>A series of collaborative workshops were held at the Coalition for Networked Information
(CNI) meeting in San Antonio TX (2016), the FORCE11 conference in Portland OR (2016), and at PIDapalooza in Reykjavik (2016).</li>
<li>Findings from these workshops were summarized in three documents, which we made openly
available to the community for public comment:</li>
<li>Organization Identifier Project: A Way Forward (<a href="https://doi.org/10.5438/2906">PDF</a>)</li>
<li>Organization Identifier Provider Landscape (<a href="https://doi.org/10.5438/4716">PDF</a>)</li>
<li>Technical Considerations for an Organization Identifier Registry ([PDF]
(<a href="https://doi.org/10.5438/7885">https://doi.org/10.5438/7885</a>))</li>
<li>A <a href="https://orcid.org/content/organization-identifier-working-group">Working Group</a> worked
throughout 2017 and voted to approve a set of recommendations and principles for &lsquo;governance&rsquo; and &lsquo;product&rsquo;:</li>
<li><a href="https://figshare.com/articles/ORG_ID_WG_Governance_Principles_and_Recommendations/5402002/1">Governance Recommendations</a></li>
<li><a href="https://figshare.com/articles/ORG_ID_WG_Product_Principles_and_Recommendations/5402047/1">Product Principles and Recommendations</a></li>
<li>We then put out a <a href="https://doi.org/10.23640/07243.5458162.v1">Request for Information</a>
that sought expressions of interest from organizations to be involved in implementing and running an organization identifier registry.</li>
<li>There was a really good response to the RFI; reviewing the responses and thinking about
next steps led to our most recent <a href="https://orcid.org/content/2018-org-id-meeting">stakeholder meeting in Girona</a> in January 2018, where ORCID, DataCite, and Crossref were tasked with drafting a proposal that meets the Working Group&rsquo;s requirements for a community-led, organizational identifier registry.</li>
</ul>


<h3 id="thank-you">Thank you 
</h3>
<p>I want to take this opportunity to thank everyone who has contributed to this effort so far.  We&rsquo;ve been able to make good progress with the initiative because of the time and expertise many of you have volunteered. We have truly benefited from the support of the community, with representatives from Alfred P. Sloan Foundation; American Physical Society, California Digital Library, Cornell University, Crossref, DataCite, Digital Science, Editeur, Elsevier, Foundation for Earth Sciences, Hindawi, Jisc, ORCID, Ringgold, Springer Nature, The IP Registry, and U.S. Geological Survey involved throughout this initiative.  And we couldn&rsquo;t have done any of it without the help and guidance of our consultants, Helen Szigeti and Kristen Ratan.</p>


<h3 id="the-way-forward">The way forward 
</h3>
<p>The recommendations from our initiative have been converted into a concrete plan for building a registry for research organizations.  This plan will be posted in the coming weeks.</p>
<p>The initiative&rsquo;s leadership group has already secured start-up resourcing and is getting ready to announce the launch plan&mdash;more details coming soon.  </p>
<p>We hope that all stakeholders will continue to support the next phase of our work &ndash; look for announcements in the coming weeks about how to get involved.  </p>
<p>As always, we welcome your feedback and involvement as this effort continues. Please contact me directly with any questions or comments at <a href="mailto:john.chodacki@ucop.edu">john.chodacki@ucop.edu</a>. And thanks again for your help bringing an open organization identifier registry to fruition!</p>


<h3 id="references">References 
</h3>
<ol>
<li>Bilder, G., Brown, J., &amp; Demeranville, T. (2016). Organisation identifiers: current provider survey. ORCID. <a href="https://doi.org/10.5438/4716">https://doi.org/10.5438/4716</a></li>
<li>Cruse, P., Haak, L., &amp; Pentz, E. (2016). Organization Identifier Project: A Way Forward. ORCID. <a href="https://doi.org/10.5438/2906">https://doi.org/10.5438/2906</a></li>
<li>Fenner, M., Paglione, L., Demeranville, T., &amp; Bilder, G. (2016). Technical Considerations for an Organization Identifier Registry. <a href="https://doi.org/10.5438/7885">https://doi.org/10.5438/7885</a></li>
<li>Laurel, H., Bilder, G., Brown, C., Cruse, P., Devenport, T., Fenner, M., … Smith, A. (2017). ORG ID WG Product Principles and Recommendations. <a href="https://doi.org/10.23640/07243.5402047">https://doi.org/10.23640/07243.5402047</a></li>
<li>Laurel, H., Pentz, E., Cruse, P., &amp; Chodacki, J. (2017). Organization Identifier Project: Request for Information. <a href="https://doi.org/10.23640/07243.5458162">https://doi.org/10.23640/07243.5458162</a></li>
<li>Pentz, E., Cruse, P., Laurel, H., &amp; Warner, S. (2017). ORG ID WG Governance Principles and Recommendations. <a href="https://doi.org/10.23640/07243.5402002">https://doi.org/10.23640/07243.5402002</a></li>
</ol>
]]></content><summary type="html" xml:base="https://ror.org/" xml:lang="en-us">An update on the progress of the Org ID initiative led by California Digital Library, Crossref, DataCite, and ORCID.</summary></entry></feed>