Epitope Discovery Program

New contracts awarded for Epitope Discovery

In September 2024, NIAID awarded six contracts under the Innovations in Functional B Cell Immune Epitope Discovery Program and five contracts under the T Cell Immune Epitope Discovery and Mechanisms of T Cell Protection Program.

The Epitope Discovery Program is comprised of two complementary programs: 1) Innovations in Functional B Cell Epitope Discovery, and 2) T Cell Immune Epitope Discovery and Mechanisms of T Cell Protection. Together, these programs support the following areas of research:

Discovery of novel epitopes associated with human responses to pathogens that cause chronic or acute infection, including pathogens with pandemic potential
Discovery of novel epitopes associated with human allergic responses, development of autoimmune disease, and transplant rejection
Structural characterization of epitopes: monoclonal antibody binding and MHC-peptide: TCR binding
Validation of the functional role of epitopes in human disease (i.e., participation in immune protection or pathogenesis)
Deposition of epitope data into the Immune Epitope Database and Analysis Resource (IEDB)

Additionally, NIAID funds two supporting resources that are available to the biomedical community: the Immune Epitope Database and Analysis Resource (IEDB) and the Tetramer Core Facility.

Contact Information

Email Epitope Discovery for questions or help.

NIH Tetramer Core Facility (TCF)

Reagents to detect and quantify antigen-specific T cells.

Resources

Using the NIAID Data Ecosystem Discovery Portal to Search Across Data Repositories

Data Science Dispatch | April 14, 2025

a doctor pointing to science based icons on a screen

NIAID has developed a platform to help researchers find data related to infectious and immune-mediated disease (IID) across multiple data repositories. The NIAID Data Ecosystem Discovery Portal is a centralized hub cataloging millions of datasets from over 50 sources.

Researchers can use the Discovery Portal to find data, resources, and computational tools from different repositories. This can save them time otherwise spent combing through multiple sources and help them find datasets they weren’t aware of previously.

The Discovery Portal includes resources from IID and generalist repositories. Representative resources include NIAID-sponsored repositories such as AccessClinicalData@NIAID, ImmPort, and VDJServer, as well as repositories funded outside of NIAID but relevant to IID research. Resources in the Discovery Portal include a diverse array of data types spanning multiple domains of IID research, including -omics data, clinical data, epidemiological data, pathogen-host interaction data, flow cytometry, imaging, and other experimental data.

The Discovery Portal supports NIAID objectives of maximizing the impact of scientific data, reducing duplication of efforts in research, and promoting data reuse, data transparency and compliance with data-sharing policies. The portal aligns with many of the principles of findable, accessible, interoperable, and reusable (FAIR) data practices by making data easier to find and access.

Using metadata to drive discovery

The NIAID Data Ecosystem Discovery Portal does not contain data itself. Instead, it contains detailed information about IID datasets and resources drawn from metadata. Users can then access the resources through external links.

The portal uses metadata to support several key features:

Search and Discovery: Users can rapidly search millions of datasets across both IID and generalist repositories using the Search or Advanced Search options. Metadata categories such as funding source, repository, and conditions of access help filter search results and identify relevant research data.
Metadata Compatibility: Each individual dataset in the Discovery Portal has a “metadata compatibility score,” which displays specific metadata elements collected for a given resource. Additionally, the Discovery Portal has metadata compatibility visualizations which capture the breadth of metadata at the repository level. This information can help researchers and data contributors quickly understand a repository’s metadata structure, aiding in decisions about where to deposit or retrieve resources.
Downloadable Metadata: The portal has buttons that allow users to download metadata to perform meta-analyses.

The Discovery Portal is working to fill missing or incomplete metadata fields (such as Pathogen Species, Health Condition, and Host Species) by augmenting and standardizing metadata fields to provide more of this necessary information for users.

New Program Collection tool and other features

One of the new features of the NIAID Data Ecosystem Discovery Portal is the “Program Collection” filter. These are groups of datasets contributed by specialized NIAID research programs and initiatives. The Discovery Portal displays the Program Collection filter on the search page, and current efforts are focused on expanding Program Collection data.

The Program Collection filter allows researchers to discover high-quality, program-specific data relevant to their area of interest and find collections that align with the broader objectives of NIAID’s strategic research efforts. The feature also amplifies the scientific contributions of participating networks and increases the likelihood of researchers using these datasets.

Using the Sources page of the Discovery Portal can also help researchers and data providers make informed decisions about different repositories where they can deposit their data.

The Discovery Portal is now connected to National Center for Biotechnology Information (NCBI) databases through NCBI LinkOut. When NCBI database content is linked to data described in the Portal, a link to the related Portal entry can be found on the NCBI page.

Learn more by visiting the Discovery Portal, reviewing the Getting Started page, and exploring the Knowledge Center.

Understanding Metadata: A Key to Data Sharing and Reuse

Data Science Dispatch | March 14, 2025

Metadata plays a crucial role in sharing and reusing scientific data. Understanding what metadata is and how it is used can accelerate your research and increase the visibility of your work. It can also help to advance the field of infectious and immune-mediated disease (IID) research.

What is metadata?

Metadata is data about data. It provides additional information to help people understand the data, such as its origin, structure, and context.

For example, for a genome sequence, the data is the actual sequence of nucleotides. The metadata is the author of the data, the date the data was collected, the measurement techniques used, the health condition at the focus of dataset (like asthma or autoimmune diseases), and more. You can see another example of data versus metadata in the video on the right (data management and sharing webinar from the National Institute of Diabetes and Digestive and Kidney Diseases, 4:22-6:28).

Examples of common metadata elements that describe IID research data are available at the NIAID Data Ecosystem’s list of common fundamental and recommended metadata elements.

Why is metadata important? When you share scientific data, metadata provides the context that allows others to understand, trust, reproduce, or reuse data. This is particularly important in studies or secondary analyses where data is integrated from multiple sources; comprehensive metadata enables a scientist to combine data from different sources.

Using metadata effectively can also help your data get discovered, reused, and cited—thereby maximizing the value and impact of your research.

Collecting rich metadata during research

Effective metadata use starts with collecting rich metadata throughout the research process. “Rich” metadata is detailed and structured, making it easier for people to quickly learn about your data.

Including standardized formats and schemas makes it clear which metadata components are present and where they can be found. Using common terminologies, ontologies, and data formats takes this a step further by defining specific metadata elements for both people and computers. Machine-readable metadata allows users to learn about and use data using code, helping them quickly learn about many data files.

Some common examples of collecting metadata in a structured way include defining standardized date and time formats and using ORCID IDs for authors to ensure precise identification.

Biomedical researchers can follow some basic steps to ensure that they are collecting comprehensive and standardized metadata.

1. Determine necessary metadata content and formats

Collecting data in the format you intend to share it in is more efficient than reformatting everything at the end. Here are some questions to help you determine data and metadata formats:

Who will use these data and how will they use it? What information do they need to understand the data?
Many research areas have standardized metadata formats that researchers can follow. What metadata standards or schemas do other researchers in your field use? Would using these standards and schemas help researchers understand and reuse these data?
Does the target repository or scientific journal have any specific metadata or formatting requirements? If the repository where you plan to share your data has specific guidance, follow that guidance from the start of your research.

2. Create metadata throughout the data lifecycle

Before data collection, collect protocol documentation and set up systems for data and metadata collection. These systems can collect information using the standards, formats, vocabularies, and ontologies selected, and will save you time when preparing data and metadata for publication.

During the data collection phase, document anything that fits into the target metadata fields. These may include the dates data was collected, variables measured, the units of measurement, the instruments used, and the conditions under which the data was collected.

After data collection, add any remaining metadata elements from your plan. These elements may focus more on describing data processing steps, versioning, authors, or related topics.

3. Prepare to share data and metadata

Verify that metadata meets requirements for where you would like to share your data, and add any elements that you may finalize late in the data lifecycle, like associated publications, license for reuse, or a data author list prior to sharing.

Throughout the process, you can seek guidance from your program officer or the repositories where you intend to share data to ensure that metadata is collected and shared effectively.

Sharing data and metadata

The NIH Data Management and Sharing Policy encourages sharing metadata that describes or supports your scientific data. NIH recommends data management and sharing practices consistent with the FAIR (Findable, Accessible, Interoperable, and Reusable) data principles, and it strongly encourages the use of established data repositories for preserving and sharing data.

In some instances, the full scientific data cannot be shared easily. This may be due to large file sizes — particularly with imaging-related research — or data privacy regulations. However, even if the actual scientific data cannot be shared, sharing metadata is still valuable. This practice ensures that there is a public record of the data's existence and provides important background information that can be used by other researchers.

Metadata is also a powerful tool for finding scientific data in repositories. Researchers can use metadata to search for data sets that match specific criteria. One tool that can help researchers find relevant data is the NIAID Data Ecosystem Discovery Portal, which uses metadata present in data stored in repositories to search across over 50 different IID repositories and data sources.

Learn more about developing a data management and sharing plan and compliance with relevant NIH data sharing policies by reviewing the Data Policy and Guidance page.

Study of the ITK Inhibitor Soquelitinib to Reduce Lymphoproliferation and Improve Cytopenias in Autoimmune Lymphoproliferative Syndrome (ALPS)-FAS Patients

Autoimmune lymphoproliferative syndrome (ALPS) is a rare disorder of the immune system caused by a mutation in the FAS gene. The objective of this study is to determine the efficacy of soquelitinib in reducing spleen volume or target lymph node volume in people with ALPS-FAS.

Contact Information

Office/Contact: Alanvin Orpia, B.S.N.
Phone: 240-669-2935
Email: alanvin.orpia@nih.gov

Gene Editing Approach Paves the Way to First-in-Human Clinical Trial for Rare Genetic Disease

Link Type

Funded-research

Media Type

Article

Publish or Event Date

Thu, 10/10/2024 - 12:00pm

Link URL

https://www.massgeneral.org/news/research-spotlight/gene-editing-approach-for-rare-genetic-disease

Research Institution

Massachusetts General Hospital

Short Title

Gene Editing Approach Paves the Way to First-in-Human Clinical Trial for Rare Genetic Disease

Read more about Gene Editing Approach Paves the Way to First-in-Human Clinical Trial for Rare Genetic Disease

Content Coordinator

Claudia Wair

Content Manager

Jennifer Routh

Unique Immune Response in Lupus Paves Way for New Treatment

Link Type

Funded-research

Media Type

Article

Publish or Event Date

Tue, 10/15/2024 - 12:00pm

Link URL

https://news.weill.cornell.edu/news/2024/10/unique-immune-response-in-lupus-paves-way-for-new-treatment

Research Institution

Weill Cornell Medicine

Short Title

Unique Immune Response in Lupus Paves Way for New Treatment

Read more about Unique Immune Response in Lupus Paves Way for New Treatment

Content Coordinator

Claudia Wair

Content Manager

Jennifer Routh

The Type 1 Diabetes T Cell Receptor and B Cell Receptor Repository in the AIRR Data Commons: a practical guide for access, use and contributions through the Type 1 Diabetes AIRR Consortium

The authors introduce the Type 1 Diabetes Adaptive Immune Receptor Repertoire (AIRR) Consortium goals and outline methods to use and deposit data to this comprehensive repository. The repository’s goal is to facilitate research community access to rich, carefully annotated immune AIRR datasets to enable new scientific inquiry and insight into the natural history and pathogenesis of type 1 diabetes.

Link Type

Data-Sci-Publications

Publish or Event Date

Tue, 10/29/2024 - 12:00pm

Link URL

https://link.springer.com/article/10.1007/s00125-024-06298-y

Short Title

The Type 1 Diabetes T Cell Receptor and B Cell Receptor

Read more about The Type 1 Diabetes T Cell Receptor and B Cell Receptor Repository in the AIRR Data Commons: a practical guide for access, use and contributions through the Type 1 Diabetes AIRR Consortium

Content Coordinator

Lisa Mayer

Content Manager

Reed Shabman

Publication Source

Diabetologia

A Phase I Study of Mozobil in the Treatment of Patients With WHIMS

The purpose of this study is to evaluate whether Mozobil is safe and effective to treat neutropenia (low white blood cell count) in patients with WHIMS and to determine an appropriate treatment dose of Mozobil, within currently approved dosage levels.

Contact Information

Office/Contact: For more information at the NIH Clinical Center contact Office of Patient Recruitment (OPR)
Phone: 800-411-1222
TTY: TTY dial 711
Email: ccopr@nih.gov

Targeting Residual Activity By Precision, Biomarker-Guided Combination Therapies of Multiple Sclerosis (TRAP-MS)

The purpose of this study is to see if signs of inflammation in CSF help predict a persons response to different drugs.

Contact Information

Office/Contact: For more information at the NIH Clinical Center contact Office of Patient Recruitment (OPR)
Phone: 800-411-1222
TTY: TTY8664111010
Email: prpl@cc.nih.gov

An Open-Label, Proof of Consent Study of Vorinostat for the Treatment of Mdoerate-to-Severe Crohn's Disease and Maintenance Therapy With Ustekinumab

The purpose of this study is to see if vorinostat is safe for people with moderate-to-severe CD and to see if it is safe for people with moderate-to-sever CD to receive maintenance therapy using Ustekinumab after successful treatment of Vorinostat.

Contact Information

Office/Contact: For more information at the NIH Clinical Center contact Office of Patient Recruitment (OPR)
Phone: 800-411-1222
TTY: TTY8664111010
Email: prpl@cc.nih.gov

Subscribe to Immunologic Diseases

For Researchers

Research Areas

Featured Disciplines & Approaches

Featured Diseases & Conditions

Latest News Releases

Who We Are

What We Do

Epitope Discovery Program

New contracts awarded for Epitope Discovery

Contact Information

Resources

Using the NIAID Data Ecosystem Discovery Portal to Search Across Data Repositories

Using metadata to drive discovery

New Program Collection tool and other features

Understanding Metadata: A Key to Data Sharing and Reuse

What is metadata?

Collecting rich metadata during research

1. Determine necessary metadata content and formats

2. Create metadata throughout the data lifecycle

3. Prepare to share data and metadata

Sharing data and metadata

Study of the ITK Inhibitor Soquelitinib to Reduce Lymphoproliferation and Improve Cytopenias in Autoimmune Lymphoproliferative Syndrome (ALPS)-FAS Patients

Contact Information

Gene Editing Approach Paves the Way to First-in-Human Clinical Trial for Rare Genetic Disease

Unique Immune Response in Lupus Paves Way for New Treatment

The Type 1 Diabetes T Cell Receptor and B Cell Receptor Repository in the AIRR Data Commons: a practical guide for access, use and contributions through the Type 1 Diabetes AIRR Consortium

A Phase I Study of Mozobil in the Treatment of Patients With WHIMS

Contact Information

Targeting Residual Activity By Precision, Biomarker-Guided Combination Therapies of Multiple Sclerosis (TRAP-MS)

Contact Information

An Open-Label, Proof of Consent Study of Vorinostat for the Treatment of Mdoerate-to-Severe Crohn's Disease and Maintenance Therapy With Ustekinumab

Contact Information