Stockholm format

From WikiMD's medical encyclopedia

Stockholm format is a multiple sequence alignment format used by many bioinformatics and computational biology tools. It represents both the alignment itself and annotations attached to it, such as conserved regions, secondary structure predictions, and database references.

Overview

The Stockholm format is a flat-file format that is both human-readable and machine-parseable. It is distinguished by its ability to store not only the sequence alignment itself but also a rich set of annotations for each sequence and the alignment as a whole. This makes it particularly useful in the fields of genomics, proteomics, and molecular biology, where understanding the functional and structural aspects of sequences is crucial.

Format Specification

A Stockholm file consists of a series of lines. Annotation lines begin with a "#=" tag that identifies the kind of annotation, while sequence lines begin with the sequence name. The key components of a Stockholm file include:

  • Sequence data: Each line holds a sequence name followed by whitespace and the aligned sequence, with gaps usually written as "-" or ".". An alignment may be written with one line per sequence or split into interleaved blocks, in which case the same name appears once per block.
  • #=GF (Generic per-File annotation): Free-text information that applies to the alignment as a whole, such as an accession number, identifier, description, or database references.
  • #=GS (Generic per-Sequence annotation): Free-text information about a single sequence within the file, such as source organism or accession numbers.
  • #=GR (Generic per-Residue annotation): Markup with one character per aligned position of a single sequence, such as secondary structure predictions or per-residue scores encoded as single characters.
  • #=GC (Generic per-Column annotation): Markup with one character per alignment column, used for consensus annotations such as consensus secondary structure or conserved positions.

Each alignment is bookended by a header line (# STOCKHOLM 1.0) that marks its start and a terminal line (//) that marks its end. A single file may contain several alignments, one after another.
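
The line types described above can be illustrated with a small hand-written alignment and a minimal parser. This is only a sketch in Python using the standard library: the function name parse_stockholm, the layout of the returned dictionary, and the example sequences and annotations are assumptions made for illustration, not part of any tool's API. It reads a single alignment, joins interleaved blocks by concatenating lines that share a name, and performs almost no validation.

```python
from collections import defaultdict

# Hypothetical example alignment, written by hand for illustration.
EXAMPLE = """\
# STOCKHOLM 1.0
#=GF ID   EXAMPLE
#=GS seq1 DE hypothetical sequence one
seq1          ACGU-ACGU
#=GR seq1 SS  <<<...>>>
seq2          ACGUUACGA
#=GC SS_cons  <<<...>>>
//
"""


def parse_stockholm(text):
    """Parse one Stockholm alignment into plain Python containers."""
    lines = text.splitlines()
    if not lines or not lines[0].startswith("# STOCKHOLM"):
        raise ValueError("missing '# STOCKHOLM 1.0' header")

    aln = {
        "gf": defaultdict(list),   # feature -> list of free-text values
        "gs": defaultdict(dict),   # seqname -> {feature: free text}
        "gr": defaultdict(dict),   # seqname -> {feature: per-residue markup}
        "gc": defaultdict(str),    # feature -> per-column markup
        "seqs": defaultdict(str),  # seqname -> aligned sequence
    }
    for line in lines[1:]:
        line = line.rstrip()
        if not line:
            continue
        if line == "//":           # end of this alignment
            break
        if line.startswith("#=GF"):
            _, feature, value = line.split(None, 2)
            aln["gf"][feature].append(value)
        elif line.startswith("#=GS"):
            _, name, feature, value = line.split(None, 3)
            aln["gs"][name][feature] = value
        elif line.startswith("#=GR"):
            _, name, feature, markup = line.split(None, 3)
            previous = aln["gr"][name].get(feature, "")
            aln["gr"][name][feature] = previous + markup
        elif line.startswith("#=GC"):
            _, feature, markup = line.split(None, 2)
            aln["gc"][feature] += markup
        elif line.startswith("#"):
            continue               # ordinary comment line
        else:
            name, residues = line.split(None, 1)
            aln["seqs"][name] += residues  # supports interleaved blocks
    return aln


aln = parse_stockholm(EXAMPLE)
print(list(aln["seqs"]))        # sequence names in file order
print(aln["gc"]["SS_cons"])     # consensus secondary structure markup
```

Keeping each annotation type in its own mapping mirrors the way the format separates per-file, per-sequence, per-residue, and per-column information. Established libraries handle many more cases than this sketch does.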

Usage

Stockholm format is widely used in bioinformatics software and databases, including HMMER for homology searches and profile Hidden Markov Model (HMM) building, and the Pfam and Rfam databases for protein and RNA families, respectively. Its ability to carry extensive annotations along with the alignment makes it a preferred format for detailed analysis of sequence features and evolutionary relationships.

Advantages

  • Rich Annotations: The format supports extensive annotations, which are crucial for understanding the biological significance of the sequences.
  • Flexibility: It can represent both nucleotide and amino acid alignments, making it suitable for a wide range of applications in molecular biology.
  • Compatibility: Many bioinformatics tools and databases support the Stockholm format, facilitating easy data exchange and integration.

Limitations

  • Complexity: The richness and flexibility of the format can also make it more complex to parse and generate compared to simpler formats like FASTA.
  • File Size: Annotations can significantly increase the size of the files, which might be a concern when dealing with large datasets.
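
Both limitations can be seen by reducing a Stockholm alignment to FASTA, which keeps only names and sequences. The following Python sketch is illustrative only: the function name stockholm_to_fasta, its keep_gaps parameter, and the example data are assumptions, and the code simply discards every annotation line rather than trying to preserve it.

```python
# Hypothetical example alignment, written by hand for illustration.
EXAMPLE = """\
# STOCKHOLM 1.0
#=GF ID   EXAMPLE
#=GS seq1 DE hypothetical sequence one
seq1          ACGU-ACGU
seq2          ACGUUACGA
#=GC SS_cons  <<<...>>>
//
"""


def stockholm_to_fasta(text, keep_gaps=True):
    """Return the sequences of one Stockholm alignment as FASTA text.

    All annotation lines (#=GF, #=GS, #=GR, #=GC) and comments are dropped.
    """
    seqs = {}
    for line in text.splitlines():
        line = line.strip()
        if line == "//":                      # end of the alignment
            break
        if not line or line.startswith("#"):  # header, annotation, comment
            continue
        name, residues = line.split(None, 1)
        seqs[name] = seqs.get(name, "") + residues  # join interleaved blocks

    records = []
    for name, residues in seqs.items():
        if not keep_gaps:
            residues = residues.replace("-", "").replace(".", "")
        records.append(f">{name}\n{residues}")
    return "\n".join(records) + "\n"


fasta = stockholm_to_fasta(EXAMPLE)
print(fasta)
print(len(EXAMPLE), len(fasta))  # character counts before and after
```

The FASTA output is shorter and trivial to parse, but the identifier, description, and consensus structure carried by the Stockholm file are lost in the conversion.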

Conclusion

The Stockholm format plays a crucial role in bioinformatics, offering a comprehensive way to store and share sequence alignments along with valuable annotations. Its widespread adoption across tools and databases underscores its importance in facilitating advanced molecular biology research.





Contributors: Prab R. Tumpati, MD