FASTQ
FASTQ is a text-based format for storing both a biological sequence (usually a nucleotide sequence) and its corresponding quality scores. Each sequence letter and each quality score is encoded as a single ASCII character for brevity. The format was originally developed at the Wellcome Trust Sanger Institute to bundle a FASTA-formatted sequence with its quality data.
Format
A FASTQ file normally uses four lines per sequence:
- A line starting with an '@' character, followed by a sequence identifier and an optional description (like a FASTA title line).
- The raw sequence letters.
- A line starting with a '+' character, optionally followed by the same sequence identifier (and any description).
- The encoded quality values for the sequence on line 2; this line must contain the same number of symbols as there are letters in the sequence.
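The four-line layout above can be read with a few lines of Python. The following is a minimal sketch, not a reference parser: it assumes each record occupies exactly four lines (no wrapped sequence or quality lines), and the names `FastqRecord` and `read_fastq` are hypothetical, chosen only for this illustration.

```python
import io
from typing import Iterator, NamedTuple, TextIO


class FastqRecord(NamedTuple):
    identifier: str  # header text without the leading '@'
    sequence: str    # raw sequence letters
    quality: str     # encoded quality string, same length as sequence


def read_fastq(handle: TextIO) -> Iterator[FastqRecord]:
    """Yield records from a FASTQ stream with exactly four lines per record."""
    while True:
        header = handle.readline()
        if not header:
            return  # end of input
        if not header.strip():
            continue  # tolerate blank lines between records
        header = header.rstrip("\r\n")
        sequence = handle.readline().rstrip("\r\n")
        separator = handle.readline().rstrip("\r\n")
        quality = handle.readline().rstrip("\r\n")
        if not header.startswith("@"):
            raise ValueError(f"expected '@' at start of header, got {header!r}")
        if not separator.startswith("+"):
            raise ValueError(f"expected '+' separator line, got {separator!r}")
        if len(sequence) != len(quality):
            raise ValueError(f"sequence and quality lengths differ in {header!r}")
        yield FastqRecord(header[1:], sequence, quality)


# Usage: parse two small in-memory records (made-up data for illustration).
example = io.StringIO("@read1 first read\nACGT\n+\nIIII\n@read2\nGG\n+read2\n!!\n")
for record in read_fastq(example):
    print(record.identifier, record.sequence, record.quality)
```

The length check mirrors the rule that the quality line must contain one symbol per sequence letter. Files that wrap long sequences over several lines would need a more tolerant reader.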
Quality Scores
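Quality scores are typically Phred-scaled: a score Q corresponds to an estimated probability p of an incorrect base call through Q = -10 log10(p), so Q20 means a 1 in 100 chance of error and Q30 a 1 in 1,000 chance. The scores are encoded as ASCII characters, and different sequencing technologies have used different encoding schemes. The Sanger format and Illumina 1.8+ add an offset of 33 to each score (Phred+33), while older Illumina pipelines (1.3 to 1.7) used an offset of 64 (Phred+64).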
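As a worked illustration of the Phred scale, the sketch below decodes a quality string into integer scores and error probabilities using Q = -10 log10(p), rearranged to p = 10^(-Q/10). It assumes the Phred+33 offset (Sanger and recent Illumina output) by default; the function names are hypothetical.

```python
from typing import List


def phred_scores(quality: str, offset: int = 33) -> List[int]:
    """Convert an encoded quality string into integer Phred scores."""
    scores = [ord(symbol) - offset for symbol in quality]
    if any(score < 0 for score in scores):
        raise ValueError("negative score: the offset is probably wrong for this data")
    return scores


def error_probabilities(quality: str, offset: int = 33) -> List[float]:
    """Convert an encoded quality string into per-base error probabilities."""
    return [10 ** (-score / 10) for score in phred_scores(quality, offset)]


# '!' is ASCII 33, '5' is ASCII 53, 'I' is ASCII 73.
print(phred_scores("!5I"))         # [0, 20, 40]
print(error_probabilities("!5I"))  # approximately 1.0, 0.01, 0.0001
```

For data written with the older Phred+64 convention, pass `offset=64`. Decoding such data with the default offset yields implausibly high scores without raising an error, so the encoding should be confirmed before analysis.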
Example
A typical FASTQ entry looks like this:
@SEQ_ID
GATTTGGGGTTCAAAGCAGTATCGATCAAATAGTTCAGTGGTAGAATTTCAGT
+
!*((((*+))%%%++)(%%%%).1*-+*))55CCF>>>>>>CCCCCCC65
Applications
FASTQ files are the common output format of next-generation sequencing (NGS) platforms such as Illumina, Roche 454, and Ion Torrent. They store the raw sequence reads together with their quality scores, which downstream bioinformatics analyses such as sequence alignment, variant calling, and genome assembly then use.
Tools and Software
Several tools and software packages are available for working with FASTQ files, including:
- FastQC - A quality control tool for high throughput sequence data.
- Seqtk - A toolkit for processing sequences in FASTA/Q formats.
- Trimmomatic - A flexible read trimming tool for Illumina NGS data.
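To give a sense of the kind of operation a read trimming tool performs, the sketch below removes low-quality bases from the 3' end of a read. It is a simplified illustration under stated assumptions (Phred+33 encoding, a hypothetical default threshold of Q20, a hypothetical function name), and it is not the algorithm or interface of any of the tools listed above.

```python
from typing import Tuple


def trim_trailing_low_quality(
    sequence: str, quality: str, min_q: int = 20, offset: int = 33
) -> Tuple[str, str]:
    """Drop bases from the end of a read while their Phred score is below min_q."""
    if len(sequence) != len(quality):
        raise ValueError("sequence and quality must have the same length")
    end = len(quality)
    # Walk backwards from the 3' end until a base meets the threshold.
    while end > 0 and ord(quality[end - 1]) - offset < min_q:
        end -= 1
    return sequence[:end], quality[:end]


# 'I' encodes Q40; '#' and '!' encode Q2 and Q0 under Phred+33.
print(trim_trailing_low_quality("ACGTAC", "IIII#!"))  # ('ACGT', 'IIII')
```

Sequence and quality are trimmed together so that the two strings keep the same length, as the format requires.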
Contributors: Prab R. Tumpati, MD