Fastq
FASTQ is a text-based format for storing both a biological sequence (usually nucleotide sequence) and its corresponding quality scores. Both the sequence letter and quality score are each encoded with a single ASCII character for brevity. The format was originally developed at the Wellcome Trust Sanger Institute to bundle a FASTA formatted sequence and its quality data.
Format[edit | edit source]
A FASTQ file normally uses four lines per sequence:
- A line starting with a '@' character, followed by a sequence identifier and an optional description (like a FASTA title line).
- The raw sequence letters.
- A line starting with a '+' character, and optionally followed by the same sequence identifier (and any description).
- Encoded quality values for the sequence in Line 2, and must contain the same number of symbols as letters in the sequence.
Quality Scores[edit | edit source]
The quality scores are typically Phred-scaled and represent the probability of an incorrect base call. The scores are encoded as ASCII characters, with different encoding schemes used by different sequencing technologies.
Example[edit | edit source]
A typical FASTQ entry looks like this:
@SEQ_ID GATTTGGGGTTCAAAGCAGTATCGATCAAATAGTTCAGTGGTAGAATTTCAGT + !*((((*+))%%%++)(%%%%).1*-+*))55CCF>>>>>>CCCCCCC65
Applications[edit | edit source]
FASTQ files are commonly used in next-generation sequencing (NGS) technologies, such as those produced by Illumina, Roche 454, and Ion Torrent. They are essential for storing raw sequence data and quality scores, which are then used in various bioinformatics analyses, including sequence alignment, variant calling, and genome assembly.
Tools and Software[edit | edit source]
Several tools and software packages are available for working with FASTQ files, including:
- FASTQC - A quality control tool for high throughput sequence data.
- Seqtk - A toolkit for processing sequences in FASTA/Q formats.
- Trimmomatic - A flexible read trimming tool for Illumina NGS data.
Related Pages[edit | edit source]
See Also[edit | edit source]
Search WikiMD
Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro / Zepbound) available.
Advertise on WikiMD
WikiMD's Wellness Encyclopedia |
Let Food Be Thy Medicine Medicine Thy Food - Hippocrates |
Translate this page: - East Asian
中文,
日本,
한국어,
South Asian
हिन्दी,
தமிழ்,
తెలుగు,
Urdu,
ಕನ್ನಡ,
Southeast Asian
Indonesian,
Vietnamese,
Thai,
မြန်မာဘာသာ,
বাংলা
European
español,
Deutsch,
français,
Greek,
português do Brasil,
polski,
română,
русский,
Nederlands,
norsk,
svenska,
suomi,
Italian
Middle Eastern & African
عربى,
Turkish,
Persian,
Hebrew,
Afrikaans,
isiZulu,
Kiswahili,
Other
Bulgarian,
Hungarian,
Czech,
Swedish,
മലയാളം,
मराठी,
ਪੰਜਾਬੀ,
ગુજરાતી,
Portuguese,
Ukrainian
WikiMD is not a substitute for professional medical advice. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.
Contributors: Prab R. Tumpati, MD