Needleman–Wunsch algorithm
Needleman–Wunsch algorithm is a dynamic programming algorithm used for the optimal alignment of two sequences. It was developed by Saul B. Needleman and Christian D. Wunsch in 1970 and is primarily used in bioinformatics to align protein or nucleotide sequences. The algorithm identifies the best match between two sequences by inserting gaps in the sequences to achieve the highest possible alignment score.
Overview[edit | edit source]
The Needleman–Wunsch algorithm is designed to solve the global alignment problem, where the goal is to align two sequences from beginning to end, maximizing the number of matches and minimizing the number of gaps and mismatches. It uses a scoring matrix to keep track of the alignment scores and a traceback procedure to reconstruct the optimal alignment from the completed matrix.
Algorithm[edit | edit source]
The algorithm consists of three main steps:
- Initialization: A scoring matrix is initialized with the first row and column filled based on gap penalties.
- Matrix filling: The rest of the matrix is filled according to a scoring scheme, which typically awards points for matches, deducts points for mismatches, and applies a gap penalty.
- Traceback: Starting from the bottom-right corner of the matrix, the path of optimal alignment is traced back to the top-left corner.
Scoring System[edit | edit source]
The Needleman–Wunsch algorithm requires a scoring system to evaluate alignments. This system includes:
- Match score: Points added for matching characters.
- Mismatch penalty: Points subtracted for non-matching characters.
- Gap penalty: Points subtracted for introducing a gap in one of the sequences.
Applications[edit | edit source]
The Needleman–Wunsch algorithm is widely used in bioinformatics for tasks such as:
- Sequence alignment
- Phylogenetic analysis
- Database searching for similar sequences
Advantages and Limitations[edit | edit source]
The main advantage of the Needleman–Wunsch algorithm is its ability to find the globally optimal alignment between two sequences. However, its computational complexity is a limitation, making it less practical for very long sequences without modifications or optimizations.
See Also[edit | edit source]
References[edit | edit source]
Search WikiMD
Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro / Zepbound) available.
Advertise on WikiMD
WikiMD's Wellness Encyclopedia |
Let Food Be Thy Medicine Medicine Thy Food - Hippocrates |
Translate this page: - East Asian
中文,
日本,
한국어,
South Asian
हिन्दी,
தமிழ்,
తెలుగు,
Urdu,
ಕನ್ನಡ,
Southeast Asian
Indonesian,
Vietnamese,
Thai,
မြန်မာဘာသာ,
বাংলা
European
español,
Deutsch,
français,
Greek,
português do Brasil,
polski,
română,
русский,
Nederlands,
norsk,
svenska,
suomi,
Italian
Middle Eastern & African
عربى,
Turkish,
Persian,
Hebrew,
Afrikaans,
isiZulu,
Kiswahili,
Other
Bulgarian,
Hungarian,
Czech,
Swedish,
മലയാളം,
मराठी,
ਪੰਜਾਬੀ,
ગુજરાતી,
Portuguese,
Ukrainian
Medical Disclaimer: WikiMD is not a substitute for professional medical advice. The information on WikiMD is provided as an information resource only, may be incorrect, outdated or misleading, and is not to be used or relied on for any diagnostic or treatment purposes. Please consult your health care provider before making any healthcare decisions or for guidance about a specific medical condition. WikiMD expressly disclaims responsibility, and shall have no liability, for any damages, loss, injury, or liability whatsoever suffered as a result of your reliance on the information contained in this site. By visiting this site you agree to the foregoing terms and conditions, which may from time to time be changed or supplemented by WikiMD. If you do not agree to the foregoing terms and conditions, you should not enter or use this site. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.
Contributors: Prab R. Tumpati, MD