T-distributed stochastic neighbor embedding
T-distributed stochastic neighbor embedding (t-SNE) is a machine learning algorithm for dimensionality reduction developed by Geoffrey Hinton and Laurens van der Maaten. It is particularly well-suited for the visualization of high-dimensional datasets.
Overview[edit | edit source]
t-SNE is a non-linear technique primarily used for data visualization. It converts similarities between data points to joint probabilities and tries to minimize the Kullback-Leibler divergence between the joint probabilities of the low-dimensional embedding and the high-dimensional data. The algorithm is an extension of Stochastic Neighbor Embedding (SNE) and aims to address the crowding problem inherent in SNE.
Algorithm[edit | edit source]
The t-SNE algorithm consists of two main stages: 1. **Computing pairwise similarities**: In the high-dimensional space, the similarity between two points is measured using a Gaussian distribution. In the low-dimensional space, a Student's t-distribution is used to measure similarity. 2. **Minimizing the Kullback-Leibler divergence**: The algorithm iteratively adjusts the positions of points in the low-dimensional space to minimize the divergence between the high-dimensional and low-dimensional similarity distributions.
Applications[edit | edit source]
t-SNE is widely used in various fields such as:
- Bioinformatics for visualizing gene expression data.
- Natural language processing for visualizing word embeddings.
- Computer vision for visualizing image features.
Advantages and Disadvantages[edit | edit source]
Advantages[edit | edit source]
- Effective in capturing the local structure of the data.
- Produces visually interpretable results.
Disadvantages[edit | edit source]
- Computationally intensive, especially for large datasets.
- The results can be sensitive to the choice of hyperparameters such as perplexity.
Related Techniques[edit | edit source]
- Principal Component Analysis (PCA)
- Multidimensional scaling (MDS)
- Isomap
- Locally Linear Embedding (LLE)
See Also[edit | edit source]
References[edit | edit source]
External Links[edit | edit source]
```
This template is designed for use in marking articles related to machine learning as stubs, which are articles that are too short to provide more than rudimentary information about a subject. When this template is placed on a page, it automatically adds the page to the "Machine learning stubs" category, making it easier for contributors to find and expand short articles in this subject area.
Navigation: Wellness - Encyclopedia - Health topics - Disease Index - Drugs - World Directory - Gray's Anatomy - Keto diet - Recipes
Search WikiMD
Ad.Tired of being Overweight? Try W8MD's physician weight loss program.
Semaglutide (Ozempic / Wegovy and Tirzepatide (Mounjaro / Zepbound) available.
Advertise on WikiMD
WikiMD is not a substitute for professional medical advice. See full disclaimer.
Credits:Most images are courtesy of Wikimedia commons, and templates Wikipedia, licensed under CC BY SA or similar.
Translate this page: - East Asian
中文,
日本,
한국어,
South Asian
हिन्दी,
தமிழ்,
తెలుగు,
Urdu,
ಕನ್ನಡ,
Southeast Asian
Indonesian,
Vietnamese,
Thai,
မြန်မာဘာသာ,
বাংলা
European
español,
Deutsch,
français,
Greek,
português do Brasil,
polski,
română,
русский,
Nederlands,
norsk,
svenska,
suomi,
Italian
Middle Eastern & African
عربى,
Turkish,
Persian,
Hebrew,
Afrikaans,
isiZulu,
Kiswahili,
Other
Bulgarian,
Hungarian,
Czech,
Swedish,
മലയാളം,
मराठी,
ਪੰਜਾਬੀ,
ગુજરાતી,
Portuguese,
Ukrainian
Contributors: Prab R. Tumpati, MD