Author: "Linderman, George C" / Publisher: arxiv - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Linderman, George C"' showing total 2 results

Start Over Author "Linderman, George C" Publisher arxiv

2 results on '"Linderman, George C"'

1. Randomized Near Neighbor Graphs, Giant Components, and Applications in Data Science

Author: Linderman, George C., Mishne, Gal, Kluger, Yuval, and Steinerberger, Stefan
Subjects: FOS: Computer and information sciences, Discrete Mathematics (cs.DM), Statistics - Machine Learning, Computer Science - Data Structures and Algorithms, Probability (math.PR), FOS: Mathematics, Mathematics - Combinatorics, Data Structures and Algorithms (cs.DS), Machine Learning (stat.ML), Combinatorics (math.CO), Mathematics - Probability, Computer Science - Discrete Mathematics
Abstract: If we pick $n$ random points uniformly in $[0,1]^d$ and connect each point to its $k-$nearest neighbors, then it is well known that there exists a giant connected component with high probability. We prove that in $[0,1]^d$ it suffices to connect every point to $ c_{d,1} \log{\log{n}}$ points chosen randomly among its $ c_{d,2} \log{n}-$nearest neighbors to ensure a giant component of size $n - o(n)$ with high probability. This construction yields a much sparser random graph with $\sim n \log\log{n}$ instead of $\sim n \log{n}$ edges that has comparable connectivity properties. This result has nontrivial implications for problems in data science where an affinity matrix is constructed: instead of picking the $k-$nearest neighbors, one can often pick $k' \ll k$ random points out of the $k-$nearest neighbors without sacrificing efficiency. This can massively simplify and accelerate computation, we illustrate this with several numerical examples.
Published: 2017
Full Text: View/download PDF

2. Efficient Algorithms for t-distributed Stochastic Neighborhood Embedding

Author: Linderman, George C., Rachh, Manas, Hoskins, Jeremy G., Steinerberger, Stefan, and Kluger, Yuval
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Statistics - Machine Learning, Astrophysics::High Energy Astrophysical Phenomena, Astrophysics::Solar and Stellar Astrophysics, Machine Learning (stat.ML), Astrophysics::Cosmology and Extragalactic Astrophysics, Machine Learning (cs.LG)
Abstract: t-distributed Stochastic Neighborhood Embedding (t-SNE) is a method for dimensionality reduction and visualization that has become widely popular in recent years. Efficient implementations of t-SNE are available, but they scale poorly to datasets with hundreds of thousands to millions of high dimensional data-points. We present Fast Fourier Transform-accelerated Interpolation-based t-SNE (FIt-SNE), which dramatically accelerates the computation of t-SNE. The most time-consuming step of t-SNE is a convolution that we accelerate by interpolating onto an equispaced grid and subsequently using the fast Fourier transform to perform the convolution. We also optimize the computation of input similarities in high dimensions using multi-threaded approximate nearest neighbors. We further present a modification to t-SNE called "late exaggeration," which allows for easier identification of clusters in t-SNE embeddings. Finally, for datasets that cannot be loaded into the memory, we present out-of-core randomized principal component analysis (oocPCA), so that the top principal components of a dataset can be computed without ever fully loading the matrix, hence allowing for t-SNE of large datasets to be computed on resource-limited machines.
Published: 2017
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

2 results on '"Linderman, George C"'

1. Randomized Near Neighbor Graphs, Giant Components, and Applications in Data Science

2. Efficient Algorithms for t-distributed Stochastic Neighborhood Embedding

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Database

2 results on '"Linderman, George C"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources