Author: "Chu, Tianzhe" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chu, Tianzhe"' showing total 16 results

Start Over Author "Chu, Tianzhe"

16 results on '"Chu, Tianzhe"'

1. White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?

Author: Yu, Yaodong, Buchanan, Sam, Pai, Druv, Chu, Tianzhe, Wu, Ziyang, Tong, Shengbang, Bai, Hao, Zhai, Yuexiang, Haeffele, Benjamin D., and Ma, Yi
Subjects: Computer Science - Machine Learning, Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition
Abstract: In this paper, we contend that a natural objective of representation learning is to compress and transform the distribution of the data, say sets of tokens, towards a low-dimensional Gaussian mixture supported on incoherent subspaces. The goodness of such a representation can be evaluated by a principled measure, called sparse rate reduction, that simultaneously maximizes the intrinsic information gain and extrinsic sparsity of the learned representation. From this perspective, popular deep network architectures, including transformers, can be viewed as realizing iterative schemes to optimize this measure. Particularly, we derive a transformer block from alternating optimization on parts of this objective: the multi-head self-attention operator compresses the representation by implementing an approximate gradient descent step on the coding rate of the features, and the subsequent multi-layer perceptron sparsifies the features. This leads to a family of white-box transformer-like deep network architectures, named CRATE, which are mathematically fully interpretable. We show, by way of a novel connection between denoising and compression, that the inverse to the aforementioned compressive encoding can be realized by the same class of CRATE architectures. Thus, the so-derived white-box architectures are universal to both encoders and decoders. Experiments show that these networks, despite their simplicity, indeed learn to compress and sparsify representations of large-scale real-world image and text datasets, and achieve performance very close to highly engineered transformer-based models: ViT, MAE, DINO, BERT, and GPT2. We believe the proposed computational framework demonstrates great potential in bridging the gap between theory and practice of deep learning, from a unified perspective of data compression. Code is available at: https://ma-lab-berkeley.github.io/CRATE ., Comment: Accepted at Journal of Machine Learning Research. This paper integrates the works arXiv:2306.01129 and arXiv:2308.16271 into a complete story. In this paper, we improve the writing and organization, and also add conceptual, empirical, and theoretical improvements over the previous work. V2: small typo fixes/formatting improvements. V3: improvements from journal revisions. V4: fix figures
Published: 2023

2. Emergence of Segmentation with Minimalistic White-Box Transformers

Author: Yu, Yaodong, Chu, Tianzhe, Tong, Shengbang, Wu, Ziyang, Pai, Druv, Buchanan, Sam, and Ma, Yi
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Transformer-like models for vision tasks have recently proven effective for a wide range of downstream applications such as segmentation and detection. Previous works have shown that segmentation properties emerge in vision transformers (ViTs) trained using self-supervised methods such as DINO, but not in those trained on supervised classification tasks. In this study, we probe whether segmentation emerges in transformer-based models solely as a result of intricate self-supervised learning mechanisms, or if the same emergence can be achieved under much broader conditions through proper design of the model architecture. Through extensive experimental results, we demonstrate that when employing a white-box transformer-like architecture known as CRATE, whose design explicitly models and pursues low-dimensional structures in the data distribution, segmentation properties, at both the whole and parts levels, already emerge with a minimalistic supervised training recipe. Layer-wise finer-grained analysis reveals that the emergent properties strongly corroborate the designed mathematical functions of the white-box network. Our results suggest a path to design white-box foundation models that are simultaneously highly performant and mathematically fully interpretable. Code is at \url{https://github.com/Ma-Lab-Berkeley/CRATE}., Comment: Code: https://github.com/Ma-Lab-Berkeley/CRATE
Published: 2023

3. Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

Author: Chu, Tianzhe, Tong, Shengbang, Ding, Tianjiao, Dai, Xili, Haeffele, Benjamin David, Vidal, René, and Ma, Yi
Subjects: Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: The advent of large pre-trained models has brought about a paradigm shift in both visual representation learning and natural language processing. However, clustering unlabeled images, as a fundamental and classic machine learning problem, still lacks an effective solution, particularly for large-scale datasets. In this paper, we propose a novel image clustering pipeline that leverages the powerful feature representation of large pre-trained models such as CLIP and cluster images effectively and efficiently at scale. We first developed a novel algorithm to estimate the number of clusters in a given dataset. We then show that the pre-trained features are significantly more structured by further optimizing the rate reduction objective. The resulting features may significantly improve the clustering accuracy, e.g., from 57\% to 66\% on ImageNet-1k. Furthermore, by leveraging CLIP's multimodality bridge between image and text, we develop a simple yet effective self-labeling algorithm that produces meaningful captions for the clusters. Through extensive experiments, we show that our pipeline works well on standard datasets such as CIFAR-10, CIFAR-100, and ImageNet-1k. It also extends to datasets that are not curated for clustering, such as LAION-Aesthetics and WikiArts. We released the code in https://github.com/LeslieTrue/CPP., Comment: 23 pages, 14 figures
Published: 2023

4. White-Box Transformers via Sparse Rate Reduction

Author: Yu, Yaodong, Buchanan, Sam, Pai, Druv, Chu, Tianzhe, Wu, Ziyang, Tong, Shengbang, Haeffele, Benjamin D., and Ma, Yi
Subjects: Computer Science - Machine Learning
Abstract: In this paper, we contend that the objective of representation learning is to compress and transform the distribution of the data, say sets of tokens, towards a mixture of low-dimensional Gaussian distributions supported on incoherent subspaces. The quality of the final representation can be measured by a unified objective function called sparse rate reduction. From this perspective, popular deep networks such as transformers can be naturally viewed as realizing iterative schemes to optimize this objective incrementally. Particularly, we show that the standard transformer block can be derived from alternating optimization on complementary parts of this objective: the multi-head self-attention operator can be viewed as a gradient descent step to compress the token sets by minimizing their lossy coding rate, and the subsequent multi-layer perceptron can be viewed as attempting to sparsify the representation of the tokens. This leads to a family of white-box transformer-like deep network architectures which are mathematically fully interpretable. Despite their simplicity, experiments show that these networks indeed learn to optimize the designed objective: they compress and sparsify representations of large-scale real-world vision datasets such as ImageNet, and achieve performance very close to thoroughly engineered transformers such as ViT. Code is at \url{https://github.com/Ma-Lab-Berkeley/CRATE}., Comment: 33 pages, 11 figures
Published: 2023

5. Novel chitosan-modified biochar prepared from a Chinese herb residue for multiple heavy metals removal: Characterization, performance and mechanism

Author: Wang, Chengjiu, Qiao, Jixu, Yuan, Jiandan, Tang, Zhentao, Chu, Tianzhe, Lin, Ruifeng, Wen, Hongting, Zheng, Chuan, Chen, Hulan, Xie, Haijiao, Peng, Cheng, and Tan, Yuzhu
Published: 2024
Full Text: View/download PDF

6. Unlocking the hidden potential: Enhancing the utilization of stems and leaves through metabolite analysis and toxicity assessment of various parts of Aconitum carmichaelii

Author: Zhou, Yinlin, Qu, Can, Yan, Hongling, Chu, Tianzhe, Wu, Jing, Kang, Qinzhao, Peng, Cheng, Wang, Yulan, and Tan, Yuzhu
Published: 2024
Full Text: View/download PDF

7. Biochar derived from traditional Chinese medicine residues: An efficient adsorbent for heavy metal Pb(II)

Author: Yuan, Jiandan, Wang, Chengjiu, Tang, Zhentao, Chu, Tianzhe, Zheng, Chuan, Han, Qingrong, Chen, Hulan, and Tan, Yuzhu
Published: 2024
Full Text: View/download PDF

8. TCMPG 2.0: An Enhanced Database of Traditional Chinese Medicine Plant Genomes

Author: Meng, Fanbo, primary, Chu, Tianzhe, additional, Hu, Lianjiang, additional, Zhang, Mengqing, additional, Cheng, Qian, additional, Yang, Xiuping, additional, Liu, Zhuo, additional, Ye, Yuannong, additional, Song, Xiaoming, additional, and Chen, Wei, additional
Published: 2024
Full Text: View/download PDF

9. Unveiling the Therapeutic Potential: Phthalide Extract from the Stem and Leaf of Ligusticum Chuanxiong Alleviates Neuroinflammation in Cerebral Ischemia-Reperfusion Injury Via Casr-Mediated Inhibition of Nlrp3 Inflammasome

Author: Wu, Jing, primary, Li, Chunrong, additional, Li, Gang, additional, Chu, Tianzhe, additional, Wang, Li, additional, Zhang, Dandan, additional, Kang, Qinzhao, additional, Lin, Ruifeng, additional, Peng, Cheng, additional, Han, Xiaoyu, additional, Peng, Fu, additional, Zheng, Chuan, additional, and Tan, Yuzhu, additional
Published: 2024
Full Text: View/download PDF

10. A tetraploidization event shaped the Aquilaria sinensis genome and contributed to the ability of sesquiterpenes synthesis

Author: Meng, Fanbo, Chu, Tianzhe, Tang, Qiang, and Chen, Wei
Published: 2021
Full Text: View/download PDF

11. Genome assembly of Polygala tenuifolia provides insights into its karyotype evolution and triterpenoid saponin biosynthesis

Author: Meng, Fanbo, primary, Chu, Tianzhe, additional, Feng, Pengmian, additional, Li, Nan, additional, Song, Chi, additional, Li, Chunjin, additional, Leng, Liang, additional, Song, Xiaoming, additional, and Chen, Wei, additional
Published: 2023
Full Text: View/download PDF

12. TCMPG: an integrative database for traditional Chinese medicine plant genomes

Author: Meng, Fanbo, primary, Tang, Qiang, additional, Chu, Tianzhe, additional, Li, Xianhai, additional, Lin, Yue, additional, Song, Xiaoming, additional, and Chen, Wei, additional
Published: 2022
Full Text: View/download PDF

13. Additional file 2 of A tetraploidization event shaped the Aquilaria sinensis genome and contributed to the ability of sesquiterpenes synthesis

Author: Meng, Fanbo, Chu, Tianzhe, Tang, Qiang, and Chen, Wei
Abstract: Additional file 2: Figure S1. Homologous Genes Dotplot between A. sinensis and Grape Genomes. Figure S2. Homologous Genes Dotplot between A. sinensis and Cacao Genomes. Figure S3. Local Homologous Alignments of Grape, Cacao, and A. sinensis Genomes. Figure S4. The Retention of Duplicated Genes Residing in Two Subgenomes of A. sinensis using the Grape as Reference. Figure S5. The Retention of Duplicated Genes Residing in Two Subgenomes of A. sinensis using the Cacao as Reference. Figure S6. Near Geometric Distribution of Continually Lost or Translocated Genes between A. sinensis and Grape. Figure S7. Near Geometric Distribution of Continually Lost or Translocated Genes between A. sinensis and Cacao. Figure S8. Histograms and Gaussian Fitted Curves of K S $${K}_{S}$$ between Colinear Homologous Genes. Figure S9. Gene Synteny Analysis among Grape, Cacao, and A. sinensis. Figure S10. The Phylogenetic Tree Constructed using Sesquiterpene Synthesis Genes from Grape, Cacao, and A. sinensis.
Published: 2021
Full Text: View/download PDF

14. Chiral 4-O-acylterpineol as transdermal permeation enhancers: insights of the enhancement mechanisms of a transdermal enantioselective delivery system for flurbiprofen

Author: Chu, Tianzhe, primary, Wang, Chunyan, additional, Wang, Jing, additional, Wang, Heping, additional, Geng, Dandan, additional, Wu, Chensi, additional, Zhao, Linlin, additional, and Zhao, Ligang, additional
Published: 2020
Full Text: View/download PDF

15. Permeation-enhancing effects and mechanisms of O-acylterpineol on isosorbide dinitrate: mechanistic insights based on ATR-FTIR spectroscopy, molecular modeling, and CLSM images

Author: Li, Yan, primary, Wang, Chunyan, additional, Wang, Jian, additional, Chu, Tianzhe, additional, Zhao, Linlin, additional, and Zhao, Ligang, additional
Published: 2019
Full Text: View/download PDF

16. Genome assembly of Polygala tenuifoliaprovides insights into its karyotype evolution and triterpenoid saponin biosynthesis

Author: Meng, Fanbo, Chu, Tianzhe, Feng, Pengmian, Li, Nan, Song, Chi, Li, Chunjin, Leng, Liang, Song, Xiaoming, and Chen, Wei
Abstract: Polygala tenuifoliais a perennial medicinal plant that has been widely used in traditional Chinese medicine for treating mental diseases. However, the lack of genomic resources limits the insight into its evolutionary and biological characterization. In the present work, we reported the P. tenuifoliagenome, the first genome assembly of the Polygalaceae family. We sequenced and assembled this genome by a combination of Illumnina, PacBio HiFi, and Hi-C mapping. The assembly includes 19 pseudochromosomes covering ~92.68% of the assembled genome (~769.62 Mb). There are 36 463 protein-coding genes annotated in this genome. Detailed comparative genome analysis revealed that P. tenuifoliaexperienced two rounds of whole genome duplication that occurred ~39–44 and ~18–20 million years ago, respectively. Accordingly, we systematically reconstructed ancestral chromosomes of P. tenuifoliaand inferred its chromosome evolution trajectories from the common ancestor of core eudicots to the present species. Based on the transcriptomics data, enzyme genes and transcription factors involved in the synthesis of triterpenoid saponin in P. tenuifoliawere identified. Further analysis demonstrated that whole-genome duplications and tandem duplications play critical roles in the expansion of P450 and UGT gene families, which contributed to the synthesis of triterpenoid saponins. The genome and transcriptome data will not only provide valuable resources for comparative and functional genomic researches on Polygalaceae, but also shed light on the synthesis of triterpenoid saponin.
Published: 2023
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

16 results on '"Chu, Tianzhe"'

1. White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?

2. Emergence of Segmentation with Minimalistic White-Box Transformers

3. Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

4. White-Box Transformers via Sparse Rate Reduction

5. Novel chitosan-modified biochar prepared from a Chinese herb residue for multiple heavy metals removal: Characterization, performance and mechanism

6. Unlocking the hidden potential: Enhancing the utilization of stems and leaves through metabolite analysis and toxicity assessment of various parts of Aconitum carmichaelii

7. Biochar derived from traditional Chinese medicine residues: An efficient adsorbent for heavy metal Pb(II)

8. TCMPG 2.0: An Enhanced Database of Traditional Chinese Medicine Plant Genomes

9. Unveiling the Therapeutic Potential: Phthalide Extract from the Stem and Leaf of Ligusticum Chuanxiong Alleviates Neuroinflammation in Cerebral Ischemia-Reperfusion Injury Via Casr-Mediated Inhibition of Nlrp3 Inflammasome

10. A tetraploidization event shaped the Aquilaria sinensis genome and contributed to the ability of sesquiterpenes synthesis

11. Genome assembly of Polygala tenuifolia provides insights into its karyotype evolution and triterpenoid saponin biosynthesis

12. TCMPG: an integrative database for traditional Chinese medicine plant genomes

13. Additional file 2 of A tetraploidization event shaped the Aquilaria sinensis genome and contributed to the ability of sesquiterpenes synthesis

14. Chiral 4-O-acylterpineol as transdermal permeation enhancers: insights of the enhancement mechanisms of a transdermal enantioselective delivery system for flurbiprofen

15. Permeation-enhancing effects and mechanisms of O-acylterpineol on isosorbide dinitrate: mechanistic insights based on ATR-FTIR spectroscopy, molecular modeling, and CLSM images

16. Genome assembly of Polygala tenuifoliaprovides insights into its karyotype evolution and triterpenoid saponin biosynthesis

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

16 results on '"Chu, Tianzhe"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources