Search

Your search for author "Shi, Bowen" returned 809 results.

Search Results

1. High Fidelity Text-Guided Music Generation and Editing via Single-Stage Flow Matching

2. Learning Fine-Grained Controllability on Speech Generation via Efficient Fine-Tuning

3. Strict area law implies commuting parent Hamiltonian

4. Meshfree finite difference solution of homogeneous Dirichlet problems of the fractional Laplacian

5. Conformal geometry from entanglement

6. Chiral Virasoro algebra from a single wavefunction

7. XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception

8. Towards Privacy-Aware Sign Language Translation at Scale

9. UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding

10. Audiobox: Unified Audio Generation with Natural Language Prompts

11. AiluRus: A Scalable ViT Framework for Dense Prediction

15. Generative Pre-training for Speech with Flow Matching

16. Immersed figure-8 annuli and anyons

17. Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

18. Toward American Sign Language Processing in the Real World: Data, Tasks, and Methods

19. EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis

23. ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting

24. Hybrid Distillation: Connecting Masked Autoencoders with Contrastive Learners

25. Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

26. Prompt to GPT-3: Step-by-Step Thinking Instructions for Humor Generation

27. Scaling Speech Technology to 1,000+ Languages

28. SEGA: Structural Entropy Guided Anchor View for Graph Contrastive Learning

29. Highly-confined and tunable plasmonics based on two-dimensional solid-state defect lattices

30. Rethinking Visual Prompt Learning as Masked Visual Token Modeling

31. MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

32. Pose-Oriented Transformer with Uncertainty-Guided Refinement for 2D-to-3D Human Pose Estimation

33. Universal lower bound on topological entanglement entropy

35. Remote detectability from entanglement bootstrap I: Kirby's torus trick

36. Visual Story Generation Based on Emotion and Keywords

37. ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

38. Comparative layer-wise analysis of self-supervised speech models

39. Bibliometric Analysis of Advances in mHealth Technology Application in Chronic Disease Management

40. Knots and entanglement

41. u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality

42. Modular Commutators in Conformal Field Theory

43. Open-Domain Sign Language Translation Learned from Online Video

44. Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT

45. Searching for fingerspelled content in American Sign Language

49. Robust Self-Supervised Audio-Visual Speech Recognition

50. Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction
