Search

Your search keyword '"Wang, Zhongyuan"' showing total 1,527 results

Search Results

3. 52B to 1T: Lessons Learned via Tele-FLM Series

4. GUIDE: A Guideline-Guided Dataset for Instructional Video Comprehension

5. Decoding at the Speed of Thought: Harnessing Parallel Decoding of Lexical Units for LLMs

6. SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance

7. Learning Multi-dimensional Human Preference for Text-to-Image Generation

8. Tele-FLM Technical Report

9. End-to-end training of Multimodal Model and ranking Model

10. Not All Layers of LLMs Are Necessary During Inference

11. Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding

12. DVIS++: Improved Decoupled Framework for Universal Video Segmentation

13. KwaiAgents: Generalized Information-seeking Agent System with Large Language Models

14. Stable Segment Anything Model

15. Paragraph-to-Image Generation with Information-Enriched Diffusion Model

16. Temporal-Aware Refinement for Video-based Human Pose and Shape Recovery

17. Just Ask One More Time! Self-Agreement Improves Reasoning of Language Models in (Almost) All Scenarios

18. Improving Vision-and-Language Reasoning via Spatial Relations Modeling

21. Graph Ranking Contrastive Learning: A Extremely Simple yet Efficient Method

22. KwaiYiiMath: Technical Report

23. Exploring Sentence Type Effects on the Lombard Effect and Intelligibility Enhancement: A Comparative Study of Natural and Grid Sentences

24. Code-Style In-Context Learning for Knowledge-Based Question Answering

25. Towards Practical Capture of High-Fidelity Relightable Avatars

26. Improving Prosody for Cross-Speaker Style Transfer by Semi-Supervised Style Extractor and Hierarchical Modeling in Speech Synthesis

27. Nonlinear Aerodynamic Modeling and Analysis on Body of Fixed Canard Dual-Spin Projectiles

28. TalkSee: Interactive Video Retrieval Engine Using Large Language Model

32. Style-Label-Free: Cross-Speaker Style Transfer by Quantized VAE and Speaker-wise Normalization in Speech Synthesis

33. A Scale-Arbitrary Image Super-Resolution Network Using Frequency-domain Information

34. A Unified Model for Video Understanding and Knowledge Embedding with Heterogeneous Knowledge Graph Dataset

35. Back-Translation-Style Data Augmentation for Mandarin Chinese Polyphone Disambiguation

36. Kuaipedia: a Large-scale Multi-modal Short-video Encyclopedia

37. RaP: Redundancy-aware Video-language Pre-training for Text-Video Retrieval

38. Bridging CLIP and StyleGAN through Latent Alignment for Image Editing

39. InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings

40. TokenFlow: Rethinking Fine-grained Cross-modal Alignment in Vision-Language Retrieval

41. ConTextual Masked Auto-Encoder for Dense Passage Retrieval

42. Magic ELF: Image Deraining Meets Association Learning and Transformer

43. Real-time End-to-End Video Text Spotter with Contrastive Representation Learning

44. Estimation of non-symmetric and unbounded region of attraction using shifted shape function and R-composition

45. Deepfake Face Traceability with Disentangling Reversing Network

46. Diagnosing Ensemble Few-Shot Classifiers

47. Augmentation-Aware Self-Supervision for Data-Efficient GAN Training

48. ITTR: Unpaired Image-to-Image Translation with Transformers

49. Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing

50. Text Smoothing: Enhance Various Data Augmentation Methods on Text Classification Tasks
