Search

Your search keyword '"Chen, Jingdong"' showing total 31 results

Search Constraints

Start Over You searched for: Author "Chen, Jingdong" Remove constraint Author: "Chen, Jingdong" Database arXiv Remove constraint Database: arXiv
31 results on '"Chen, Jingdong"'

Search Results

1. POA: Pre-training Once for Models of All Sizes

2. Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

3. ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting

4. SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding

5. Low algorithmic delay implementation of convolutional beamformer for online joint source separation and dereverberation

6. Enhancing DETRs Variants through Improved Content Query and Similar Query Aggregation

7. Learning Dynamic Tetrahedra for High-Quality Talking Head Synthesis

8. M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining

9. Independent low-rank matrix analysis based on the Sinkhorn divergence source model for blind source separation

10. SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery

11. A computationally efficient semi-blind source separation based approach for nonlinear echo cancellation based on an element-wise iterative source steering

12. Large Multimodal Model Compression via Efficient Pruning and Distillation at AntGroup

13. LogicMP: A Neuro-symbolic Approach for Encoding First-order Logic Constraints

14. The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction

15. Mapping EEG Signals to Visual Stimuli: A Deep Learning Approach to Match vs. Mismatch Classification

16. An Anchor-Point Based Image-Model for Room Impulse Response Simulation with Directional Source Radiation and Sensor Directivity Patterns

17. The Multimodal Information based Speech Processing (MISP) 2022 Challenge: Audio-Visual Diarization and Recognition

18. Robust Manifold Nonnegative Tucker Factorization for Tensor Data Representation

19. SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization

20. Hierarchical Memory Learning for Fine-Grained Scene Graph Generation

21. Training Protocol Matters: Towards Accurate Scene Text Recognition via Training Protocol Searching

22. CBNet: A Composite Backbone Network Architecture for Object Detection

23. MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

24. CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes

25. AISHELL-4: An Open Source Dataset for Speech Enhancement, Separation, Recognition and Speaker Diarization in Conference Scenario

26. Affine Combination of Diffusion Strategies over Networks

27. Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification

28. Speaker Verification By Partial AUC Optimization With Mahalanobis Distance Metric Learning

29. End-to-End Model for Speech Enhancement by Consistent Spectrogram Masking

30. Adaptive Parameters Adjustment for Group Reweighted Zero-Attracting LMS

31. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Catalog

Books, media, physical & digital resources