271 results for "Songyang, Zhang"
Search Results
2. Signal Processing Over Multilayer Graphs: Theoretical Foundations and Practical Applications.
3. SGTR+: End-to-End Scene Graph Generation With Transformer.
4. FedSC: Provable Federated Self-supervised Learning with Spectral Contrastive Objective over Non-i.i.d. Data.
5. MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark.
6. Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks.
7. From Pixels to Graphs: Open-Vocabulary Scene Graph Generation with Vision-Language Models.
8. FoundaBench: Evaluating Chinese Fundamental Knowledge Capabilities of Large Language Models.
9. InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD.
10. Adapting LLaMA Decoder to Vision Transformer.
11. Benchmarking Chinese Commonsense Reasoning of LLMs: From Chinese-Specifics to Reasoning-Memorization Correlations.
12. RadioGAT: A Joint Model-based and Data-driven Framework for Multi-band Radiomap Reconstruction via Graph Attention Networks.
13. InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning.
14. HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance.
15. InternLM-XComposer2: Mastering Free-form Text-Image Composition and Comprehension in Vision-Language Large Model.
16. Current trends of clinical trials involving CRISPR/Cas systems.
17. Improving Pixel-based MIM by Reducing Wasted Modeling Capability.
18. RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer.
19. TG-VQA: Ternary Game of Video Question Answering.
20. To Work-Conserving Packet Scheduling by Load Balance for VOQ Switches.
21. RME-GAN: A Learning Framework for Radio Map Estimation Based on Conditional Generative Adversarial Network.
22. Efficient cross-information fusion decoder for semantic segmentation.
23. Real-Time HIL Emulation of DRM With Machine Learning Accelerated WBG Device Models.
24. Make-A-Video: Text-to-Video Generation without Text-Video Data.
25. Learning a Grammar Inducer from Massive Uncurated Instructional Videos.
26. SGTR: End-to-end Scene Graph Generation with Transformer.
27. The Devil is in the Labels: Noisy Label Correction for Robust Scene Graph Generation.
28. Exemplar-Based Radio Map Reconstruction of Missing Areas Using Propagation Priority.
29. Robust Temporally-Coherent Strategy for Few-shot Video Instance Segmentation.
30. Expanding Language-Image Pretrained Models for General Video Recognition.
31. Learning Semantic Correspondence with Sparse Annotations.
32. MUGEN: A Playground for Video-Audio-Text Multimodal Understanding and GENeration.
33. Action Quality Assessment with Temporal Parsing Transformer.
34. Multilayer graph spectral analysis for hyperspectral images.
35. The Impact of Informational Intervention on HPV Vaccination Intention among Heterosexual Men.
36. RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer.
37. InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition.
38. PFL-GAN: When Client Heterogeneity Meets Generative Models in Personalized Federated Learning.
39. UFed-GAN: A Secure Federated Learning Framework with Constrained Computation and Unlabeled Data.
40. Diff-GO: Diffusion Goal-Oriented Communications to Achieve Ultra-High Spectrum Efficiency.
41. PixMIM: Rethinking Pixel Reconstruction in Masked Image Modeling.
42. LawBench: Benchmarking Legal Knowledge of Large Language Models.
43. MMBench: Is Your Multi-modal Model an All-around Player?
44. BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues.
45. Fake Alignment: Are LLMs Really Aligned Well?
46. Temporal Segment Transformer for Action Segmentation.
47. The Cultural Psychology of Large Language Models: Is ChatGPT a Holistic or Analytic Thinker?
48. PS-FedGAN: An Efficient Federated Learning Framework Based on Partially Shared Generative Adversarial Networks For Data Privacy.
49. Dynamic Grained Encoder for Vision Transformers.
50. Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA.
Discovery Service for Jio Institute Digital Library