Search

Your search for "Chen, Liang-Chieh" returned 279 results in total

Search Results

1. 1.58-bit FLUX

2. FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching

3. ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation

4. Randomized Autoregressive Visual Generation

5. MaskBit: Embedding-free Image Generation via Bit Tokens

6. Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization

7. An Image is Worth 32 Tokens for Reconstruction and Generation

8. Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting

9. Towards Open-Ended Visual Recognition with Large Language Models

10. COCONut: Modernizing COCO Segmentation

11. ViTamin: Designing Scalable Vision Models in the Vision-Language Era

12. SPFormer: Enhancing Vision Transformer with Superpixel Representation

13. MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation

14. A Simple Video Segmenter by Tracking Objects Along Axial Trajectories

15. Towards Open-Ended Visual Recognition with Large Language Model

16. PolyMaX: General Dense Prediction with Mask Transformer

18. Superpixel Transformers for Efficient Semantic Segmentation

19. Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

20. ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation

21. DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model

22. Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation

23. A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision

24. MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models

25. kMaX-DeepLab: k-means Mask Transformer

26. CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

27. Waymo Open Dataset: Panoramic Video Panoptic Segmentation

28. TubeFormer-DeepLab: Video Mask Transformer

29. DeepLab2: A TensorFlow Library for Deep Labeling

30. STEP: Segmenting and Tracking Every Pixel

31. ViP-DeepLab: Learning Visual Perception with Depth-aware Video Panoptic Segmentation

32. MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers

33. Scaling Wide Residual Networks for Panoptic Segmentation

34. View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose

35. DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution

36. Naive-Student: Leveraging Semi-Supervised Learning in Video Sequences for Urban Scene Segmentation

37. Axial-DeepLab: Stand-Alone Axial-Attention for Panoptic Segmentation

38. View-Invariant Probabilistic Embedding for Human Pose

39. Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation

40. SegSort: Segmentation by Discriminative Sorting of Segments

41. Panoptic-DeepLab

42. SPGNet: Semantic Prediction Guidance for Scene Parsing

43. Searching for MobileNetV3

44. FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

45. DeeperLab: Single-Shot Image Parser

46. Auto-DeepLab: Hierarchical Neural Architecture Search for Semantic Image Segmentation

47. k-means Mask Transformer

48. Searching for Efficient Multi-Scale Architectures for Dense Image Prediction

49. PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model

50. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
