Search

Your search for author "Jin, Lianwen" returned 632 results

Search Results

1. LEGO: Self-Supervised Representation Learning for Scene Text Images

2. Mini-Monkey: Multi-Scale Adaptive Cropping for Multimodal Large Language Models

3. Generalized Tampered Scene Text Detection in the era of Generative AI

4. TongGu: Mastering Classical Chinese Understanding with Knowledge-Grounded Large Language Models

5. DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming

6. Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction

7. Deciphering Oracle Bone Language with Diffusion Models

8. C$^{3}$Bench: A Comprehensive Classical Chinese Understanding Benchmark for Large Language Models

9. DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

10. VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-domain Generalization

11. Bridging the Gap Between End-to-End and Two-Step Text Spotting

12. HierCode: A Lightweight Hierarchical Codebook for Zero-shot Chinese Text Recognition

13. DiffChat: Learning to Chat with Text-to-Image Synthesis Models for Interactive Image Creation

14. Datasets for Large Language Models: A Comprehensive Survey

15. An open dataset for oracle bone script recognition and decipherment

16. An open dataset for the evolution of oracle bone characters: EVOBC

17. PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction

18. Progressive Evolution from Single-Point to Polygon for Scene Text

19. FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

20. UPOCR: Towards Unified Pixel-Level OCR Interface

21. Exploring OCR Capabilities of GPT-4V(ision): A Quantitative and In-depth Evaluation

22. Hierarchical Side-Tuning for Vision Transformers

23. ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer

24. Revisiting Scene Text Recognition: A Data Perspective

25. Scale-Aware Modulation Meet Transformer

26. ViTEraser: Harnessing the Power of Vision Transformers for Scene Text Removal with SegMIM Pretraining

27. DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

28. ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval

29. M$^{6}$Doc: A Large-Scale Multi-Format, Multi-Type, Multi-Layout, Multi-Language, Multi-Annotation Category Dataset for Modern Document Layout Analysis

30. On the Hidden Mystery of OCR in Large Multimodal Models

31. SPTS v2: Single-Point Scene Text Spotting

32. MSDS: A Large-Scale Chinese Signature and Token Digit String Dataset for Handwriting Verification

33. PageNet: Towards End-to-End Weakly Supervised Page-Level Handwritten Chinese Text Recognition

34. Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

35. Marior: Margin Removal and Iterative Content Rectification for Document Dewarping in the Wild

36. Don't Forget Me: Accurate Background Recovery for Text Removal via Modeling Local-Global Context

37. Bi-VLDoc: Bidirectional Vision-Language Modeling for Visually-Rich Document Understanding

38. Look Closer to Supervise Better: One-Shot Font Generation via Component-Based Discriminator

39. SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization

40. SwinTextSpotter: Scene Text Spotting via Better Synergy between Text Detection and Text Recognition

41. LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding

42. SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text

43. SPTS: Single-Point Text Spotting

46. SVC-onGoing: Signature Verification Competition

47. ICDAR 2021 Competition on Integrated Circuit Text Spotting and Aesthetic Assessment

48. MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

49. Tag, Copy or Predict: A Unified Weakly-Supervised Learning Framework for Visual Information Extraction using Sequences

50. Implicit Feature Alignment: Learn to Convert Text Recognizer to Text Spotter
