SCUT-DLVCLab

All

25 repositories

MegaHan97K
Public
[PR 2025] The official GitHub page of "MegaHan97K: A Large-Scale Dataset for Mega-Category Chinese Character Recognition with over 97K Categories"
Python
•5•70•2•0•Updated Dec 22, 2025Dec 22, 2025
AutoHDR
Public
[ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restoration"
Python
•4•51•2•0•Updated Dec 22, 2025Dec 22, 2025
HisDoc1B
Public
1•17•1•0•Updated Dec 18, 2025Dec 18, 2025
WenMind
Public
WenMind benchmark.
Python
•1•8•0•0•Updated Dec 17, 2025Dec 17, 2025
MCS-Bench
Public
Python
•1•3•0•0•Updated Dec 17, 2025Dec 17, 2025
ACP-RAG
Public
[NAACL 2025] Large-Scale Corpus Construction and Retrieval-Augmented Generation for Ancient Chinese Poetry: New Method and Data Insights (ACP-Corpus; ACP-QA; ACP-RAG)
Python
•0•5•0•0•Updated Dec 17, 2025Dec 17, 2025
OCR-Reasoning
Public
[arXiv: 2505.17163] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning
Python
•
Apache License 2.0
•3•68•2•0•Updated Dec 17, 2025Dec 17, 2025
TongGu-VL
Public
A Multimodal large language model for Classical Chinese Studies
0•1•0•0•Updated Dec 16, 2025Dec 16, 2025
TVSIP
Public
[ACM MM 2025] The official GitHub page of "From Pixels to Semantics: A Novel MLLM-Driven Approach for Explainable Tampered Text Detection"
Python
•0•8•0•0•Updated Dec 10, 2025Dec 10, 2025
DocHighlight
Public
[PRCV 25] Towards Real-World Document Specular Highlight Removal: The DocHighlight Dataset and DocSHRNet Method
0•2•0•0•Updated Oct 8, 2025Oct 8, 2025
LongHisDoc
Public
A Comprehensive Benchmark for Chinese Long Historical Document Understanding
Python
•0•4•0•0•Updated Sep 23, 2025Sep 23, 2025
MCCD
Public
[ICDAR 2025] The official GitHub page of "MCCD: A Multi-Attribute Chinese Calligraphy Character Dataset Annotated with Script Styles, Dynasties, and Calligraphers"
Python
•0•11•2•0•Updated Sep 2, 2025Sep 2, 2025
DOLPHIN
Public
[IEEE TIFS 2024] Online Writer Retrieval with Chinese Handwritten Phrases: A Synergistic Temporal-Frequency Representation Learning Approach
Python
•
GNU General Public License v3.0
•1•55•1•0•Updated Aug 3, 2025Aug 3, 2025
PAVENet
Public
[IEEE TPAMI 2025] Privacy-Preserving Biometric Verification With Handwritten Random Digit String
Python
•
GNU General Public License v3.0
•0•65•1•0•Updated Aug 3, 2025Aug 3, 2025
SigBench
Public
GNU General Public License v3.0
•0•0•0•0•Updated Jun 19, 2025Jun 19, 2025
AutoScaler
Public
[PR 2026] The official GitHub page of "AutoScaler: Self Scale Alignment for Handwritten Mathematical Expression Recognition"
Python
•0•9•1•0•Updated Jun 8, 2025Jun 8, 2025
C3bench
Public
C3 benchmark
0•3•1•0•Updated Mar 30, 2025Mar 30, 2025
Document-AI-Recommendations
Public
Algorithms, papers, datasets, performance comparisons for Document AI. Continuously updating.
document-understanding table-structure-recognition key-information-extraction document-ai visual-information-extraction
9•203•0•0•Updated Mar 1, 2025Mar 1, 2025
DCOH-120K
Public
1•5•0•0•Updated Feb 20, 2025Feb 20, 2025
RFUND
Public
[MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking for End-to-end Document Pair Extraction"
ocr document-understanding key-information-extraction document-ai visual-information-extraction
0•20•0•0•Updated Dec 4, 2024Dec 4, 2024
TongGu-LLM
Public
[EMNLP 2024] TongGu, a classical Chinese language model.
4•53•6•0•Updated Sep 28, 2024Sep 28, 2024
.github
Public
0•0•0•0•Updated Jun 4, 2024Jun 4, 2024
SCUT-EnsExam
Public
SCUT-EnsExam is a real-world handwritten text erasure dataset for examination paper scenarios, which consists of 545 examination paper images. The dataset is randomly divided into training set and test set of 430 and 115 images, respectively.
0•14•0•0•Updated Dec 5, 2023Dec 5, 2023
GPT-4V_OCR
Public
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)
Python
•4•125•0•0•Updated Nov 13, 2023Nov 13, 2023
Mnist-99.7-Accuracy-with-Pytorch
Public
A CNN model builds with Pytorch and reaches 99.7% accuracy
gui pyqt5 pytorch mnist
Python
•2•4•0•0•Updated May 1, 2021May 1, 2021