LayoutLM

updated May 1, 2025

The LayoutLM series consists of Transformer encoders for document AI tasks such as invoice parsing, document image classification, and document visual question answering (DocVQA).

  • microsoft/layoutlmv3-base

    0.1B • Updated Apr 10, 2024 • 574k • 468

    Note Currently the best LayoutLM model.


  • microsoft/layoutlmv2-base-uncased

    Updated Sep 16, 2022 • 398k • 66

  • microsoft/layoutlm-base-uncased

    0.1B • Updated Apr 16, 2024 • 92.5k • 61

  • microsoft/layoutxlm-base

    Updated Sep 16, 2022 • 5.84k • 73

    Note A multilingual variant trained on 100 languages.


  • impira/layoutlm-document-qa

    Document Question Answering • 0.1B • Updated Mar 18, 2023 • 7.35k • 1.15k

    Note A LayoutLM (v1) model fine-tuned to perform question answering over documents (DocVQA).


  • nielsr/layoutlmv3-finetuned-funsd

    Token Classification • 0.1B • Updated Sep 16, 2023 • 1.79k • 29

    Note A LayoutLMv3 model fine-tuned on the FUNSD dataset, a benchmark for document parsing.
