LayoutLM - a microsoft Collection

microsoft 's Collections

Controllable Safety Alignment

Table Transformer

LayoutLM

updated May 1, 2025

The LayoutLM series are Transformer encoders useful for document AI tasks such as invoice parsing, document image classification and DocVQA.

microsoft/layoutlmv3-base

0.1B • Updated Apr 10, 2024 • 839k • 487

Note Currently the best LayoutLM model.
microsoft/layoutlmv2-base-uncased

Updated Sep 16, 2022 • 638k • 68
microsoft/layoutlm-base-uncased

0.1B • Updated Apr 16, 2024 • 194k • 62
microsoft/layoutxlm-base

Updated Sep 16, 2022 • 10.6k • 74

Note A multilingual variant trained on 100 languages.
impira/layoutlm-document-qa

Document Question Answering • 0.1B • Updated Mar 18, 2023 • 46.7k • 1.17k

Note A LayoutLM (v1) model fine-tuned to perform question answering over documents (DocVQA).
nielsr/layoutlmv3-finetuned-funsd

Token Classification • 0.1B • Updated Sep 16, 2023 • 5.73k • 32

Note A LayoutLMv3 model fine-tuned on the FUNSD dataset, a benchmark for document parsing.