Image-Text-to-Text
Transformers
ONNX
Safetensors
English
idefics3
conversational
asnassar commited on
Commit
848bd04
·
verified ·
1 Parent(s): ee67c58

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -5
README.md CHANGED
@@ -24,7 +24,7 @@ SmolDocling is a multimodal Image-Text-to-Text model designed for efficient docu
24
  - 📝 **Caption Correspondence** – Links captions to relevant images and figures.
25
  - 📜 **List Grouping** – Organizes and structures list elements correctly.
26
  - 📄 **Full-Page Conversion** – Processes entire pages for comprehensive document transformation.
27
- - 📂 **General Document Processing** – Optimized for non-scientific documents.
28
  - 🔄 **Seamless Docling Integration** – Import into **Docling** and export in multiple formats.
29
  - 📚 **Multi-Page & Full Document Conversion** – *Coming soon!* 🚧
30
 
@@ -33,7 +33,7 @@ SmolDocling is a multimodal Image-Text-to-Text model designed for efficient docu
33
  **Demo [optional]:** [More Information Needed]
34
 
35
 
36
- ## Model Summary
37
 
38
  - **Developed by:** Docling Team
39
  - **Model type:** Multi-modal model (image+text)
@@ -42,12 +42,11 @@ SmolDocling is a multimodal Image-Text-to-Text model designed for efficient docu
42
  - **Finetuned from model:** Based on [Idefics3](https://huggingface.co/HuggingFaceM4/Idefics3-8B-Llama3) (see technical summary)
43
 
44
 
45
- ### How to get started
46
 
47
  You can use transformers or docling to perform inference:
48
 
49
- # Transformers:
50
-
51
 
52
  ```python
53
  import torch
 
24
  - 📝 **Caption Correspondence** – Links captions to relevant images and figures.
25
  - 📜 **List Grouping** – Organizes and structures list elements correctly.
26
  - 📄 **Full-Page Conversion** – Processes entire pages for comprehensive document transformation.
27
+ - 📂 **General Document Processing** – Trained for non-scientific documents and scientific.
28
  - 🔄 **Seamless Docling Integration** – Import into **Docling** and export in multiple formats.
29
  - 📚 **Multi-Page & Full Document Conversion** – *Coming soon!* 🚧
30
 
 
33
  **Demo [optional]:** [More Information Needed]
34
 
35
 
36
+ #### Model Summary
37
 
38
  - **Developed by:** Docling Team
39
  - **Model type:** Multi-modal model (image+text)
 
42
  - **Finetuned from model:** Based on [Idefics3](https://huggingface.co/HuggingFaceM4/Idefics3-8B-Llama3) (see technical summary)
43
 
44
 
45
+ ## How to get started
46
 
47
  You can use transformers or docling to perform inference:
48
 
49
+ #### Transformers:
 
50
 
51
  ```python
52
  import torch