<p align="center">
  <a href="https://github.com/docling-project/docling-eval">
    <img loading="lazy" alt="Docling" src="docs/assets/docling-eval-pic.png" width="40%"/>
  </a>
</p>

# Docling-eval
Evaluate [Docling](https://github.com/DS4SD/docling) on various datasets. You can use the CLI:
```shell
$ poetry run docling_eval --help

 Usage: docling_eval [OPTIONS] COMMAND [ARGS]...

 Docling Evaluation CLI for benchmarking document processing tasks.

╭─ Options ───────────────────────────────────────────────────────────────────╮
│ --help          Show this message and exit.                                 │
╰─────────────────────────────────────────────────────────────────────────────╯
╭─ Commands ──────────────────────────────────────────────────────────────────╮
│ create        Create both ground truth and evaluation datasets in one step. │
│ create-eval   Create evaluation dataset from existing ground truth.         │
│ create-gt     Create ground truth dataset only.                             │
│ evaluate      Evaluate predictions against ground truth.                    │
│ visualize     Visualize evaluation results.                                 │
╰─────────────────────────────────────────────────────────────────────────────╯
```
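The subcommands are typically chained: build a ground-truth dataset, derive predictions, evaluate, then visualize. The sketch below is a hypothetical end-to-end run; the option names (`--benchmark`, `--output-dir`, `--modality`) and the `DPBench`/`layout` values are assumptions borrowed from an earlier version of this CLI's option list, so confirm them with each subcommand's `--help` before running.

```shell
# Hypothetical end-to-end run for one benchmark. Option names are assumptions;
# check `poetry run docling_eval <subcommand> --help` for the actual ones.
poetry run docling_eval create-gt   --benchmark DPBench --output-dir ./benchmarks/DPBench
poetry run docling_eval create-eval --benchmark DPBench --output-dir ./benchmarks/DPBench
poetry run docling_eval evaluate    --modality layout --benchmark DPBench --output-dir ./benchmarks/DPBench
poetry run docling_eval visualize   --modality layout --benchmark DPBench --output-dir ./benchmarks/DPBench
```

Pointing every step at the same output directory assumes `evaluate` and `visualize` pick up the datasets written by the `create-*` steps there.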
On our list for next benchmarks:

- [OmniOCR](https://github.com/getomni-ai/ocr-benchmark)
- Hyperscalers
- [CoMix](https://github.com/emanuelevivoli/CoMix/tree/main/docs/datasets)
- [DocVQA](https://huggingface.co/datasets/lmms-lab/DocVQA)
- [rd-tablebench](https://huggingface.co/datasets/reducto/rd-tablebench)

## Contributing

Please read [Contributing to Docling](https://github.com/DS4SD/docling/blob/main/CONTRIBUTING.md) for details.