This repository provides a WPF-based document parser and template editor built on GroupDocs.Parser for .NET.
It allows you to visually define parsing templates for documents and then apply them to extract structured data.
The solution includes project(s):
- GroupDocs.Parser.GUI – Windows desktop GUI for creating and editing parsing templates.
- Open & View Files – Load PDF or image documents into the viewer.
- Set License – Apply a valid GroupDocs license.
- Zoom Controls – Adjust document zoom level (+, -, 110%).
- Parse Methods
  - Text – Create a visual template field for text extraction.
  - Barcode – Create a visual template field for barcode extraction.
  - OCR – Enable text recognition for scanned images.
  - Parse Fields – Extract values using defined template fields.
  - Parse Document – Run full parsing logic over the entire document.
- Template Management
  - Generate Template – Automatically generate template JSON based on selected fields.
  - Visibility – Toggle template field overlays on the preview.
  - Template Fields Panel – Manage and inspect detected fields (Invoice Number, Date, Total, Tax, Details, etc.).
- Visual template editor for PDFs and scanned images.
- Supports three field types:
  - Field
  - Table (🚧 not supported in Beta)
  - Barcode
- Save & load templates as XML definition files.
- Apply templates to documents via console tool.
- OCR support for scanned PDFs and TIFFs.
├── Distribution/ # Precompiled binaries
│ └── GroupDocs.Parser.GUI/
├── src/ # Source code
│ └── GUI/
├── Examples/ # Sample documents, templates & output files
Update config.json with your GroupDocs.Parser license:
{
  "LicensePath": "D:\\Licenses\\GroupDocs.Parser.NET.lic"
}
👉 If you don’t have a license, request a free temporary one here:
Get Temporary License
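A quick check of config.json before launching the GUI can catch the most common mistakes, such as single backslashes in a Windows path (invalid in JSON) or a license file that has moved. The sketch below is an illustration in Python and is not part of this repository: the function name `check_license_config` is hypothetical, and it assumes config.json holds one top-level object with the `LicensePath` key shown above.

```python
import json
from pathlib import Path


def check_license_config(config_path):
    """Return a list of problems found in config.json (empty if none)."""
    path = Path(config_path)
    if not path.is_file():
        return [f"config file not found: {path}"]
    try:
        config = json.loads(path.read_text(encoding="utf-8"))
    except json.JSONDecodeError as exc:
        # Typical cause: unescaped backslashes such as "D:\Licenses\..."
        return [f"invalid JSON: {exc}"]
    if not isinstance(config, dict):
        return ["top-level JSON value must be an object"]
    license_path = config.get("LicensePath")
    if not isinstance(license_path, str) or not license_path.strip():
        return ['"LicensePath" is missing or empty']
    if not Path(license_path).is_file():
        return [f"license file not found: {license_path}"]
    return []


if __name__ == "__main__":
    # Assumes config.json is in the current working directory.
    for problem in check_license_config("config.json"):
        print(problem)
```

The script prints nothing when the file parses and the license path exists; otherwise it prints one line describing the first problem it found.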
Run the GUI:
.\Distribution\GroupDocs.Parser.GUI\GroupDocs.Parser.GUI.bat
Steps:
- Open a document.
- (Optional) Click Generate Template to enable auto field adjustment.
- Add fields visually (Field / Barcode).
- Save the template as XML.
- Try parsing inside the GUI.
- Set License: Load GroupDocs license.
- Open File: Browse and open a document file.
- Zoom Controls: Adjust view scale.
- Parsing Controls:
  - Text: Add a text extraction field visually.
  - Barcode: Add a barcode extraction field visually.
  - OCR: Enable OCR for scanned documents.
  - 500: Set parsing resolution (DPI).
  - Parse Fields: Extract based on template fields only.
  - Parse Document: Parse entire document.
  - Visibility: Show/hide template field rectangles.
  - Generate Template: Export defined fields into a reusable template file.
- Displays the loaded file.
- Allows placing template fields by clicking Text or Barcode and drawing over regions.
- Extracted fields are highlighted with bounding boxes.
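Because fields are drawn on a preview that may be zoomed, a rectangle drawn on screen has to be scaled back to page coordinates before it can be stored in a template. This README does not say how the GUI does this, so the following is only a minimal sketch of one common approach, assuming a uniform zoom factor and no scroll offset; `Rect` and `to_page_coords` are hypothetical names, not part of the tool or of GroupDocs.Parser.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    """Axis-aligned rectangle: top-left corner plus size."""
    x: float
    y: float
    width: float
    height: float


def to_page_coords(drawn: Rect, zoom_percent: float) -> Rect:
    """Scale a rectangle drawn at the given zoom back to 100% page units."""
    if zoom_percent <= 0:
        raise ValueError("zoom_percent must be positive")
    # Multiply before dividing so whole-number results stay exact.
    return Rect(
        drawn.x * 100.0 / zoom_percent,
        drawn.y * 100.0 / zoom_percent,
        drawn.width * 100.0 / zoom_percent,
        drawn.height * 100.0 / zoom_percent,
    )


if __name__ == "__main__":
    # A field drawn while the preview is at 200% zoom.
    print(to_page_coords(Rect(200, 100, 50, 20), 200))
```

Storing rectangles in page units rather than screen units keeps a template independent of the zoom level that was active when the field was drawn.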
Clone the repository and build with .NET:
git clone https://github.com/groupdocs-parser/groupdocs-parser-gui.git
cd groupdocs-parser-gui/src
dotnet build
Run GUI:
dotnet run --project GUI
Run Console Parser:
dotnet run --project DocumentParser
- Supported documents:
  - PDFs with text
  - Scanned PDFs & TIFF images (with OCR enabled)
- Supported field types:
  - Text field
  - Barcode field
- Templates work per page (can be reused across pages with same structure).
- Automatic detection of scanned vs text-based docs (auto OCR toggle).
- Implement Table field support.
- Improve GUI usability & stability.
- Automated matching between documents and templates.
This project is open-source.
You are welcome to:
- Suggest new features.
- Submit pull requests.
- Extend the tool with your own custom functionality.
This tool is provided for customer convenience under open-source terms.
For core parsing functionality, a GroupDocs.Parser for .NET license is required.
