Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
seanpedrickcase
/
document_redaction
like
7
Running
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
document_redaction
/
tools
1.12 MB
3 contributors
History:
233 commits
seanpedrickcase
Fixed Textract coordinate transformation issue. Made feedback buttons display optional in config.py
c1dca16
1 day ago
__init__.py
Safe
0 Bytes
Initial commit
over 1 year ago
auth.py
Safe
2.55 kB
Fixed on deprecated Github workflow functions. Applied linter and formatter to code throughout. Added tests for GUI load.
about 2 months ago
aws_functions.py
Safe
10 kB
Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues
4 days ago
aws_textract.py
Safe
44.1 kB
Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues
4 days ago
cli_usage_logger.py
Safe
11.4 kB
Revised environment variables for consistency.
18 days ago
config.py
Safe
48.1 kB
Fixed Textract coordinate transformation issue. Made feedback buttons display optional in config.py
1 day ago
custom_csvlogger.py
Safe
12.9 kB
Added regex search feature for multi-word text search
4 days ago
custom_image_analyser_engine.py
Safe
152 kB
Added regex functionality to deny lists. Corrected tesseract to word level parsing. Improved review search regex capabilities. Updated documentation
4 days ago
data_anonymise.py
Safe
52.8 kB
Revised environment variables for consistency.
18 days ago
file_conversion.py
Safe
131 kB
formatter and linter applied
5 days ago
file_redaction.py
Safe
243 kB
Fixed Textract coordinate transformation issue. Made feedback buttons display optional in config.py
1 day ago
find_duplicate_pages.py
Safe
72.6 kB
Added regex functionality to deny lists. Corrected tesseract to word level parsing. Improved review search regex capabilities. Updated documentation
4 days ago
find_duplicate_tabular.py
Safe
27.5 kB
Added possibility to specify allowed hosts. Fixed some tests to return more reliably. Fixed some issues related to file path checks not working correctly. Redaction should now return review files correctly at redaction and apply changes stages.
about 1 month ago
helper_functions.py
Safe
35 kB
Fixed Textract coordinate transformation issue. Made feedback buttons display optional in config.py
1 day ago
load_spacy_model_custom_recognisers.py
Safe
32.2 kB
Added regex functionality to deny lists. Corrected tesseract to word level parsing. Improved review search regex capabilities. Updated documentation
4 days ago
presidio_analyzer_custom.py
Safe
4.58 kB
Enabled export of both review pdfs and redacted pdfs from same redaction run. Added config variables for user guide url and showing redaction settings. Moved config variables around a bit. Minor GUI improvements
about 1 month ago
redaction_review.py
Safe
114 kB
Added regex search feature for multi-word text search
4 days ago
run_vlm.py
Safe
14.6 kB
Improved paddle and hybrid OCR analysis across all options. Tried to revise requirements for spaces
5 days ago
secure_path_utils.py
Safe
12.8 kB
Updated formatting and linting
19 days ago
secure_regex_utils.py
Safe
7.93 kB
Improved paddle and hybrid OCR analysis across all options. Tried to revise requirements for spaces
5 days ago
textract_batch_call.py
Safe
38.9 kB
Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues
4 days ago
word_segmenter.py
Safe
48.4 kB
Allow for tesseract to run OCR in line-level mode and then query LLM with line-level data. Added option for running as MCP server, added api for multi-word text search
4 days ago