Spaces: seanpedrickcase/document_redaction
document_redaction/tools (revision d5b5291)
1.1 MB, 3 contributors, History: 229 commits
Latest commit 7bb945f by seanpedrickcase, 7 days ago: Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues
| File | Scan | Size | Last commit message | Updated |
| --- | --- | --- | --- | --- |
| __init__.py | Safe | 0 Bytes | Initial commit | over 1 year ago |
| auth.py | Safe | 2.55 kB | Fixed on deprecated Github workflow functions. Applied linter and formatter to code throughout. Added tests for GUI load. | about 2 months ago |
| aws_functions.py | Safe | 10 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |
| aws_textract.py | Safe | 44.1 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |
| cli_usage_logger.py | Safe | 11.4 kB | Revised environment variables for consistency. | 21 days ago |
| config.py | Safe | 47.6 kB | Added suffix to textract output files according to tasks included (e.g. signature analysis). Improved reporting when Textract client doesn't exist. Fixed display for cost and time taken. Changes to config variables to allow exclusion of PaddleOCR from display | 8 days ago |
| custom_csvlogger.py | Safe | 12.9 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |
| custom_image_analyser_engine.py | Safe | 147 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |
| data_anonymise.py | Safe | 52.8 kB | Revised environment variables for consistency. | 21 days ago |
| file_conversion.py | Safe | 131 kB | formatter and linter applied | 8 days ago |
| file_redaction.py | Safe | 241 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |
| find_duplicate_pages.py | Safe | 63.6 kB | Formatter and linter check | 22 days ago |
| find_duplicate_tabular.py | Safe | 27.5 kB | Added possibility to specify allowed hosts. Fixed some tests to return more reliably. Fixed some issues related to file path checks not working correctly. Redaction should now return review files correctly at redaction and apply changes stages. | about 1 month ago |
| helper_functions.py | Safe | 34.8 kB | formatter and linter applied | 8 days ago |
| load_spacy_model_custom_recognisers.py | Safe | 28.4 kB | Again revised spaCy language model load for different languages | 18 days ago |
| presidio_analyzer_custom.py | Safe | 4.58 kB | Enabled export of both review pdfs and redacted pdfs from same redaction run. Added config variables for user guide url and showing redaction settings. Moved config variables around a bit. Minor GUI improvements | about 2 months ago |
| redaction_review.py | Safe | 113 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |
| run_vlm.py | Safe | 14.6 kB | Improved paddle and hybrid OCR analysis across all options. Tried to revise requirements for spaces | 8 days ago |
| secure_path_utils.py | Safe | 12.8 kB | Updated formatting and linting | 22 days ago |
| secure_regex_utils.py | Safe | 7.93 kB | Improved paddle and hybrid OCR analysis across all options. Tried to revise requirements for spaces | 8 days ago |
| textract_batch_call.py | Safe | 38.9 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |
| word_segmenter.py | Safe | 48.3 kB | Fixed minor bugs related to Textract API calls, pyproject format. Removed print statements and fixed some future concat deprecation issues | 7 days ago |