From 799d997bccf3fefd750925b73eb171f19b96522d Mon Sep 17 00:00:00 2001 From: Trenton H <797416+stumpylog@users.noreply.github.com> Date: Wed, 2 Jul 2025 09:51:53 -0700 Subject: [PATCH] Updated v3 Ideas List (markdown) --- v3-Ideas-List.md | 7 +++++-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/v3-Ideas-List.md b/v3-Ideas-List.md index 1e4dbf7..a66b750 100644 --- a/v3-Ideas-List.md +++ b/v3-Ideas-List.md @@ -14,6 +14,7 @@ - A default system configuration, controlled by staff/superusers - A user specific settings set - The final settings used for OCR are then the combined set, with user, then default system settings + - Other parsers, like text, etc also define these levels. But it's separate entirely. - Allow workflows/matching to set certain settings: - Document filename matches regex, disable archive generation and disable de-skew - When a document starts consumption, settings go through the pipeline with it. ie set once, not read (from DB) again @@ -51,6 +52,7 @@ ### Simpler consumer - Use something like [watchfiles](https://github.com/samuelcolvin/watchfiles) for a simpler loop with only itself as a dependency +- See some ideas in https://github.com/paperless-ngx/paperless-ngx/tree/feature-simpler-consume-loop ### Transition to Alpine container @@ -60,14 +62,15 @@ ### Ditch celery for Huey - Celery is big and bulky, with support for memcached, sqs, etc, which we don't need -- Huey also has nice Django integrations -- Would need to use its signals to implement task tracking +- Huey also has nice Django integrations, like for database connections, which we kind of hacked into Celery +- Would need to use its signals to implement task tracking, but the Django celery integrate is pretty "meh" ## Improved Tasks - Show scheduled tasks with next execution - Simple task status - Include more task types +- Include ability to trigger scheduled tasks "now" ## External Services