aboutsummaryrefslogtreecommitdiffstats
path: root/proposals/20201012_no_capture.md
diff options
context:
space:
mode:
Diffstat (limited to 'proposals/20201012_no_capture.md')
-rw-r--r--proposals/20201012_no_capture.md7
1 files changed, 5 insertions, 2 deletions
diff --git a/proposals/20201012_no_capture.md b/proposals/20201012_no_capture.md
index bb47ea2..7f6a1f5 100644
--- a/proposals/20201012_no_capture.md
+++ b/proposals/20201012_no_capture.md
@@ -1,5 +1,8 @@
-status: in-progress
+status: work-in-progress
+
+NOTE: as of December 2022, bnewbold can't remember if this was fully
+implemented or not.
Storing no-capture missing URLs in `terminal_url`
=================================================
@@ -29,7 +32,7 @@ The current status quo is to store the missing URL as the last element in the
pipeline that would read from the Kafka feed and extract them, but this would
be messy. Eg, re-ingesting would not update the old kafka messages, so we could
need some accounting of consumer group offsets after which missing URLs are
-truely missing.
+truly missing.
We could add a new `missing_url` database column and field to the JSON schema,
for this specific use case. This seems like unnecessary extra work.