From d6457355b5241d32333718ba7aca316976695019 Mon Sep 17 00:00:00 2001 From: Bryan Newbold Date: Wed, 24 Apr 2019 02:13:00 +0000 Subject: small doc/TODO notes --- arabesque.py | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) (limited to 'arabesque.py') diff --git a/arabesque.py b/arabesque.py index 8dbc0ca..55e6223 100755 --- a/arabesque.py +++ b/arabesque.py @@ -11,8 +11,8 @@ Commands/modes: - backward - forward - everything -- postprocess -- dump_json +- postprocess +- dump_json Design docs in DESIGN.md @@ -21,8 +21,9 @@ Software under the GPLv3 license (a copy of which should be included with this file). TODO: +- pass SHA-1 and timestamp in forward mode (?) +- include final_size (if possible from crawl log) - open map in read-only when appropriate -- some kind of stats dump command? (querying sqlite) - should referrer map be UNIQ? - forward outputs get generated multiple times? - try: https://pypi.org/project/urlcanon/ -- cgit v1.2.3