sandcrawler
path: root/python/sandcrawler/fileset_strategies.py
| Commit message | Author | Date | Files | Lines |
|---|---|---|---|---|
| filesets: fix archive.org path naming | Bryan Newbold | 2022-03-29 | 1 | -7/+8 |
| bugfix: sha1/md5 typo | Bryan Newbold | 2022-03-23 | 1 | -1/+1 |
| small lint/typo/fmt fixes | Bryan Newbold | 2022-02-24 | 1 | -1/+1 |
| ingest: handle more fileset failure modes | Bryan Newbold | 2022-02-18 | 1 | -1/+5 |
| fileset ingest: better verification of resources | Bryan Newbold | 2022-01-13 | 1 | -7/+23 |
| null-body -> empty-blob | Bryan Newbold | 2022-01-13 | 1 | -0/+4 |
| more fileset ingest tweaks | Bryan Newbold | 2021-12-15 | 1 | -0/+5 |
| fileset ingest: more requests timeouts, sessions | Bryan Newbold | 2021-12-15 | 1 | -27/+49 |
| fileset ingest: create tmp subdirectories if needed | Bryan Newbold | 2021-12-15 | 1 | -0/+5 |
| fileset ingest: configure IA session from env | Bryan Newbold | 2021-12-15 | 1 | -1/+6 |
| fileset ingest: actually use spn2 CLI flag | Bryan Newbold | 2021-12-11 | 1 | -1/+1 |
| codespell typos in python (comments) | Bryan Newbold | 2021-11-24 | 1 | -1/+1 |
| make fmt (black 21.9b0) | Bryan Newbold | 2021-10-27 | 1 | -71/+100 |
| fileset: refactor out tables of helpers | Bryan Newbold | 2021-10-27 | 1 | -8/+0 |
| start handling trivial lint cleanups: unused imports, 'is None', etc | Bryan Newbold | 2021-10-26 | 1 | -11/+5 |
| make fmt | Bryan Newbold | 2021-10-26 | 1 | -27/+41 |
| python: isort all imports | Bryan Newbold | 2021-10-26 | 1 | -6/+7 |
| more small fileset ingest tweaks | Bryan Newbold | 2021-10-26 | 1 | -2/+7 |
| more fileset iteration | Bryan Newbold | 2021-10-15 | 1 | -6/+20 |
| move SPNv2 'simple_get' logic to SPN client | Bryan Newbold | 2021-10-15 | 1 | -23/+1 |
| filesets: iteration of implementation and docs | Bryan Newbold | 2021-10-15 | 1 | -10/+13 |
| fileset ingest: improve error handling | Bryan Newbold | 2021-10-15 | 1 | -4/+3 |
| improvements to platform helpers | Bryan Newbold | 2021-10-15 | 1 | -1/+4 |
| progress on web ingest strategy | Bryan Newbold | 2021-10-15 | 1 | -1/+102 |
| fileset ingest progress for dataverse | Bryan Newbold | 2021-10-15 | 1 | -10/+135 |
| progress on dataset ingest | Bryan Newbold | 2021-10-15 | 1 | -6/+51 |
| progress on fileset/dataset ingest | Bryan Newbold | 2021-10-15 | 1 | -0/+22 |