aboutsummaryrefslogtreecommitdiffstats
BranchCommit messageAuthorAge
masterfilesets: handle weird figshare link-only case betterBryan Newbold5 months
bnewbold-argsmake hbase_table and zookeeper_hosts CLI argsBryan Newbold4 years
bnewbold-backfillmake hbase_table and zookeeper_hosts CLI argsBryan Newbold4 years
 
 
AgeCommit messageAuthorFilesLines
2021-12-16filesets: handle weird figshare link-only case betterHEADmasterBryan Newbold1-1/+4
2021-12-15lint ('not in')Bryan Newbold1-2/+2
2021-12-15lint: ignore unused 'sentry_client'Bryan Newbold1-1/+1
2021-12-15fix type with --enable-sentryBryan Newbold1-1/+1
2021-12-15ingest tool: allow enabling sentry (for exception debugging)Bryan Newbold1-0/+13
2021-12-15more fileset ingest tweaksBryan Newbold2-0/+7
2021-12-15fileset ingest: more requests timeouts, sessionsBryan Newbold3-37/+68
2021-12-15fileset ingest: create tmp subdirectories if neededBryan Newbold1-0/+5
2021-12-15fileset ingest: configure IA session from envBryan Newbold1-1/+6
2021-12-15pipenv: add pymupdf; update trafilaturaBryan Newbold2-420/+644
[...]
 
Clone
git@git.bnewbold.net:sandcrawler
https://git.bnewbold.net/sandcrawler
git://git.bnewbold.net/sandcrawler