Mode | Name | Size
---|---|---
-rw-r--r-- | .coveragerc | 32
-rw-r--r-- | .gitignore | 29
-rw-r--r-- | .pylintrc | 409
-rw-r--r-- | Pipfile | 651
-rw-r--r-- | Pipfile.lock | 62666
-rw-r--r-- | TODO | 52
-rw-r--r-- | common.py | 2618
-rwxr-xr-x | grobid2json.py | 5595
-rwxr-xr-x | ia_pdf_match.py | 2889
-rwxr-xr-x | ingest_file.py | 10487
-rwxr-xr-x | kafka_grobid.py | 13599
-rw-r--r-- | pytest.ini | 171
d--------- | sandcrawler | 111
d--------- | scripts | 460
d--------- | tests | 161
l--------- | title_slug_blacklist.txt -> ../scalding/src/main/resources/slug-denylist.txt | 48