diff options
| author | Martin Czygan <martin.czygan@gmail.com> | 2021-10-15 21:56:52 +0200 | 
|---|---|---|
| committer | Martin Czygan <martin.czygan@gmail.com> | 2021-10-15 22:04:04 +0200 | 
| commit | 1b4136d415cbe549c735d459cd0ab1345bb37011 (patch) | |
| tree | e2630189ca57d5b54c4ec21fdc5008e553aca5e6 /python | |
| parent | 4be667616ae209fa0efaaa2350c1b75eacf0e344 (diff) | |
| download | fatcat-1b4136d415cbe549c735d459cd0ab1345bb37011.tar.gz fatcat-1b4136d415cbe549c735d459cd0ab1345bb37011.zip | |
pubmed: switch default http site to retrieve update files
Proxy started to throw: "dial tcp: lookup ftp.ncbi.nlm.nih.gov on
[::1]:53: read udp [::1]:45178->[::1]:53: read: connection refused"
NIH has a http version on it's own, try to use that.
Diffstat (limited to 'python')
| -rw-r--r-- | python/fatcat_tools/harvest/pubmed.py | 6 | 
1 files changed, 4 insertions, 2 deletions
| diff --git a/python/fatcat_tools/harvest/pubmed.py b/python/fatcat_tools/harvest/pubmed.py index 579dd505..92798a99 100644 --- a/python/fatcat_tools/harvest/pubmed.py +++ b/python/fatcat_tools/harvest/pubmed.py @@ -263,10 +263,12 @@ def ftpretr(url, max_retries=10, retry_delay=1, proxy_hostport=None):              return f.name -def ftpretr_via_http_proxy(url, proxy_hostport="159.69.240.245:15201", max_retries=10, retry_delay=1): +def ftpretr_via_http_proxy(url, proxy_hostport="ftp.ncbi.nlm.nih.gov", max_retries=10, retry_delay=1):      """      Fetch file from FTP via external HTTP proxy, e.g. ftp.host.com:/a/b/c would -    be retrievable via proxy.com/a/b/c. +    be retrievable via proxy.com/a/b/c; (in 09/2021 we used +    "159.69.240.245:15201" as proxy_hostport but that started to fail +    2021-10-15; just switch to NIH's http version).      """      parsed = urlparse(url)      server, path = parsed.netloc, parsed.path | 
