Wget es a https protocol

2009-04-04 01:12

Elhataroztam, hogy levadaszom magamnak az osszes eddigi leforditott Full Circle Magazin peldanyokat. Termeszetesen nem kezdjuk el egyesevel letoltogetni, hanem ittvan nekunk egy remek kis program, nevezetesen a wget. Probaljuk meg eloszor letolteni az elso szamot:

$ wget https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile&do=get&target=issue1_hu.pdf
[1] 28279
[2] 28280
$ --00:47:24-- https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile
  => `Hungarian?action=AttachFile'
Resolving wiki.ubuntu.com... 91.189.90.19
Connecting to wiki.ubuntu.com|91.189.90.19|:443... connected.
ERROR: Certificate verification error for wiki.ubuntu.com: unable to get local issuer certificate
To connect to wiki.ubuntu.com insecurely, use `--no-check-certificate'.
Unable to establish SSL connection.

Hm-hm, ez igy nem lesz jo, HTTPS protocol leven a wget nem tudja levarazsolni a cert-et. Probajuk ki, amit felajanl.

$ wget --no-check-certificate https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile&do=get&target=issue1_hu.pdf
[1] 28290
[2] 28291
$ --00:53:24-- https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile
  => `Hungarian?action=AttachFile'
Resolving wiki.ubuntu.com... 91.189.90.19
Connecting to wiki.ubuntu.com|91.189.90.19|:443... connected.
WARNING: Certificate verification error for wiki.ubuntu.com: unable to get local issuer certificate
HTTP request sent, awaiting response... 403 Forbidden
00:53:24 ERROR 403: Forbidden.

Mint lathatjuk, semmivel sem vagyunk elorebb :) (*)
Viszont ha mindenkeppen szuksegunk van cert-re, hat oldjuk meg magunk!

# apt-get install ca-certificates

Majd probaljuk ujra!

$ wget https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile&do=get&target=issue1_hu.pdf
[1] 31008
[2] 31009
$ --01:02:01-- https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile
  => `Hungarian?action=AttachFile'
Resolving wiki.ubuntu.com... 91.189.90.19
Connecting to wiki.ubuntu.com|91.189.90.19|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
01:02:01 ERROR 403: Forbidden.

Na, haladunk, mar nem parazik a certre, de meg mindig nem tetszik neki valami. Kozelebbrol megvizsgalva latjuk, hogy nem is azt csinalja, amit szeretnenk, hiszen az & jel utan levag mindent. Vedjuk le!

$ wget 'https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile&do=get&target=issue1_hu.pdf'
--01:11:28-- https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile&do=get&target=issue1_hu.pdf
  => `Hungarian?action=AttachFile&do=get&target=issue1_hu.pdf'
Resolving wiki.ubuntu.com... 91.189.90.19
Connecting to wiki.ubuntu.com|91.189.90.19|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 8,401,201 (8.0M) [application/pdf]

100%[==================================================================>] 8,401,201 1.34M/s ETA 00:00

01:11:31 (1.40 MB/s) - `Hungarian?action=AttachFile&do=get&target=issue1_hu.pdf' saved [8401201/8401201]

(*) Igen-igen, ha a --no-check-certificate opcional levedjuk az url-t, az mukodik. Viszont
(a) hasznaljuk a trusted certet
(b) nem akartam leloni a poent

Es a script:

for ((i=1;i<23;i++)); do
  wget $(echo "https://wiki.ubuntu.com/UbuntuMagazine/TranslateFullCircle/Hungarian?action=AttachFile&do=get&target=issue${i}_hu.pdf") -O $(echo "issue_${i}.pdf")
done

--