wget은 HTTP나 FTP 등의 프로토콜을 통해 웹페이지나 파일을 다운받는 유틸리티이다.
아무튼 youtube에서 파일로 다운로드 받기위해 주소를 변환해서 시도를 해도
403 Forbidden 에러만 날뿐 다운로드 되지 않는다.
아무튼 쿠키를 저장하고, 이를 다시 불러들여 다운로드를 시도하면 제대로 받아진다.
쿠키를 이용해서 세션이 달라지면, 이전의 내용은 무효화 되서 그런것으로 생각된다.
- ‘--no-cookies’
- Disable
the use of cookies. Cookies are a mechanism for maintaining
server-side state. The server sends the client a cookie using the
Set-Cookie header, and the client responds with the same
cookie
upon further requests. Since cookies allow the server owners to keep
track of visitors and for sites to exchange this information, some
consider them a breach of privacy. The default is to use cookies;
however, storing cookies is not on by default.
- ‘--load-cookies file’
- Load
cookies from file before the first HTTP retrieval.
file is a textual file in the format originally used by
Netscape's
cookies.txt file.
You will typically use this option when mirroring sites that
require
that you be logged in to access some or all of their content. The login
process typically works by the web server issuing an http
cookie
upon receiving and verifying your credentials. The cookie is then
resent by the browser when accessing that part of the site, and so
proves your identity.
Mirroring such a site requires Wget to send the same cookies
your
browser sends when communicating with the site. This is achieved by
‘--load-cookies’—simply point
Wget to the location of the
cookies.txt file, and it will
send the same cookies your browser
would send in the same situation. Different browsers keep textual
cookie files in different locations:
- Netscape 4.x.
- The cookies are in ~/.netscape/cookies.txt.
- Mozilla and Netscape 6.x.
- Mozilla's
cookie file is also named cookies.txt,
located
somewhere under ~/.mozilla, in
the directory of your profile.
The full path usually ends up looking somewhat like
~/.mozilla/default/some-weird-string/cookies.txt.
- Internet Explorer.
- You can produce a
cookie file Wget can use by using the File menu,
Import and Export, Export Cookies. This has been tested with Internet
Explorer 5; it is not guaranteed to work with earlier versions.
- Other browsers.
- If you are using a
different browser to create your cookies,
‘--load-cookies’ will only work
if you can locate or produce a
cookie file in the Netscape format that Wget expects.
If you cannot use ‘--load-cookies’,
there might still be an
alternative. If your browser supports a “cookie manager”, you can use
it to view the cookies used when accessing the site you're mirroring.
Write down the name and value of the cookie, and manually instruct Wget
to send those cookies, bypassing the “official” cookie support:
wget --no-cookies --header "Cookie: name=value"
- ‘--save-cookies file’
- Save
cookies to file before exiting. This will not save cookies
that have expired or that have no expiry time (so-called “session
cookies”), but also see ‘--keep-session-cookies’.
- ‘--keep-session-cookies’
- When
specified, causes ‘--save-cookies’
to also save session
cookies. Session cookies are normally not saved because they are
meant to be kept in memory and forgotten when you exit the browser.
Saving them is useful on sites that require you to log in or to visit
the home page before you can access some pages. With this option,
multiple Wget runs are considered a single browser session as far as
the site is concerned.
Since the cookie file format does not normally carry session
cookies,
Wget marks them with an expiry timestamp of 0. Wget's
‘--load-cookies’ recognizes those
as session cookies, but it might
confuse other browsers. Also note that cookies so loaded will be
treated as other session cookies, which means that if you want
‘--save-cookies’ to preserve them
again, you must use
‘--keep-session-cookies’ again.
[링크 : http://www.gnu.org/software/wget/manual/html_node/HTTP-Options.html]
|
--save-cookies 로 저장한 youtube 쿠키이다. 음.. 무슨 의미지 -ㅁ-?
$ cat yt.cookie
# HTTP cookie file.
# Generated by Wget on 2010-04-10 11:57:59.
# Edit at your own risk.
.youtube.com TRUE / FALSE 1586228278 PREF f1=50000000&f2=8000000
.youtube.com TRUE / FALSE 1291604278 VISITOR_INFO1_LIVE FNfBrJzTQY
|
$ wget "http://www.youtube.com/watch?v=mdljV2uEs1A" --save-cookies yt.cookie
$ wget --load-cookies=yt.cookie "http://v22.lscache2.c.youtube.com/videoplayback?ip=211.0.0.0&sparams=id%2Cexpire%2Cip%2Cipbits%2Ci tag%2Calgorithm%2Cburst%2Cfactor&fexp=904405%2C900037&algorithm=throttle-factor&itag=35&ipbits=8&burst=40&sver=3&expire=1270890000&key=yt1&signature=5C611E956FB97E74D3435F8815A7A2376E3C61D4.C2C593CDDE0C15671462BB13C5404EC6927F7F7D&factor=1.25&id=99d963576b84b350" -O file.mp4
--2010-04-10 12:33:26-- http://v22.lscache2.c.youtube.com/videoplayback?ip=211.0.0.0&sparams=id%2Cexpire%2Cip%2Cipbits%2Citag%2Calgorithm%2Cburst%2Cfactor&fexp=904405%2C900037&algorithm=throttle-factor&itag=35&ipbits=8&burst=40&sver=3&expire=1270890000&key=yt1&signature=5C611E956FB97E74D3435F8815A7A2376E3C61D4.C2C593CDDE0C15671462BB13C5404EC6927F7F7D&factor=1.25&id=99d963576b84b350
Resolving v22.lscache2.c.youtube.com... 74.125.167.33
접속 v22.lscache2.c.youtube.com|74.125.167.33|:80... 접속됨.
HTTP request sent, awaiting response... 200 OK
Length: 15708973 (15M) [video/x-flv]
Saving to: `file.mp4'
100%[==========================================================================>] 15,708,973 105K/s in 2m 0s
2010-04-10 12:35:27 (128 KB/s) - `file.mp4' saved [15708973/15708973]
|
URL이 너무 길어서 별도의 파일이름을 지정해주지 않으면
Cannot write to `videoplayback?ip=211.0.0.0&sparams=id,expire,ip,ipbits,itag,algorithm,burst,factor&fexp=904405,900037&algorithm=throttle-factor&itag=35&ipbits=8&burst=40&sver=3&expire=1270890000&key=yt1&signature=5C611E956FB97E74D3435F8815A7A2376E3C61D4.C2C593CDDE0C15671462BB13C5404EC6927F7F7D&factor=1.25&id=99d963576b84b350' (File name too long).
|
주소가 255자를 넘어서는 관계로, 파일 이름으로 하기에는 너무 길다고 에러가 발생한다.
반드시
-O filename 으로 별도의 이름을 지정해 주어야 한다.
2010/04/09 - youtube 동영상 페이지 fmt_map, fmt_url_map, fmt_list, fmt_stream_map
[링크 :
http://kldp.org/node/75150]