My first article was about downloading movies from the page with a bit of parse and do everything in bash. This time the task gotta be more complicated. We need to download the movies from playlist which loaded by ajax.

First task is to find the check how the list with episodes appears on the page. I've checked in browser console ajax request and found PLAYLIST_ID in page source code.

<div class="playlists-ajaxed" data-playlist_id="666"></div>

So we should download page and get only the ID from it.

get_list_id() {
    echo $1 |
    wget -O- -i- --no-verbose | 
    hxnormalize -x | 
    sed -n 's/.*data-playlist_id="\([^"]\+\).*/\1/p'

I was really excited how this pretty function works!
Next we should make a request to custom url and get json in result.

get_json_list() {
    echo "$1" |
    wget -O- -i- --no-verbose | 
    jq -r .response | # get value by response key
    hxnormalize -x | # normalize html
    hxselect -i "li[data-id=\"0_0\"]" | # select videos only from first player
    sed 's/data-file/href/g' | #replacements to make hxwls work
    sed 's/<li /<a /g' |  #replacements to make hxwls work

Interesting thing, that we have "response" key in json response which contains html tags! Ah good old jquery days...
To get correct data from json I'm using "jq" program here. jq is a lightweight and flexible command-line JSON processor.
There are TWO playlists in html, so I selected li elements only for the first one. Great that they all have the same data-id attribute in li element. hxselect -i "li[data-id=\"0_0\"]".
To make the last command work - hxwls, which will parse HTML for the links I simply replace data-href attribute in li element to href and "<li" to "<a". Works perfect.
In result I have a variable with a list of urls, which can be processed like in previous script.

IFRAMES_LIST=$(get_json_list $PLAYLIST_ID)
if [ -z "$IFRAMES_LIST" ]; then
    echo "No iframes found. exit"

for iframe in $IFRAMES_LIST;
    VIDEO_URI=$(get_video_uri $iframe)
    FILENAME=$(get_filename_from_url $VIDEO_URI)
    ffmpeg -i $PLAYLIST -c copy -bsf:a aac_adtstoasc "$FILENAME.mp4" -y

Add new comment

The content of this field is kept private and will not be shown publicly.
  • No HTML tags allowed.
 oooooo     oooo  ooooo             o8o              .oooo.    oooo  
`888. .8' `888' `"' .dP""Y88b `888
`888. .8' 888 oooo oooo d8b ]8P' 888
`888. .8' 888 `888 `888""8P <88b. 888
`888.8' 888 888 888 `88b. 888
`888' 888 o 888 888 o. .88P 888
`8' o888ooooood8 888 d888b `8bd88P' o888o
.o. 88P
Enter the code depicted in ASCII art style.