I have an IMDB sources script to scrape movie metadata for my library. A few months ago it appears the structure of the returned HTML changed significantly. I am now working to fix the script. I have noticed that the HTML in the debug output is significantly different from the HTML source I see directly in my browser (i.e. Chrome), and the problem is there is a lot of data I want to scrape that is only available in the HTML source Chrome receives. I presume IMDB is detecting the browser type and returning different HTML to the web sources framework. Does anyone have a suggestion on how to work around this so I can receive the richer HTML the browser receives?
I've tried it with a simple test script and got the very same keywords via the Web Source Framework as you're getting when using Chrome.
# ###################################################################
# Mp3tag Tag Source Test
# ###################################################################
[Name]=imdb.com
[BasedOn]=www.imdb.com
[PreviewUrl]=https://www.imdb.com/title/tt1856101/?ref_=nv_sr_1
[AlbumUrl]=https://www.imdb.com/title/tt1856101/?ref_=nv_sr_1
[WordSeparator]=%20
[SearchBy]=%title%
[Encoding]=url-utf-8
[ParserScriptIndex]=...
#
[ParserScriptAlbum]=...
# ###################################################################
# A L B U M
# ###################################################################
debugwriteinput "C:\Users\florian\Desktop\debug.out"
debug "on" "C:\Users\florian\Desktop\debug.txt"
Thanks Florian. That was the key I needed. The script I started with would use the IMDB "references" view for the AlbumUrl, which dramatically changed at the end of last year. This explains why it is different from what I saw in Chrome. The reason the data, such as keywords, is different is due to the data in the standard and references view being different (i.e. plot keywords).