Commit Graph

18832 Commits (b5cb53beac04d5e9fca5d86d16430149acfd9766)
 

Author SHA1 Message Date
dirkf b5cb53beac
Merge 05aa2ad2f8 into a96a45b2cd 3 months ago
Zizheng Guo a96a45b2cd
[Vimeo] Improve `config` extraction (#32742)
* update for more robust json parsing
3 months ago
dirkf 05aa2ad2f8 [utils] `mode` might be `None` in `write_string()`
* see https://github.com/yt-dlp/yt-dlp/issues/8816
3 months ago
dirkf 80cb917ef6 [utils] Unescape HTML5 named character references (with no ;) 3 months ago
dirkf 059ef5b55e [utils] Handle ss:xxx in parse_duration(), based on yt-dlp 8bd1c00bf399ba7002fc21cd399c931e9d301bd8 3 months ago
dirkf 645d7a399c [utils] Fix/improve InAdvancePagedList, from yt-dlp d37707bda466282b66ce381cbfd734f899f9e06e 3 months ago
dirkf 0205fea833 [utils] Improve parse_count() with single regex, based on yt-dlp 352d5da81219e2675ef8cac9383ab0dfbd161a19 3 months ago
dirkf f5f1908275 [utils] Add parsing YYYYMMDD dates, also in Nov/Dec (yt-dlp PR #2094) 3 months ago
dirkf 1d9df28c7d [utils] Recognise FLAC audio in parse_codecs() 3 months ago
dirkf 77c778b838 [utils] Sort Chrome versions used for UAs; drop obsolete `Accept-Charset` header 3 months ago
dirkf 5988a39483 [utils] Simplify int_or_none(), based on yt-dlp 9e907ebddf9c9c3084fb1b24156c9e3786ab428e
Also swallow inf, nan
3 months ago
dirkf 7a438da3b1 [utils] Ensure a value from determine_protocol() 3 months ago
dirkf 045ff706ee [utils] Work-around for yt-dlp issue 1060 (skip bad certs from Windows, Py>=3.7) 3 months ago
dirkf eb93aaf702 [utils] Improve ExtractorError with msg IV and ie constructor param 3 months ago
df 58f15bb6c8 [utils] Don't find classname as part of class="... x-classname ...", etc
Eg, in [1], the class with name 'plist-info' was found when searching for 'info'.

1. https://github.com/ytdl-org/youtube-dl/issues/30230
3 months ago
df a3fe1d1dc6 [utils] Avoid scrubbing supplied cookie file on failed update 3 months ago
df 973f76cf7b [utils] Detect extension from any RFC Content-Disposition syntax
Add support for unquoted token and RFC 5987 extended parameter syntax
3 months ago
df 7990d1e630 [utils] Disambiguate 4-digit year and time-zone suffix
Restore check omitted from extract_timezone(); adjust DATE_FORMATS_DAY/MONTH_FIRST; add tests.
3 months ago
df d87e2ad040 [utils] Fix urlhandle_detect_ext() non-ASCII error in Py2, with test 3 months ago
df 06d489c522 [utils] Small fixes to utils and compat and test 3 months ago
hatsomatt 820fae3b3a [Videa] Fix extraction
* update API URL
* from https://github.com/yt-dlp/yt-dlp/pull/8003
* thanks to the authors!

Closes yt-dlp/7427
Authored by: hatsomatt, aky-01
3 months ago
dirkf aef24d97e9 [Videa] Align with yt-dlp 3 months ago
dirkf f7b30e3f73 [XFileShare] Update extractor for 2024
* simplify aa_decode()
* review and update supported sites and tests
* in above, include FileMoon.sx, and remove separate module
* incorporate changes from yt-dlp
* allow for decoding multiple scripts (eg, FileMoon)
* use new JWPlayer extraction
3 months ago
dirkf f66372403f [InfoExtractor] Rework and improve JWPlayer extraction
* use traverse_obj() and _search_json()
* support playlist `.load({**video1},{**video2}, ...)`
* support transform_source=... for _extract_jwplayer_data()
3 months ago
dirkf 7216fa2ac4 [InfoExtractor] Add `_search_json()`
* uses the error diagnostic to truncate the JSON string
* may be confused by non-C-Pythons
3 months ago
dirkf acc383b9e3 [utils] Let int_or_none() accept a base, like int() 3 months ago
Hubert Hirtz f0812d7848
[utils] Handle user:pass in URLs (#28801)
* Handle user:pass in URLs

Fixes "nonnumeric port" errors when youtube-dl is given URLs with
usernames and passwords such as:

    http://username:password@example.com/myvideo.mp4

Refs:
- https://en.wikipedia.org/wiki/Basic_access_authentication
- https://tools.ietf.org/html/rfc1738#section-3.1
- https://docs.python.org/3.8/library/urllib.parse.html#urllib.parse.urlsplit

Fixes #18276 (point 4)
Fixes #20258
Fixes #26211 (see comment)

* Align code with yt-dlp

---------

Co-authored-by: dirkf <fieldhouse@gmx.net>
3 months ago
Aaron Tan 40bd5c1815
[caffeine.tv] Add new extractor (#32514)
* Add CaffeineTVIE info extractor to support site caffeine.tv

---------

Co-authored-by: dirkf <fieldhouse@gmx.net>
3 months ago
dirkf 70f230f9cf
[GBNews]Add new extractor for GB News TV channel (#29432)
* Add extractor for GB News TV channel

* Support more GBNews URL formats
Allow alphanumeric and _ in place of `shows`, which redirect to site's preferred URL

* Update for 2024
3 months ago
dirkf 48ddab1f3a
[downloader/external] Fix WgetFD proxy (rev 2)
From PR (defunct source), closes #29343.
Matches https://github.com/yt-dlp/yt-dlp/pull/3152
Thx former user kikuyan.
3 months ago
dirkf 7687389f08 [Vbox7] Improve extraction, adding features from yt-dlp PR #9100
* changes from https://github.com/yt-dlp/yt-dlp/pull/9100 (thx
seproDev):
  - attempt HLS extraction
  - re-enable XFF
  - test `view_count`, `duration` extraction
* improve commenting, error checks
3 months ago
dirkf 4416f82c80 [Vbox7IE] Sanitise ld+json containing unexpected characters
* based on PR #29680
* added hack to force invoking `transform_source`
* fixes #26218
4 months ago
dirkf bdda6b81df [Vbox7IE] Improve extraction
* DASH extraction no longer fails with new range support
* but always find combined formats if available
* suppress ineffective XFF geo-bypass (causes time-outs)
* adapted from https://github.com/ytdl-org/youtube-dl/pull/29680
* thx former GH user kikuyan
4 months ago
dirkf 1fd8f802b8 [InfoExtractor] Correctly resolve BaseURL in DASH manifest
Specs:
* ISO/IEC 23009-1:2012 section 5.6
* RFC 3986 section 5.
4 months ago
dirkf 4eaeb9b2c6 [InfoExtractor] Support byte range for DASH
* adapted from https://github.com/ytdl-org/youtube-dl/pull/30279
* thx former GH user kikuyan
4 months ago
dirkf bec9180e89 [downloader/dash] Support `range` in fragment (format f'{start}-{end}')
* adapted from https://github.com/ytdl-org/youtube-dl/pull/30279
 * thx former GH user kikuyan
4 months ago
dirkf c58b655a9e [InfoExtractor] Support DASH subtitle extraction (yt-dlp back-port) 4 months ago
dirkf dc512e3a8a [YouTube] Fix `like_count` extraction using `likeButtonViewModel`
* also fix various tests
* TODO: check against yt-dlp tests
4 months ago
dirkf f8b0135850 [YouTube] Rework n-sig processing, realigning with yt-dlp
* apply n-sig before chunked fragments, fixes #32692
4 months ago
dirkf 640d39f03a [InfoExtractor] Support some warning and `._downloader` shortcut methods from yt-dlp 4 months ago
dirkf 6651871416 [compat] Rework compat for `method` parameter of `compat_urllib_request.Request` constructor
* fixes #32573
* does not break `utils.HEADrequest` (eg)
4 months ago
mk-pmb be008e657d [core] Fix format string injection for metadata JSON filename message. 6 months ago
Robotix b1bbc1e502
[Epidemic Sound] Add new extractor (#32628)
* Add simple extractor
* Support separate tracks
* Use index as id instead of slug

---------

Co-authored-by: dirkf <fieldhouse@gmx.net>
6 months ago
dirkf 55a442adae
[Imgur] Overhaul extractor module (#32612)
Revise extractors for new API and page formats
6 months ago
mimvahedi c62936a5f2
[telewebion] Fix extraction (#32634)
* [telewebion] fix extraction

Resolves https://github.com/ytdl-org/youtube-dl/issues/5135#issuecomment-932952119

---------

Co-authored-by: dirkf <fieldhouse@gmx.net>
6 months ago
dirkf 427472351c [utils] Make restricted filenames ignore characters in Unicode categories Mark, Other
Resolves #32629
6 months ago
dirkf c6538ed323 [workflows/ci.yml] Use setup-python for now released Python 3.12 6 months ago
dirkf 8d227cb97b [workflows/ci.yml] Actually use default values for push and pull_request 6 months ago
dirkf 4e115e18cb [workflows/ci.yml] Run apt-get update before installing 6 months ago
ReenigneArcher b7fca0fab3 [Youtube] Update consent cookie handling to match site
Apologies for force push!
[skip ci]
6 months ago