Preserve percent-encoded octets in href paths #2173

Open
Sonai124 wants to merge 1 commit into microsoft:main from Sonai124:fix/preserve-percent-encoded-hrefs
@Sonai124

Summary

  • preserve syntactically valid %HH octets in href paths without decoding them as UTF-8
  • continue quoting raw Unicode, spaces, and literal or malformed percent signs
  • add HTML converter regression coverage for non-UTF-8 paths and URL escaping compatibility
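
For reviewers who want to see the intended behavior without reading the diff, here is a minimal sketch of the quoting rule described above. It is not the code from this PR: the helper name `quote_path_preserving_octets` and the regex constant are hypothetical, and it assumes only the path component is being quoted. The idea is to copy every syntactically valid `%HH` octet through untouched and pass everything else through `urllib.parse.quote`, so that bytes from encodings such as EUC-JP or Shift-JIS are never decoded as UTF-8.

```python
import re
from urllib.parse import quote

# Hypothetical helper, for illustration only; not the implementation in this PR.
# Matches one syntactically valid percent-encoded octet, e.g. "%A5" or "%e3".
_PCT_OCTET = re.compile(r"%[0-9A-Fa-f]{2}")


def quote_path_preserving_octets(path: str) -> str:
    """Quote an href path while leaving valid %HH octets exactly as written."""
    parts = []
    pos = 0
    for match in _PCT_OCTET.finditer(path):
        # Quote the text between octets: raw Unicode, spaces, stray "%".
        parts.append(quote(path[pos:match.start()], safe="/"))
        # Copy the octet verbatim; it is never decoded, so non-UTF-8 bytes survive.
        parts.append(match.group(0))
        pos = match.end()
    # Quote whatever follows the last valid octet.
    parts.append(quote(path[pos:], safe="/"))
    return "".join(parts)


if __name__ == "__main__":
    # Non-UTF-8 octets are preserved; the space is escaped.
    print(quote_path_preserving_octets("/docs/%A5%C6%A5%B9 file.html"))
    # A literal or malformed percent sign is escaped as %25.
    print(quote_path_preserving_octets("/sale/100%/%zz"))
```

Under these assumptions, a round trip through `urllib.parse.unquote` is avoided entirely, which is what keeps octets that are not valid UTF-8 from being turned into replacement characters.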

Fixes #2171

Testing

  • test_html_converter.py: 6 passed on Python 3.10, 3.11, and 3.12
  • full local/CI-equivalent package suite: 307 passed, 34 skipped, 1 Windows-incompatible URI test deselected
  • repository pre-commit hook: passed

Development

Successfully merging this pull request may close these issues.

Preserve percent-encoded octets in hrefs to avoid UTF-8 replacement (EUC-JP / Shift-JIS)
