This package provides Python bindings for Upa URL – a library compliant with the WHATWG URL standard. This is the same standard followed by modern browsers and JavaScript runtimes such as Bun, Deno, and Node.js.
This package is designed to be as close to the URL standard as possible. It uses the same class names (URL, URLSearchParams), their function names, the same function parameters, and the same behavior.
pip install upa_url
If the binary wheel is not available for your platform, then you will need a C++ compiler that supports C++17 and CMake to build the Python package.
First you need import classes:
from upa_url import URL, URLSearchParams
The URL
class provides a structured way to parse, manipulate, and serialize URLs.
An URL can be parsed using one of two methods:
- Use
URL
constructor. It throws an exception on error:try: url = URL('https://upa-url.github.io/docs/') print(url.href) except Exception: print('URL parse error')
- Use
URL.parse
fucntion. It returnsNone
on error:url = URL.parse('docs', 'https://upa-url.github.io') if url is not None: print(url.href)
The parsed URL object components can be accessed using getters and setters: href
, origin
(only get value), protocol
, username
, password
, host
, hostname
, port
, pathname
, search
and hash
. Also you can get and change search parameters using searchParams
getter which returns URLSearchParams
object associated with URL:
url = URL.parse('https://example.org')
if url is not None:
url.searchParams.append('lang', 'lt')
print(url.href) # https://example.org/?lang=lt
To serialize parsed URL use url.href
or str(url)
.
If you need only check URL validity, then URL.canParse
function can be used:
if URL.canParse('docs', 'https://upa-url.github.io'):
print('URL is valid')
The URLSearchParams
class provides a structured way to parse, manipulate, and serialize the query string of a URL.
An URLSearchParams
object can be created by using constructor:
- To create empty:
params = URLSearchParams()
- Create from string:
params = URLSearchParams('lang=lt&id=123')
- Create from dictionary or list:
params1 = URLSearchParams({'lang': 'lt', 'id': '123'}) params2 = URLSearchParams([('lang', 'lt'), ['id', '123']])
Use get
or getAll
to retrieve parameter values:
params = URLSearchParams('a=b&a=c&b=10')
print(params.get('a')) # b
print(params.getAll('a')) # ['b', 'c']
To check for name and optionally value in parameters, use the has
function:
print(params.has('a')) # True
print(params.has('a', 'c')) # True
print(params.has('c')) # False
Iterate over all parameters:
params = URLSearchParams('a=1&b=2')
# Get all name - value pairs:
for name, value in params:
print(name, '=', value)
# Get all parameter names
for name in params.keys():
print(name)
# Get all parameter values
for value in params.values():
print(value)
Count parameters:
print(params.size) # 2
print(len(params)) # 2
To serialize URLSearchParams
object use str(params)
.
There are functions to manipulate search parameters:
- Add or replace parameters:
params = URLSearchParams('a=a') params.append('a', 'aa') params.append('b', 'bb') print(params) # a=a&a=aa&b=bb params.set('a', '1') print(params) # a=1&b=bb
- Remove parameters:
params = URLSearchParams('a=a&a=aa&b=b&b=bb') params.delete('a') print(params) # b=b&b=bb params.delete('b', 'bb') print(params) # b=b
- Sort parameters by name:
params = URLSearchParams('c=1&b=2&a=3') params.sort() print(params) # a=3&b=2&c=1
This package is licensed under the BSD 2-Clause License (see LICENSE
file).