Metadata-Version: 2.1
Name: django-html-xml-validator
Version: 1.0.0
Summary: Django middleware to validate HTML and XML responses
Home-page: https://github.com/itell-solutions/django_html_xml_validator
License: MIT
Keywords: django,html,validate,xml
Author: ITELL.SOLUTIONS GmbH
Requires-Python: >=3.7,<4.0
Classifier: Development Status :: 5 - Production/Stable
Classifier: Environment :: Web Environment
Classifier: Framework :: Django
Classifier: Framework :: Django :: 3
Classifier: Framework :: Django :: 4
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: English
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.7
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Topic :: Internet :: WWW/HTTP :: Dynamic Content
Classifier: Topic :: Software Development
Classifier: Topic :: Software Development :: Quality Assurance
Classifier: Topic :: Text Processing :: Markup :: HTML
Classifier: Topic :: Text Processing :: Markup :: XML
Requires-Dist: Django (>=3,<5)
Requires-Dist: lxml (>=4,<5)
Project-URL: Changes, https://github.com/itell-solutions/django_html_xml_validator/blob/main/CHANGES.md
Project-URL: Issue Tracker, https://github.com/itell-solutions/django_html_xml_validator/issues
Project-URL: Repository, https://github.com/itell-solutions/django_html_xml_validator.git
Description-Content-Type: text/markdown

[![PyPI](https://img.shields.io/pypi/v/django_html_xml_validator)](https://pypi.org/project/django_html_xml_validator/)
[![Python Versions](https://img.shields.io/pypi/pyversions/django_html_xml_validator.svg)](https://www.python.org/downloads/)
[![Build Status](https://github.com/itell-solutions/django_html_xml_validator/actions/workflows/build.yaml/badge.svg)](https://github.com/itell-solutions/django_html_xml_validator/actions/workflows/build.yaml)
[![Black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)
[![License](https://img.shields.io/github/license/itell-solutions/django_html_xml_validator)](https://opensource.org/licenses/MIT)

# Django HTML and XML Validator

`Django_html_xml_validator` is a Django middleware to validate HTML and XML
responses generated by your application. This includes but is not limited to
Django views using `render()` and Django HTML templates.

Features:

- **Specific error locations** in case validation finds issues.
- **Runs locally** without the need to upload your page to an external
  validation service.
- **Uses only Python packages** without the need to install external tools
  from other ecosystems.
- **Fast** because based on [lxml](https://lxml.de/) and its native components.

This makes it feasible to perform validation while running your test suite.

## Installation

To install, depending on your package manager, run:

```bash
pip install --update django_html_xml_validator
```

or

```bash
poetry add django_html_xml_validator
```

## Usage

To add validation to your project, add it to `settings.MIDDLEWARE`.

```python
MIDDLEWARE = [
    ...,
    "django_html_xml_validator.middleware.HtmlXmlValidatorMiddleware",
]
```

In most cases you only want it to validate the HTML generated by your views
directly, so it would be the last entry. Especially if you have other
middleware installed that modifies your HTML like adding the Django Debug
toolbar or minifying it.

For example:

```python
MIDDLEWARE = [
    # Possible middleware your project requires.
    ...,
     # Example middleware that modifies the HTML.
    "django_minify_html.middleware.MinifyHtmlMiddleware",
    "debug_toolbar.middleware.DebugToolbarMiddleware",
    ...,
    # Put validation middleware toward the end to ensure only your HTML/XML is validated.
    "django_html_xml_validator.middleware.HtmlXmlValidatorMiddleware",
]
```

After that, responses with a matching content type are validated:

- HTML:
  - application/xhtml+xml
  - text/html
- XML:
  - application/xml
  - text/xml

In case the response is valid, the middleware returns the original response
and HTTP status code verbatim.

In case errors have been found, the response includes an HTML page detailing
the errors with an HTTP status code of 500 (internal server error).

## Configuration

By default, validation is active when the Django `DEBUG` mode is enabled in
`settings.py`. In a reasonably configured project this means during local
development and while running the test suite, but not once deployed to a
server.

For more granular control, add the following to `settings.py`:

```python
VALIDATE_HTML = True
VALIDATE_XML = True
```

If you are sure all your HTML pages are actually XHTML (which sadly will not
be the case as soon as your code contains forms based on standard Django
forms), you can enforce HTML to be validated as XHTML:

```python
VALIDATE_HTML_AS_XHTML = True  # WARNING: Will fail with standard form templates
```

## Disabling validation for specific tests

In case validation is not useful for selected tests (for example when
processing deliberately huge documents), it can be disabled with the
[override_settings](https://docs.djangoproject.com/en/4.0/topics/testing/tools/#django.test.override_settings)
annotation. For example:

```python
from django.test import override_settings

@override_settings(VALIDATE_XML=False)
def test_can_build_huge_xml():
    ...
```

## Limitations

- Validation does not apply to stream responses.
- Validation of HTML5 uses a hack to ignore errors about invalid tags on
  sectioning elements like `<nav>` or `<article>`.
- Validation of XML only checks if the document is well-formed but does not
  validate against a schema or DTD. Technically lxml could do all this but
  would require more setup. If you need such a feature, feel free to submit a
  pull request.

## License

Copyright (c) 2022 ITELL.SOLUTIONS GmbH, Graz, Austria.

Distributed under the
[MIT license](https://en.wikipedia.org/wiki/MIT_License). For details refer to
the file `LICENSE`.

The source code is available from
<https://github.com/itell-solutions/django_html_xml_validator>.

