xmltodict

Makes working with XML feel like you are working with JSON

These details have not been verified by PyPI

Project links

Homepage

Project description

# xmltodict

`xmltodict` is a Python module that makes working with XML feel like you are working with [JSON](http://docs.python.org/library/json.html), as in this ["spec"](http://www.xml.com/pub/a/2006/05/31/converting-between-xml-and-json.html):

[![Build Status](https://secure.travis-ci.org/martinblech/xmltodict.png)](http://travis-ci.org/martinblech/xmltodict)

```python
>>> print(json.dumps(xmltodict.parse("""
... <mydocument has="an attribute">
... <and>
... <many>elements</many>
... <many>more elements</many>
... </and>
... <plus a="complex">
... element as well
... </plus>
... </mydocument>
... """), indent=4))
{
"mydocument": {
"@has": "an attribute",
"and": {
"many": [
"elements",
"more elements"
]
},
"plus": {
"@a": "complex",
"#text": "element as well"
}
}
}
```

## Namespace support

By default, `xmltodict` does no XML namespace processing (it just treats namespace declarations as regular node attributes), but passing `process_namespaces=True` will make it expand namespaces for you:

```python
>>> xml = """
... <root xmlns="http://defaultns.com/"
... xmlns:a="http://a.com/"
... xmlns:b="http://b.com/">
... <x>1</x>
... <a:y>2</a:y>
... <b:z>3</b:z>
... </root>
... """
>>> xmltodict.parse(xml, process_namespaces=True) == {
... 'http://defaultns.com/:root': {
... 'http://defaultns.com/:x': '1',
... 'http://a.com/:y': '2',
... 'http://b.com/:z': '3',
... }
... }
True
```

It also lets you collapse certain namespaces to shorthand prefixes, or skip them altogether:

```python
>>> namespaces = {
... 'http://defaultns.com/': None, # skip this namespace
... 'http://a.com/': 'ns_a', # collapse "http://a.com/" -> "ns_a"
... }
>>> xmltodict.parse(xml, process_namespaces=True, namespaces=namespaces) == {
... 'root': {
... 'x': '1',
... 'ns_a:y': '2',
... 'http://b.com/:z': '3',
... },
... }
True
```

## Streaming mode

`xmltodict` is very fast ([Expat](http://docs.python.org/library/pyexpat.html)-based) and has a streaming mode with a small memory footprint, suitable for big XML dumps like [Discogs](http://discogs.com/data/) or [Wikipedia](http://dumps.wikimedia.org/):

```python
>>> def handle_artist(_, artist):
... print artist['name']
... return True
>>>
>>> xmltodict.parse(GzipFile('discogs_artists.xml.gz'),
... item_depth=2, item_callback=handle_artist)
A Perfect Circle
Fantômas
King Crimson
Chris Potter
...
```

It can also be used from the command line to pipe objects to a script like this:

```python
import sys, marshal
while True:
_, article = marshal.load(sys.stdin)
print article['title']
```

```sh

c a t e n w i k i - p a g e s - a r t i c l e s . x m l . b z 2 | b u n z i p 2 | x m l t o d i c t . p y 2 | m y s c r i p t . p y A c c e s s i b l e C o m p u t i n g A n a r c h i s m A f g h a n i s t a n H i s t o r y A f g h a n i s t a n G e o g r a p h y A f g h a n i s t a n P e o p l e A f g h a n i s t a n C o m m u n i c a t i o n s A u t i s m . . . ‘ ‘ ‘ O r j u s t c a c h e t h e d i c t s s o y o u d o n^{'} t h a v e t o p a r s e t h a t b i g X M L f i l e a g a i n . Y o u d o t h i s o n l y o n c e : ‘ ‘ ‘ s h

cat enwiki-pages-articles.xml.bz2 | bunzip2 | xmltodict.py 2 | gzip > enwiki.dicts.gz
```

And you reuse the dicts with every script that needs them:

```sh

c a t e n w i k i . d i c t s . g z | g u n z i p | s c r i p t 1. p y

cat enwiki.dicts.gz | gunzip | script2.py
...
```

## Roundtripping

You can also convert in the other direction, using the `unparse()` method:

```python
>>> mydict = {
... 'response': {
... 'status': 'good',
... 'last_updated': '2014-02-16T23:10:12Z',
... }
... }
>>> print unparse(mydict, pretty=True)
<?xml version="1.0" encoding="utf-8"?>
<response>
<status>good</status>
<last_updated>2014-02-16T23:10:12Z</last_updated>
</response>
```

## Ok, how do I get it?

You just need to

```sh

p i p i n s t a l l x m l t o d i c t ‘ ‘ ‘ T h e r e i s a n [o f f i c i a l F e d o r a p a c k a g e f o r x m l t o d i c t] (h t t p s : / / a d m i n . f e d o r a p r o j e c t . o r g / p k g d b / a c l s / n a m e / p y t h o n - x m l t o d i c t) . I f y o u a r e o n F e d o r a o r R H E L, y o u c a n d o : ‘ ‘ ‘ s h

sudo yum install python-xmltodict
```

There is also an [official Arch Linux package for xmltodict](https://www.archlinux.org/packages/community/any/python-xmltodict/). You can use pacman to install if you are using Arch:

```sh
$ sudo pacman -S python-xmltodict
```

Algorithm	Hash digest
SHA256	`b2cab0184bbb8c3627fc54b03ed79ea2f4d5579fa041e3456ff8d3b3c09b0d5e`
MD5	`cb538f606811d9e8d108fd15675b492f`
BLAKE2b-256	`6ccb0628b276d670eb9553c2d21f45c395a87f1748203e5a39795bf53b299e78`

xmltodict 0.10.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes