The author has also written related tools. One to convert XML to JSON and back (...

0xbadcafebee · on Oct 8, 2019

> convert XML to JSON and back

This is basically impossible to do in a way that is compatible with other tools. Things like repeated child elements with the same name, or an attribute and a child element that share a name, can exist in XML but have no direct equivalent in a JSON object. You can still work around these limitations if your whole pipeline uses the same toolset, but part of the point of these tools is to convert the data back to a format that some other tool can use, which is where this pattern breaks down.

Here's a list of pitfalls: https://stackoverflow.com/questions/33072812/potential-probl...
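
To make the round-trip problem concrete, here is a minimal sketch in Python. It is not how the tool under discussion works; `naive_xml_to_dict` is a hypothetical helper that uses the most obvious mapping (attributes and child tags both become object keys), and the sample document is made up for illustration.

```python
import json
import xml.etree.ElementTree as ET


def naive_xml_to_dict(elem):
    """Hypothetical naive mapping: attributes and child tags both become keys."""
    out = dict(elem.attrib)
    for child in elem:
        if len(child) or child.attrib:
            value = naive_xml_to_dict(child)
        else:
            value = child.text
        # A repeated tag, or a tag named like an attribute, silently
        # overwrites whatever was stored under that key before.
        out[child.tag] = value
    return out


# Made-up sample: two <email> siblings, plus an "id" attribute and an <id> child.
doc = (
    "<user id='1'>"
    "<email>a@example.com</email>"
    "<email>b@example.com</email>"
    "<id>2</id>"
    "</user>"
)

root = ET.fromstring(doc)
converted = naive_xml_to_dict(root)
as_json = json.dumps(converted, sort_keys=True)
print(as_json)
```

The first `<email>` and the `id` attribute are gone from the result, so nothing downstream can rebuild the original XML. Converters avoid this with conventions such as always emitting arrays or prefixing attribute keys, but each tool picks its own convention, which is where compatibility with other tools breaks.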

namibj · on Oct 8, 2019

This suggests a very scalable, easy approach to extract data from somewhat regular HTML...

Yetanfou · on Oct 8, 2019

I generally use xidel [1] for that type of task. Feed it xpath, css selectors or its own pattern matching thing.

[1] https://github.com/benibela/xidel

benibela · on Oct 8, 2019

or just use xpath
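
For anyone who wants to see what the XPath route looks like, here is a rough sketch using only Python's standard library. The page content, the `prices` id and the `row` class are invented for the example. Note that `xml.etree.ElementTree` only supports a subset of XPath and needs well-formed markup, so real-world HTML usually calls for a more tolerant parser or a dedicated tool.

```python
import xml.etree.ElementTree as ET

# Made-up, well-formed sample page; real HTML is rarely this clean.
page = """<html><body>
<table id="prices">
<tr class="row"><td>apple</td><td>1.20</td></tr>
<tr class="row"><td>pear</td><td>0.80</td></tr>
<tr class="footer"><td>total</td><td>2.00</td></tr>
</table>
</body></html>"""

root = ET.fromstring(page)

# Select only the data rows of the table, skipping the footer row.
rows = root.findall(".//table[@id='prices']/tr[@class='row']")

# Positional predicates are 1-based: td[1] is the name, td[2] the price.
items = [(r.find("td[1]").text, float(r.find("td[2]").text)) for r in rows]
print(items)
```

The selection logic lives entirely in the path expression, so the same query should carry over to any XPath-capable tool with little change.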