Gumbo is an implementation of the HTML5 parsing algorithm implemented as a pure
C99 library with no outside dependencies.
Goals and features of the C library:
- Fully conformant with the HTML5 spec.
- Robust and resilient to bad input.
- Simple API that can be easily wrapped by other languages.
- Support for source locations and pointers back to the original text.
- Relatively lightweight, with no outside dependencies.
- Tested on over 2.5 billion pages from Google's index.