What is an HTML tokenizer and what does it do?

by **JAB Creations** » Sat Nov 05, 2011 8:52 am

What is an HTML tokenizer? What does it do and how would that be useful?

by **zcorpan** » Sat Nov 05, 2011 9:57 pm

An HTML tokenizer is part of an HTML parser. To parse HTML, you first tokenize the input stream (a sequence of bytes or characters) into a sequence of tokens, where a token is text, start tag, end tag, doctype, or comment. Then the HTML tree builder looks at the sequence of tokens and builds a DOM tree.

by **JAB Creations** » Sun Nov 06, 2011 1:07 am

So text to DOM tree essentially, thanks Z!

What is an HTML tokenizer and what does it do?

What is an HTML tokenizer and what does it do?

Re: What is an HTML tokenizer and what does it do?

Re: What is an HTML tokenizer and what does it do?

Who is online

Who is online