这里有个 非常好的分析 html的 类。
节约了不少时间。
项目地址
http://www.codeplex.com/Wiki/View.aspx?ProjectName=htmlagilitypack
For example, here is how you would fix all hrefs in an HTML file:
HtmlDocument doc = new HtmlDocument();
doc.Load("file.htm");
foreach(HtmlNode link in doc.DocumentElement.SelectNodes("//a@href")
{
HtmlAttribute att = link"href";
att.Value = FixLink(att);
}
doc.Save("file.htm");
If you want to participate to the project - because that's the whole purpose of putting the source there, right - use the forums or drop me a note (simon underscore mourier at hotmail dot com)!
Happy coding, scraping, scanning, html-ing, xhtml-ing, etc... :^)
Simon Mourier.
http://www.cnblogs.com/wujun/archive/2006/10/31/545646.html
一起学吧部分文章转载自互联网,供读者交流和学习,若有涉及作者版权等问题请及时与我们联系,以便更正、删除或按规定办理。感谢所有提供资讯的网站,欢迎各类媒体与一起学吧进行文章共享合作。