0.1.0	~~2024年5月3日~~

#2036 in Web编程

BSL-1.0 许可证

19KB
267 行

Web-Scraper

一个Rust crate，用于从网站获取HTML并抓取其中的内容

示例

以下代码是如何使用它的示例，它从 Rust语言网站获取HTML，然后查找所有的div标签，并获取其中的内容

use web_scraper::site::get_html;
use web_scraper::HtmlTag;
#[tokio::main]
async fn main() -> Result<(), Box<dyn std::error::Error + std::marker::Send + std::marker::Sync>> {
    let file = get_html("https://rust-lang.net.cn").await.unwrap();
    // Define the tag you want to parse
    let tag = HtmlTag::DIV;
    // Parse the <div> tags and collect the results into a vector
    let new_vector = tag.parse_tags(&file);

    // Print the parsed <div> tags
    for li_tag in &new_vector {
        println!("{}", li_tag);
    }

    Ok(())
}

依赖项

~8–19MB
~273K SLoC