
Scrape all the certificate information from a website?

I'm working with BeautifulSoup, but I have no idea how to scrape certificate data from a website.

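If you mean the TLS certificate, that is not visible to BeautifulSoup. The certificate is exchanged during the TLS handshake on the underlying TCP connection; it is not part of the HTTP request or response, so it never appears in the HTML that BeautifulSoup parses.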

Instead, you can open a new connection, wrap it in an OpenSSL.SSL.Connection, perform the handshake, and call get_peer_certificate. Alternatively, check the documentation of the library you currently use to make the connection; it may expose the certificate as well.
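As a rough illustration of the same idea, here is a minimal sketch that uses Python's standard-library ssl module instead of pyOpenSSL (a swapped-in approach, not the one named above). The helper names fetch_certificate and summarize_certificate are hypothetical, and the sketch assumes the server presents a certificate that validates against the system trust store.

```python
import socket
import ssl


def fetch_certificate(host, port=443, timeout=10.0):
    """Open a TLS connection to host:port and return the peer certificate as a dict."""
    # The default context verifies the certificate chain and the hostname.
    context = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        # The TLS handshake is performed inside wrap_socket().
        with context.wrap_socket(sock, server_hostname=host) as tls:
            return tls.getpeercert()


def summarize_certificate(cert):
    """Flatten the nested tuples returned by getpeercert() into plain values."""

    def flatten(name):
        # A name is a tuple of RDNs; each RDN is a tuple of (key, value) pairs.
        return {key: value for rdn in name for key, value in rdn}

    return {
        "subject": flatten(cert.get("subject", ())),
        "issuer": flatten(cert.get("issuer", ())),
        "not_before": cert.get("notBefore"),
        "not_after": cert.get("notAfter"),
        # Keep only DNS entries from the subjectAltName extension.
        "dns_names": [v for k, v in cert.get("subjectAltName", ()) if k == "DNS"],
    }
```

Calling summarize_certificate(fetch_certificate("example.com")) would then return the subject, issuer, validity dates, and DNS names for that host. Note that getpeercert() returns an empty dict when certificate verification is disabled on the context, so this sketch only yields data for certificates that pass validation.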

The technical posts on this site are licensed under CC BY-SA 4.0. If you reprint them, please cite the site URL or the original address.

 