简体繁体 English

复杂的Web文档检索

[英]Complex retrieval of a web document

原文 2010-09-12 00:43:21 5 2 java/ post/ cookies/ redirect/ https

I need to retrieve a document from a website, and parse it. 我需要从网站检索文档并进行解析。 Problem is that: 问题是：

The site uses both http and https protocol 该网站同时使用http和https协议
You need to log in the site (I have a regular account) 您需要登录该网站（我有一个普通帐户）
From the login page, there are at least 2 redirect just to log in yourself 在登录页面上，至少有2个重定向只是为了自己登录

I managed an HTTPS connection and posted my login and pass, but I'm having troubles with cookie management and the redirect.... 我管理了HTTPS连接并发布了登录名和密码，但是在cookie管理和重定向方面遇到了麻烦。

2 个解决方案

commons-httpclient会有所帮助。

使用类似HtmlUnit的库可能会有所帮助。

休眠复杂的数据检索？ - Hibernate complex data retrieval?

Mongodb文件检索 - Mongodb document retrieval

具有复杂XML / JSON的SmartGWT数据检索 - SmartGWT data retrieval with complex XML/JSON

Json Web服务数据检索 - Json Web Service Data Retrieval

对clojure的Web检索到Lazy字符串错误 - Web retrieval to Lazy string error on clojure

Web 元素检索使用内部 Web 元素属性在 selenium ZD52387880E1EA22817AZ2D357 - Web Element retrieval using inner Web Element attribute in selenium Java

使用 FileNet API 获取 DocumentSet 中最新版本文档的检索名称 - Acquiring retrieval name for latest version of a document in a DocumentSet using FileNet API

复杂的Web应用程序上的ConversionInputException - ConversionInputException on a complex web application

创建具有复杂类型的Web服务 - Creating a web service with complex types

Delphi Web服务中的复杂类型 - Complex types in Delphi web service

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 休眠复杂的数据检索？ - Hibernate complex data retrieval? Mongodb文件检索 - Mongodb document retrieval 具有复杂XML / JSON的SmartGWT数据检索 - SmartGWT data retrieval with complex XML/JSON Json Web服务数据检索 - Json Web Service Data Retrieval 对clojure的Web检索到Lazy字符串错误 - Web retrieval to Lazy string error on clojure Web 元素检索使用内部 Web 元素属性在 selenium ZD52387880E1EA22817AZ2D357 - Web Element retrieval using inner Web Element attribute in selenium Java 使用 FileNet API 获取 DocumentSet 中最新版本文档的检索名称 - Acquiring retrieval name for latest version of a document in a DocumentSet using FileNet API 复杂的Web应用程序上的ConversionInputException - ConversionInputException on a complex web application 创建具有复杂类型的Web服务 - Creating a web service with complex types Delphi Web服务中的复杂类型 - Complex types in Delphi web service

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM