[英]Golang parse raw HTTP/2 response
我有一個特定的情況,我需要將 HTTP/2 響應 output 解析為 Go 的http.Response
。 響應本身具有默認結構:
$ curl --include https://google.com
HTTP/2 301
location: https://www.google.com/
content-type: text/html; charset=UTF-8
date: Mon, 15 Jun 2020 11:08:39 GMT
expires: Wed, 15 Jul 2020 11:08:39 GMT
cache-control: public, max-age=2592000
server: gws
content-length: 220
<HTML><HEAD><meta http-equiv="content-type" content="text/html;charset=utf-8">
<TITLE>301 Moved</TITLE></HEAD><BODY>
<H1>301 Moved</H1>
The document has moved
<A HREF="https://www.google.com/">here</A>.
</BODY></HTML>
地位和身體本身並不重要,這只是一個例子。
http
library has function ReadResponse(r *bufio.Reader, req *Request) (*Response, error)
which does exactly what I need, but it fails parsing HTTP/2 with malformed HTTP version HTTP/2
, however it works fine for HTTP /1.1 和 HTTP/1.0。 另外,在使用http.DefaultClient.Do()
進行請求后,您可以看到響應的字段Proto
包含HTTP/2.0
,這意味着 HTTP/2 沒有問題。
任何想法如何解析此響應?
總結討論,解析錯誤的原因是http.ParseHTTPVersion
not parse HTTP/2
,將前綴 7 個字節替換為HTTP/2.0
來修復它,或者為net/http
提供 pr 支持來解析HTTP/2
。
package main
import (
"bufio"
"bytes"
"fmt"
"io"
"net/http"
)
func main() {
fmt.Println("Hello, playground")
req, _ := http.NewRequest("GET", "https://google.com", nil)
{
resp, err := http.ReadResponse(bufio.NewReader(bytes.NewBuffer(httpbody)), req)
fmt.Println(resp, err)
// err is malformed HTTP version "HTTP/2", because http.ParseHTTPVersion not parse "HTTP/2"
}
{
body := bytes.NewBuffer(httpbody)
prefix := make([]byte, 7)
n, err := io.ReadFull(body, prefix)
if err != nil {
panic("handler err")
}
fmt.Println(n, err, string(prefix))
if string(prefix[:n]) == "HTTP/2 " {
// fix HTTP/2 proto
resp, err := http.ReadResponse(bufio.NewReader(io.MultiReader(bytes.NewBufferString("HTTP/2.0 "), body)), req)
fmt.Println(resp, err)
} else {
// other proto
resp, err := http.ReadResponse(bufio.NewReader(io.MultiReader(bytes.NewBuffer(prefix[:n]), body)), req)
fmt.Println(resp, err)
}
}
}
var httpbody = []byte(`HTTP/2 301
location: https://www.google.com/
content-type: text/html; charset=UTF-8
date: Mon, 15 Jun 2020 11:08:39 GMT
expires: Wed, 15 Jul 2020 11:08:39 GMT
cache-control: public, max-age=2592000
server: gws
content-length: 220
<HTML><HEAD><meta http-equiv="content-type" content="text/html;charset=utf-8">
<TITLE>301 Moved</TITLE></HEAD><BODY>
<H1>301 Moved</H1>
The document has moved
<A HREF="https://www.google.com/">here</A>.
</BODY></HTML>`)
output:
Hello, playground
<nil> malformed HTTP version "HTTP/2"
7 <nil> HTTP/2
&{301 301 HTTP/2.0 2 0 map[Cache-Control:[public, max-age=2592000] Content-Length:[220] Content-Type:[text/html; charset=UTF-8] Date:[Mon, 15 Jun 2020 11:08:39 GMT] Expires:[Wed, 15 Jul 2020 11:08:39 GMT] Location:[https://www.google.com/] Server:[gws]] 0xc0000902c0 220 [] false false map[] 0xc0000f2000 <nil>} <nil>
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.