Why is my RFC 2822 date not parsed by chrono?

I'm writing some code to parse RSS feeds, but I'm having trouble with the Abstruse Goose RSS feed. If you look at that feed, dates are encoded as Mon, 06 Aug 2018 00:00:00 UTC. To me, that looks like RFC 2822.

I tried to parse it using chrono's DateTime::parse_from_rfc2822, but I get ParseError(NotEnough).

let pub_date = entry.pub_date().unwrap().to_owned();
return rfc822_sanitizer::parse_from_rfc2822_with_fallback(&pub_date)
    .unwrap_or_else(|e| {
        panic!(
            "pub_date for item {:?} (value is {:?}) can't be parsed due to error {:?}",
            &entry, pub_date, e
        )
    })
    .naive_utc();

Am I doing something wrong? Do I have to hack around it somehow?

I use rfc822_sanitizer, which does a good job of fixing badly written dates (most of the time). I don't think it affects the parsing... but who knows?

The RFC 2822 date/time format is well codified in the RFC as the following grammar:

date-time       =       [ day-of-week "," ] date FWS time [CFWS]
day-of-week     =       ([FWS] day-name) / obs-day-of-week
day-name        =       "Mon" / "Tue" / "Wed" / "Thu" /
                        "Fri" / "Sat" / "Sun"
date            =       day month year
year            =       4*DIGIT / obs-year
month           =       (FWS month-name FWS) / obs-month
month-name      =       "Jan" / "Feb" / "Mar" / "Apr" /
                        "May" / "Jun" / "Jul" / "Aug" /
                        "Sep" / "Oct" / "Nov" / "Dec"
day             =       ([FWS] 1*2DIGIT) / obs-day
time            =       time-of-day FWS zone
time-of-day     =       hour ":" minute [ ":" second ]
hour            =       2DIGIT / obs-hour
minute          =       2DIGIT / obs-minute
second          =       2DIGIT / obs-second
zone            =       (( "+" / "-" ) 4DIGIT) / obs-zone

where obs-zone is defined as follows:

obs-zone        =       "UT" / "GMT" /          ; Universal Time
                                                ; North American UT
                                                ; offsets
                        "EST" / "EDT" /         ; Eastern:  - 5/ - 4
                        "CST" / "CDT" /         ; Central:  - 6/ - 5
                        "MST" / "MDT" /         ; Mountain: - 7/ - 6
                        "PST" / "PDT" /         ; Pacific:  - 8/ - 7
                        %d65-73 /               ; Military zones - "A"
                        %d75-90 /               ; through "I" and "K"
                        %d97-105 /              ; through "Z", both
                        %d107-122               ; upper and lower case
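
To make the zone rule concrete, here is a minimal sketch of a checker for the zone and obs-zone productions quoted above. The function name is_rfc2822_zone is hypothetical and not part of chrono or rfc822_sanitizer. It assumes the zone token has already been split off from the rest of the date, and it relies on ABNF string literals being case-insensitive.

```rust
/// Hypothetical helper (not part of chrono): returns true if `zone`
/// matches the `zone` production from the grammar quoted above.
fn is_rfc2822_zone(zone: &str) -> bool {
    // Numeric form: ( "+" / "-" ) 4DIGIT
    if let Some(digits) = zone.strip_prefix('+').or_else(|| zone.strip_prefix('-')) {
        return digits.len() == 4 && digits.bytes().all(|b| b.is_ascii_digit());
    }

    // Named obs-zone alternatives; ABNF string literals match case-insensitively.
    const NAMED: [&str; 10] = [
        "UT", "GMT", "EST", "EDT", "CST", "CDT", "MST", "MDT", "PST", "PDT",
    ];
    if NAMED.iter().any(|name| zone.eq_ignore_ascii_case(name)) {
        return true;
    }

    // Military zones: exactly one ASCII letter, upper or lower case, except "J".
    let mut chars = zone.chars();
    match (chars.next(), chars.next()) {
        (Some(c), None) => c.is_ascii_alphabetic() && !c.eq_ignore_ascii_case(&'J'),
        _ => false,
    }
}
```

With this sketch, is_rfc2822_zone("UT") and is_rfc2822_zone("+0000") return true, while is_rfc2822_zone("UTC") returns false, because UTC is neither a numeric offset nor one of the listed names.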

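Many people get things wrong when rolling their own timestamp-generation library, and this is one of those things: how to correctly label an RFC 2822 time zone offset. The feed's UTC is not one of the alternatives that zone allows, so the date is not valid RFC 2822 and the parser rejects it. The grammar uses UT rather than UTC because the two are not quite the same (one has additional seconds, the other has... four variants! And the RFC does not define which variant is used; they all differ from one another).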
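
One possible workaround, sketched here under the assumption that the feed always means a zero offset when it writes UTC, is to rewrite that trailing label to the numeric offset +0000 before parsing. The helper name normalize_utc_zone is hypothetical. It is not something chrono or rfc822_sanitizer provides, and it only handles an upper-case UTC preceded by a single space.

```rust
/// Hypothetical helper: rewrites a trailing " UTC" zone label to the
/// numeric offset " +0000", which the RFC 2822 `zone` rule accepts.
/// Assumes "UTC" in the input always means a zero offset.
fn normalize_utc_zone(date: &str) -> String {
    // Trailing whitespace is dropped so the suffix check is reliable.
    let trimmed = date.trim_end();
    match trimmed.strip_suffix(" UTC") {
        Some(rest) => format!("{} +0000", rest),
        // Anything else is returned unchanged (apart from the trim).
        None => trimmed.to_string(),
    }
}
```

The normalized string, for example Mon, 06 Aug 2018 00:00:00 +0000, can then be handed to DateTime::parse_from_rfc2822. Doing the rewrite on the string keeps the fix separate from the parser, so dates that are already valid pass through untouched.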

