簡體   English   中英

使用 Python 中的正則表達式提取 javascript 表

[英]Extract javascript table with regex in Python

我已經抓取了一個網頁,我想使用正則表達式提取一個特定的表。 我將如何進行?

renderList([
["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""],
["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""],
]
);

我想保留列表格式,這樣我只檢索以下字符串:

["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""],
["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""],

如果可能,請刪除表格末尾的逗號:

["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""],
["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""]

您可以使用substring來提取數組。 然后您可以對其進行解析以生成您希望的 output:

 const str = `renderList([ ["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""], ["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""], ] );` const table = JSON.parse(str.substring(11, str.length - 2).replaceAll('\n', '').replaceAll(',]', ']')); console.log(table);

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM