[英]Extract javascript table with regex in Python
我已經抓取了一個網頁,我想使用正則表達式提取一個特定的表。 我將如何進行?
renderList([
["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""],
["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""],
]
);
我想保留列表格式,這樣我只檢索以下字符串:
["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""],
["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""],
如果可能,請刪除表格末尾的逗號:
["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""],
["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""]
您可以使用substring
來提取數組。 然后您可以對其進行解析以生成您希望的 output:
const str = `renderList([ ["Platform_1","1","TWOB","6150","Paris","France", "2016/03/01", "01-03-2016", "", ""], ["Platform_2","3","NRCN","2000","Bruxelles","Belgium", "2009/04/21", "21-04-2009", "", ""], ] );` const table = JSON.parse(str.substring(11, str.length - 2).replaceAll('\n', '').replaceAll(',]', ']')); console.log(table);
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.