繁体   English   中英

如何使用selenium webdriver从第三个网站下载文件

[英]How to download file using selenium webdriver from a third website

我正在自动化一个过程,我应该能够登录到一个网站并自动下载和重命名这些文件。

  1. 它应该只下载以'06'结尾的csv文件。
  2. 它应该与报告类型共同报告报告ID。 示例:如果报告ID为“XXXXXX06”且报告类型以“PAS”开头,则应使用XXXXXX06 PAS.csv重命名文件名,如果报告ID为“XXXXXX06”且报告类型以“TAS”开头,则应重命名XXXXXX06 TAS.csv的文件名

我附上了代码。 我只能下载并重命名文件,休息所有功能都正常。 我认为给出xpath存在一些问题。

在此输入图像描述

package package1;
import java.io.File;

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.HashMap;

import java.util.List;
import org.apache.commons.io.FileUtils;
import org.openqa.selenium.By;
import org.openqa.selenium.WebDriver;

import org.openqa.selenium.WebElement;

import org.openqa.selenium.chrome.ChromeDriver;

import org.openqa.selenium.chrome.ChromeOptions;

import org.openqa.selenium.remote.CapabilityType;

import org.openqa.selenium.remote.DesiredCapabilities;


public class Test_new {

public static WebDriver setup() throws Exception

{
    String downloadFilepath = "E:\\HCA_Automation\\Files";
    System.out.println(downloadFilepath);
    HashMap<String, Object> chromePrefs = new HashMap<String, Object>();

    chromePrefs.put("profile.default_content_settings.popups", 0);

    chromePrefs.put("download.default_directory", downloadFilepath);

    ChromeOptions options = new ChromeOptions();

    options.setExperimentalOption("prefs", chromePrefs);

    DesiredCapabilities cap = DesiredCapabilities.chrome();

    cap.setCapability(CapabilityType.ACCEPT_SSL_CERTS, true);

    cap.setCapability(ChromeOptions.CAPABILITY, options);

    //WebDriver driver = new ChromeDriver(cap);

    WebDriver wd;

    //String exepath=System.getProperty("user.dir") + "//chromedriver.exe";

    System.setProperty("webdriver.chrome.driver",
            "C:\\Users\\sh370472\\Downloads\\chromedriver_win32 (1)\\chromedriver.exe");

    return (wd= new ChromeDriver(cap));

    }


    public static void login(WebDriver wd) throws Throwable
    {
    wd.manage().window().maximize();
    wd.get("https://www.shipper-ml.com");
    wd.findElement(By.id("inpUserId")).sendKeys("XXXXXXXX");
    wd.findElement(By.id("inpPassword")).sendKeys("XXXXXXXXXXXX");
    wd.findElement(By.id("btnLogonLabel")).click();
    Thread.sleep(1000); 
}

public static void execute(WebDriver wd) throws Throwable{
    wd.get("https://www.shipper-ml.com/viewReports.do");  
    Thread.sleep(2000);
    List<WebElement> list= wd.findElements(By.xpath("//table[@class='lcb']/tbody/tr/td/table[@class='ibody']/tbody/tr/td[contains(translate(text(),'0123456789'),'06')]/parent::tr/td[7]/a"));
    int i = 0;
    FileUtils.cleanDirectory(new File("E:\\HCA_Automation\\Files"));
    for (WebElement element:list)
    {
        i++;
        element.click();
        Thread.sleep(1000);
        System.out.println((element.findElement(By.xpath("(//table[@class='lcb']/tbody/tr/td/table[@class='ibody']/tbody/tr/td[contains(translate(text(), '0123456789'),'06')]/parent::tr/td[3])["+i+"]")).getText()).substring(0,2));
        String report_type = (element.findElement(By.xpath("(//table[@class='lcb']/tbody/tr/td/table[@class='ibody']/tbody/tr/td[contains(translate(text(), '0123456789'),'06')]/parent::tr/td[3])["+i+"]")).getText()).substring(0,2);

    } 
}


public static void tearDown(WebDriver wd) throws Throwable{
    wd.quit();
}



public static void main(String[] args) throws Throwable 
{
    WebDriver wd = setup();
    login(wd);
    execute(wd);
    tearDown(wd);
    Thread.sleep(1000);
    FileUtils.copyDirectory(new File("C:\\Users\\nea558\\Desktop\\New_Folder\\Files\\TAS\\"), new File( "\\\\zneugo1p17ecn02.bp1.ad.bp.com\\DataTransfer\\ETAP\\DropBoxes\\CATS\\01_Inbox"));

    //FileUtils.copyDirectory(new File("C:\\Users\\nea558\\Desktop\\New_Folder\\Files\\PAS\\"), new File("\\\\bp1xeuap2433\\aamon\\scheduler\\Handoffs\\paa\\itd\\process"));

}

}

HTML:

<div id="lc_ctrl258037362">
    <table id="ctrl258037362" cellspacing="0" width="845px" class="lc_nf" border="0" cellpadding="0">
        <tbody>
        <tr>
            <td>
                <table cellspacing="0" width="100%" class="lcb" border="0" cellpadding="0">
                    <tbody>
                    <tr>
                        <td>
                            <table cellspacing="1" width="100%" class="ibody" border="0" cellpadding="0">
                                <colgroup>
                                    <col>
                                    <col>
                                    <col>
                                    <col>
                                    <col>
                                    <col>
                                    <col>
                                    <col>
                                </colgroup>
                                <tbody>
                                <tr class="header">
                                    <td>Report ID</td>
                                    <td>Version No.</td>
                                    <td>Report Type</td>
                                    <td>Contract</td>
                                    <td>Date Created</td>
                                    <td>Status</td>
                                    <td align="center">Excel</td>
                                    <td align="center">XML</td>
                                </tr>
                                <tr onmouseover="high(this);" class="even" onmouseout="low(this);" style="">
                                    <td class="cl">19062006</td>
                                    <td class="cl">V1</td>
                                    <td class="cl">PAS ITD/EOD EX1 Allocation Statement</td>
                                    <td class="cl">ETAP</td>
                                    <td class="cl">21/06/2019 08:56</td>
                                    <td class="cl">Published</td>
                                    <td align="center" class="cl">
                                        <a href="/viewReports.do?ctrl=reportListForDownload&amp;action=DownloadReport&amp;param=0" target="_blank">
                                            <img vspace="0" align="absmiddle" border="0" src="images/buttons/excel.gif"></a></td>
                                    <td align="center" class="cl"><a href="/viewReports.do?ctrl=reportListForDownload&amp;action=DownloadXml&amp;param=0" target="_blank"><img
                                            vspace="0" align="absmiddle" border="0" src="images/buttons/document.gif"></a></td>
                                </tr>
                                </tbody>
                            </table>
                        </td>
                    </tr>
                    </tbody>
                </table>
            </td>
        </tr>
        </tbody>
    </table>
</div>

找到下载链接的XPath如下:

//table/descendant-or-self::tr[td[3][text()='PAS ITD/EOD EX1 Allocation Statement']]/td[7]/a

这找到一个<table>元素包含<tr>有一个元素<td>元素,值为PAS ITD/EOD EX1 Allocation Statement ,然后路由你<td>包含下载链接在同一行元素。

要将其放入可以调整报告类型值的代码中,您可以使用:

String reportTypeValue = "PAS ITD/EOD EX1 Allocation Statement"
driver.findElement(By.xpath("//table/descendant-or-self::tr[td[3][text()='" + reportTypeValue + "']]/td[7]/a")

*编辑*

根据你的评论你会变得更复杂,我现在有:

//table/descendant-or-self::tr[td[1][substring(text(), string-length(text()) - string-length('06') + 1)]][td[3][text()='PAS ITD/EOD EX1 Allocation Statement']]/td[7]/a

这将发现一个<table>元素包含一个<tr>具有元件<td>具有值元件PAS ITD/EOD EX1 Allocation Statement<td>与所述值结束元素06 然后它会将您路由到包含下载链接的同一行中的<td>元素。

XPath 1.0不支持结束 - 因此这个块:

substring(text(), string-length(text()) - string-length('06') + 1)]

这将计算文本节点的长度和您要搜索的字符串的长度,并使用子字符串功能确保最后x个字符匹配。

要将其放入可用于调整报告类型值和报告ID的新代码块中,您可以执行以下操作:

String reportID = "06"
String reportTypeValue = "PAS ITD/EOD EX1 Allocation Statement"
driver.findElement(By.xpath("//table/descendant-or-self::tr[td[1][substring(text(), string-length(text()) - string-length('" + reportID + "') + 1)]][td[3][text()='" + reportTypeValue + "']]/td[7]/a")

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM