want to parse href in ruby on rails using nokogiri

Question

I am using nokogiri as my HTML parser.

<html>
<body>
<form>
<table>
    <tr><td>Some Text</td></tr>
    <tr>
        <td colspan="2" align="center">
            <br />
            <a href="TransportRoom?servlet=CaseSearch.jsp&amp;advancedSearch=Advanced">
                Advanced Search
            </a>
            <br />
            &nbsp;
        </td>
    </tr>
</table>
</form>
</body>
</html>

In this html code I want to parse the "Advance Search" link. This html is saved in variable named doc1

Can anyone help me with this?

Answer 1

Should be as simple as

doc = Nokogiri::HTML(doc1)
href = doc.css("a").first.attr('href')

This is what you want?

Answer 2

First answer is working for me but if there is n number of links than we can manipulate it by this way

 html = Nokogiri::HTML(doc1)

 html.css("a").each do |element|
      if (element.text.strip == 'Advanced Search')
        advance_search_link = element.attr('href')
      end
  end

Answer 3

I would do as below :

require 'nokogiri'

@doc = Nokogiri.HTML <<-eotl
<html>
<body>
<form>
<table>
    <tr><td>Some Text</td></tr>
    <tr>
        <td colspan="2" align="center">
            <br />
            <a href="TransportRoom?servlet=CaseSearch.jsp&amp;advancedSearch=Advanced">
                Advanced Search
            </a>
            <br />
            &nbsp;
        </td>
    </tr>
</table>
</form>
</body>
</html>
eotl

@doc.at_xpath("//a[normalize-space(.)='Advanced Search']")['href']
# => "TransportRoom?servlet=CaseSearch.jsp&advancedSearch=Advanced"

want to parse href in ruby on rails using nokogiri

Question

3 answers

solution1
4 2012-07-23 08:40:42

solution2
1 ACCPTED 2012-07-27 06:18:42

solution3
1 2013-12-08 16:55:14

want to parse href in ruby on rails using nokogiri

Question

3 answers

solution1 4 2012-07-23 08:40:42

solution2 1 ACCPTED 2012-07-27 06:18:42

solution3 1 2013-12-08 16:55:14

solution1
4 2012-07-23 08:40:42

solution2
1 ACCPTED 2012-07-27 06:18:42

solution3
1 2013-12-08 16:55:14