Regular expresso: mi sono un pò perso

aris · September 22, 2012, 11:41pm

Ciao a tutti, ho bisogno del vostro aiuto.
Non riesco a ricavare da una pagina html, tutti i link che mi servono,
utilizzando questo script

#main è la pagina scaricata in maniera corretta ed è di tipo String
er=Array.new
er << main[’

(…)

’]
puts “Post search”
puts “Valore #{er.class}”
for i in er
puts i
end

Dove Sbaglio???

reis · September 23, 2012, 12:35am

Perch non ti appoggi alla gemma nokogiri?

Il giorno sabato 22 settembre 2012, Andrea R. ha scritto:

reis · September 23, 2012, 9:50am

Andrea P. wrote in post #1077137:

Perch non ti appoggi alla gemma nokogiri?

Il giorno sabato 22 settembre 2012, Andrea R. ha scritto:

Ho risolto con hpricot XD

reis · September 23, 2012, 11:14am

David W. wrote in post #1077151:

Perch non ti appoggi alla gemma nokogiri?

+1

I regexp non vanno bene per parsare HTML.

"Some people, when confronted with a problem, think
I know, I’ll use regular expressions. Now they have two problems. "

–
David N. Welton

David N. Welton

http://www.dedasys.com/

L’ho notato, infatti sto usando nokogiri (grazie apeacox), in quanto
hpricot sembra un progetto chiuso/abbandonato.

reis · September 23, 2012, 10:36am

Perch non ti appoggi alla gemma nokogiri?

+1

I regexp non vanno bene per parsare HTML.

"Some people, when confronted with a problem, think
I know, I’ll use regular expressions. Now they have two problems. "

–
David N. Welton

http://www.welton.it/davidw/

http://www.dedasys.com/

reis · October 8, 2012, 2:31pm

On 8 October 2012 13:48, Marcello B. (void) [email protected] wrote:

On Sep 23, 2012, at 10:35:48 AM, David W. wrote:

I regexp non vanno bene per parsare HTML.

"Some people, when confronted with a problem, think
I know, I’ll use regular expressions. Now they have two problems. "

Obligatory: html - RegEx match open tags except XHTML self-contained tags - Stack Overflow

“Chuck Norris can parse HTML with regex.”

Maurizio

My profile https://plus.google.com/100973969013103507046/about

reis · October 8, 2012, 1:48pm

On Sep 23, 2012, at 10:35:48 AM, David W. wrote:

I regexp non vanno bene per parsare HTML.

"Some people, when confronted with a problem, think
I know, I’ll use regular expressions. Now they have two problems. "

Obligatory: html - RegEx match open tags except XHTML self-contained tags - Stack Overflow

~Marcello