Problems using the RedCloth gem !?!

August_Lilleaas · January 21, 2007, 11:46am

I think it filters some HTML tags, but not normal and safe ones like <br />, <p>, <h1> etc.

RSL · January 22, 2007, 1:55pm

If there is, I haven’t found it. I have, though, found a useful regex for doing that. It’s a slightly modified version of the one used in Typo. [No source code will go unscavenged!] I know I might catch some flak for doing so, but I chose to add this method to the String class itself, as I use it a lot.

class String
  # Strip HTML tags; pass true to keep the original whitespace.
  def strip_html(leave_whitespace = false)
    name  = /[\w:_-]+/
    value = /([A-Za-z0-9]+|('[^']*?'|"[^"]*?"))/
    attr  = /(#{name}(\s*=\s*#{value})?)/
    rx    = %r{<[!/?\[]?(#{name}|--)(\s+(#{attr}(\s+#{attr})*))?\s*([!/?\]]+|--)?>}

    stripped = gsub(rx, "")
    leave_whitespace ? stripped.strip : stripped.gsub(/\s+/, " ").strip
  end
end
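For anyone who would rather not reopen String, here is a minimal sketch of the same pattern as a module function. `HtmlStripper` is a made-up name for illustration, and the sample HTML is invented rather than captured from a real RedCloth run.

```ruby
# Hypothetical wrapper module; the regex pieces follow the post above.
module HtmlStripper
  NAME  = /[\w:_-]+/
  VALUE = /([A-Za-z0-9]+|('[^']*?'|"[^"]*?"))/
  ATTR  = /(#{NAME}(\s*=\s*#{VALUE})?)/
  TAG   = %r{<[!/?\[]?(#{NAME}|--)(\s+(#{ATTR}(\s+#{ATTR})*))?\s*([!/?\]]+|--)?>}

  # Removes tags; collapses runs of whitespace unless leave_whitespace is true.
  def self.strip(text, leave_whitespace = false)
    stripped = text.gsub(TAG, "")
    leave_whitespace ? stripped.strip : stripped.gsub(/\s+/, " ").strip
  end
end

# Invented sample input (an assumption, not real RedCloth output).
html = "<h1>Title</h1>\n<p>Some   <em>text</em> &#8482;</p>"

puts HtmlStripper.strip(html)     # => Title Some text &#8482;
p HtmlStripper.strip(html, true)  # => "Title\nSome   text &#8482;"
```

Note that the entity `&#8482;` passes through untouched in both cases, since the pattern only targets tags.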

Be aware, though, that there are still a lot of HTML entities left in the Textilized string. [&#8482;, etc.] Depending on your end use, you may need to strip those entities as well. Let me know if you do need that, because I’ve written some really handy code for it, based entirely on the transformations RedCloth does. You know, only make the server work as hard as it has to.

RSL

RSL · January 22, 2007, 6:27pm

In what circumstances would you want an h1 and not an h2? Sounds like you’re definitely going to need to Regex that one.

Anyhow, here are the two additional methods [both on the String class, as before] for dealing with RedClothed HTML entities.

def convert_entities
  dummy = self.dup
  replacements = {
    "#822[01]" => '"',
    "#821[67]" => "'",
    "#8230" => "...",
    "#8211" => "-",
    "#8212" => "--",
    "#215" => "x",

    "gt" => ">",
    "lt" => "<",
    "(#8482|trade)" => "(tm)",
    "(#174|reg)" => "(r)",
    "(#169|copy)" => "(c)",

    "(#38|amp)" => "&",
    "nbsp" => " ",
    "cent" => " cent",
    "pound" => " pound",
  }
  # Each key is a regex fragment matched between "&" and ";".
  replacements.each do |textiled, normal|
    dummy.gsub!(/&#{textiled};/, normal)
  end
  # Drop any well-formed entities that are left over.
  dummy.gsub(/&#?\w+;/, "")
end
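To see the conversion in action, here is a rough standalone sketch using a subset of the table. It takes the text as an argument instead of living on String; `ENTITY_REPLACEMENTS` is a name made up for this example, and the sample string is invented. The final cleanup only removes well-formed entities (`&name;` or `&#123;`), so a bare ampersand followed later by a semicolon is left alone.

```ruby
# Subset of the entity table, for illustration only (hypothetical constant name).
ENTITY_REPLACEMENTS = {
  "#822[01]"      => '"',
  "#821[67]"      => "'",
  "#8230"         => "...",
  "(#8482|trade)" => "(tm)",
  "(#38|amp)"     => "&",
}

# Standalone variant of convert_entities: takes the text as an argument.
def convert_entities(text)
  result = text.dup
  # Each key is a regex fragment matched between "&" and ";".
  ENTITY_REPLACEMENTS.each do |pattern, plain|
    result.gsub!(/&#{pattern};/, plain)
  end
  # Remove any remaining well-formed entities not covered by the table.
  result.gsub(/&#?\w+;/, "")
end

# Invented sample; &hearts; is not in the table, so it is dropped.
sample = "&#8220;Ruby&#8221; &amp; Rails&#8482;&hearts;"
puts convert_entities(sample)  # => "Ruby" & Rails(tm)
```

The `amp` entry sits last so that the ampersand it produces cannot combine with later text into a new entity that an earlier rule would then rewrite.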
