View Directory | Revision Log | View Changeset
/hippo/hippo-components/htmlcleaner/tags/htmlcleaner-1.02.08/src
Revision 38539 - Directory Listing - [select for diffs]
Modified Tue Feb 26 11:59:35 2013 UTC (7 years, 9 months ago) by fvlankvelt
Diff to previous 38360
CMS7-6961:  copy for tag htmlcleaner-1.02.08
Revision 38360 - Directory Listing - [select for diffs]
Modified Tue Feb 12 13:42:25 2013 UTC (7 years, 9 months ago) by jbloemendal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 38132
CMS7-6909 Improve HTMLRepairer/HtmlCleaner so that it doesn't throw NullPointerException on an unconfigured element

Null-pointer swallowed internally when div cleanup element not configured and warn message logged. Added null check and info logging to inform about missing configuration.
Revision 38132 - Directory Listing - [select for diffs]
Modified Wed Jan 30 23:15:53 2013 UTC (7 years, 10 months ago) by adouma
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 37421
CMS7-6770: cleanup and refresh LICENSE and NOTICE files
- apply rules to Htmlcleaner
Revision 37421 - Directory Listing - [select for diffs]
Modified Tue Dec 11 23:04:55 2012 UTC (7 years, 11 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 34417
HTMLCLEANER-43: AllowSingleWhitespace incorrectly replaces <p>\n<a/></p> with <p>&nbsp;<a/></p>
 - Removed <html><body> from test input for readability
 - Added test for HTMLCLEANER-43
 - SingleWhiteSpaceElement handling now correctly detects child elements after characters
Revision 34417 - Directory Listing - [select for diffs]
Modified Wed May 9 10:41:12 2012 UTC (8 years, 6 months ago) by jbloemendal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 34416
CMS7-6095: root javascript element xss get transformed into head script xss
 
Add test to validate explicit allowed javascript as child of the body element is not getting cleaned.
Revision 34416 - Directory Listing - [select for diffs]
Modified Wed May 9 10:24:21 2012 UTC (8 years, 6 months ago) by jbloemendal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 34047
CMS7-6095: root javascript element xss get transformed into head script xss
 
 XSS which is part of a head element with no following body element is not getting removed. The neko parser transforms the html structure, if no body element is present the HtmlRepairer is throwing an exception. The  html-element cleanup method is split up into html- and head-element, in case the body-element is reached the head-element method is skipping further processing.
Revision 34047 - Directory Listing - [select for diffs]
Modified Wed Apr 11 16:29:58 2012 UTC (8 years, 7 months ago) by fvlankvelt
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 34016
HTMLCLEANER-6: add some unit tests that describe the implemented behavior
Revision 34016 - Directory Listing - [select for diffs]
Modified Tue Apr 10 11:33:03 2012 UTC (8 years, 7 months ago) by jbloemendal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 33994
HTMLCLEANER-39 the cleanupStartElement method is complex and it's descended method calls have duplicate code

The method and sub methods of  cleanupStartElement are refactored and simplified. The elements are passing cleanUpAttributes and are set depending if after the cleanup elements without attributes are deleted or not.
Revision 33994 - Directory Listing - [select for diffs]
Modified Fri Apr 6 13:59:03 2012 UTC (8 years, 8 months ago) by jjoachimsthal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 33986
HTMLCLEANER-39 HTMLCleaner does not check individual class names for special elements
Revision 33986 - Directory Listing - [select for diffs]
Modified Fri Apr 6 13:01:24 2012 UTC (8 years, 8 months ago) by jjoachimsthal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 33984
HTMLCLEANER-38 Add option to allow any CSS class in div, span, pre and p.
Use * as wildcard (which is not allowed as regular CSS classname so it cannot conflict with the output of the WYSIWYG editor)
Revision 33984 - Directory Listing - [select for diffs]
Modified Fri Apr 6 11:16:24 2012 UTC (8 years, 8 months ago) by jjoachimsthal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 26408
HTMLCLEANER-6 add log4j configuration for test purposes
Revision 26408 - Directory Listing - [select for diffs]
Modified Sat Jan 8 12:04:39 2011 UTC (9 years, 10 months ago) by fvlankvelt
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 25479
HTMLCLEANER-35: apply patch

Patch supplied by Marijan Milicevic:

- adding generics (where possible) 
- checks for string.length ==0 instead of equals("") 
- other, small micro performance improvements, like zero array allocations 
- foreach loops instead of for loops (with k, z, etc loop variables, which is really confusing and error prone) 
- removed unnecessary castings 


Revision 25479 - Directory Listing - [select for diffs]
Modified Mon Nov 29 17:11:02 2010 UTC (10 years ago) by jjoachimsthal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 24286
HTMLCLEANER-33 Fix incorrect nested lists from Xinha
Revision 24286 - Directory Listing - [select for diffs]
Modified Tue Oct 12 09:04:09 2010 UTC (10 years, 1 month ago) by jjoachimsthal
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 23868
HTMLCLEANER-32 only add a p in table cell if there is already a block element. Otherwise the cell contains tabular data which doesn't need a p.
Small refactoring for readability and reusing the same String.
Added unittest for this.
Revision 23868 - Directory Listing - [select for diffs]
Modified Tue Sep 7 22:11:11 2010 UTC (10 years, 2 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 23397
HTMLCLEANER-31: Prevent HtmlSerializer from outputting script element as self-closing (<script />)
- added check for hardcoded set of nonSelveClosingEmptyElements (currently contains only 'script')
Revision 23397 - Directory Listing - [select for diffs]
Modified Tue Jul 6 09:37:25 2010 UTC (10 years, 5 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 22580
HTMLCLEANER-30 - <style> element outside of the <head> is removed, but character data is preserved
 - Character data in <style> elements found outside of the <head> are now skipped.
 - Added unit test for <head> and <style> cleanup, as well a lineWidth test
Revision 22580 - Directory Listing - [select for diffs]
Modified Wed Apr 28 11:15:40 2010 UTC (10 years, 7 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 22422
HTMLCLEANER-27 - Whitespace characters (including non-breaking-space) inside body/td/th/blockquote should not be wrapped in a <p> element
- fixed and added test
Revision 22422 - Directory Listing - [select for diffs]
Modified Fri Apr 9 16:18:47 2010 UTC (10 years, 7 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 22410
HTMLCLEANER-23: Created test for newly introduced AllowSingleWhitespaceElement with ignoreTrailing whitespace
Revision 22410 - Directory Listing - [select for diffs]
Modified Fri Apr 9 13:57:02 2010 UTC (10 years, 7 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 22405
HTMLCLEANER-26: Allow easier debugging of intermediate cleanup results
- added slf4j and debug statements for intermediate results of cleanup method
Revision 22405 - Directory Listing - [select for diffs]
Modified Fri Apr 9 13:40:20 2010 UTC (10 years, 7 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 22061
HTMLCLEANER-25: Add possibility for elements like <p> to contain a non-breaking-space + new-line char and NOT be removed by the cleanupBrsAndEmptyContentBlocks step
 - added configuration option (default is true) that allows SingleWhitesSpaceElements (like <p>&nbsp;</p>) to ignore trailing NewLine characters.
- added check to introduceParas step to tests if a char[] inside the body element is actually only whitespace and if so, ignores it instead of wrapping it inside a <p> element
Revision 22061 - Directory Listing - [select for diffs]
Modified Tue Mar 9 17:13:52 2010 UTC (10 years, 8 months ago) by fvlankvelt
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 19753
HTMLCLEANER-21: filter out javascript: value from img tag
Revision 19753 - Directory Listing - [select for diffs]
Modified Thu Sep 24 09:11:27 2009 UTC (11 years, 2 months ago) by ddam
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 18891
HTMLCLEANER-17 : Use allowed attributes for <span> elements
Revision 18891 - Directory Listing - [select for diffs]
Modified Fri Jul 17 10:01:37 2009 UTC (11 years, 4 months ago) by ddam
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 18356
fix HTMLCLEANER-12 : Nullpointer exception when html string is null
Revision 18356 - Directory Listing - [select for diffs]
Modified Wed Jun 3 11:40:02 2009 UTC (11 years, 6 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 18346
HTMLCLEANER-10: compatible with java 1.4
Revision 18346 - Directory Listing - [select for diffs]
Modified Tue Jun 2 15:33:41 2009 UTC (11 years, 6 months ago) by abogaart
Original Path: hippo-components/htmlcleaner/trunk/src
Diff to previous 15595
HTMLCLEANER-10: Add option to configure a set of elements that are allowed to contain a single whitespace character
http://issues.onehippo.com/browse/HTMLCLEANER-10

 - Added option to configure a set of elements that are allowed to contain a single whitespace char and not be removed. 
 - Each element can force the single whitespace char to be a non-breaking space
Revision 15595 - Directory Listing - [select for diffs]
Added Wed Dec 24 10:12:25 2008 UTC (11 years, 11 months ago) by ddam
Original Path: hippo-components/htmlcleaner/trunk/src
HTMLCLEANER-5: initial check in of generic htmlcleaner part

Convenience Links