Clearswift's content analysis is like no other: It can look inside a zip file to discover a Word document which itself may have another embedded spreadsheet detailing company finances or other sensitive information. Providing the spreadsheet is marked in some way (company sensitive), Clearswift's SECURE Web gateway can detect it and prevent it from accidentally leaking out. This depth and quality of analysis that Clearswift is renowned for is included as standard on the SECURE Web Gateway for full outbound threat protection.

Lexical analysis is one of the most powerful capabilities of the SECURE Web Gateway. It works by searching file uploads for key watermarks indicating sensitive data within documents. The image below shows how specific phrases, such as company name, can be detected.

Complex phrases can also be included, using the powerful expression analyser to look for patterns indicating, for example, a customer reference or credit card number e.g. three numeric characters followed by 10 letters and ending with the letter Z.

Pre-defined templates have also been included for the detection of Social Security Numbers, National Insurance Number, Credit Card Number, International Bank Account Numbers.

Pre-defined compliance dictionaries are also provided and include dictionaries for Gramm-Leach-Bliley Act (GLBA), Health Insurance Portability and Accountability Act (HIPAA), Securities and Equities Commission (SEC) and Sarbanes Oxley (SOX).