Varonis Data Classification Engine

Discover Your Riskiest Data

Varonis discovers sensitive content, shows you where it is exposed, and helps you lock it down (and keep it that way) without interrupting business.


Classify sensitive data on-premises and in the cloud

Turn on the lights and see what’s hiding inside your files. Varonis automatically scans and classifies sensitive, regulated information stored in file shares, NAS devices, SharePoint, and Office 365.

Data Classification Engine gives context around sensitive data, so that you can easily identify and lock down overexposed and stale data and remediate security vulnerabilities. Create rules that combine content sensitivity with risk exposure, usage, and file system metadata, so that nothing falls through the cracks.
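
As a rough illustration of what a combined rule might look like, the sketch below pairs a content-sensitivity signal with exposure and usage metadata. It is not the Varonis rule syntax: the `FileRecord` fields, the function names, and the 365-day staleness threshold are all assumptions made for the example.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class FileRecord:
    """Hypothetical per-file summary; field names are illustrative only."""
    path: str
    sensitive_hits: int          # number of classification matches in the file
    open_to_global_group: bool   # e.g. readable by a group such as "Everyone"
    last_accessed: datetime      # taken from usage or file system metadata


def is_overexposed_sensitive(f: FileRecord) -> bool:
    # Content sensitivity combined with permission exposure.
    return f.sensitive_hits > 0 and f.open_to_global_group


def is_stale_sensitive(f: FileRecord, now: datetime, max_idle_days: int = 365) -> bool:
    # Content sensitivity combined with usage metadata (assumed threshold).
    idle = now - f.last_accessed
    return f.sensitive_hits > 0 and idle > timedelta(days=max_idle_days)
```

A file that matches either predicate would be a candidate for review; a file with no sensitive hits matches neither, however widely it is shared.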

Sample data classification results

950+ million files contain sensitive data
339+ million sensitive files are open to global groups
  • CIFS_FS_2 13%
  • CIFS_FS_3 12%
  • CIFS_FS_4 8%
  • SP_FS_1 54%
  • EXCH_FS_1 13%

A vast library of pre-built rules and patterns

Varonis includes a library of almost 50 pre-built rules and more than 400 patterns for all of the common laws and standards (HIPAA, SOX, PCI, GDPR, and more). Varonis has over 340 GDPR patterns alone, covering all of the EU member states.

  •   Personal information: credit card numbers, passport numbers, driver’s license numbers, social security numbers, IBAN, and more
  •   Financial records
  •   Security file types (.cer, .crt, .p7b, etc.)
  •   Regulated data (GDPR, HIPAA, PHI, PCI, Sarbanes-Oxley, GLBA, etc.)

Finding sensitive data is only the beginning

Add context to your classification

Varonis tells you who can access sensitive data, who owns it, and who’s using it – helping you ensure the right level of access and demonstrate to auditors that you’re compliant with regulations.

Comply with access requests

A full-text index and sensitive content search help you comply with requirements like public access requests, GDPR’s Right to be Forgotten, and Subject Access Requests (SARs).

Advanced classification criteria

Our rules contain a complex set of conditions that identify sensitive patterns using regular expressions, proximity of text, and algorithms that validate the correctness of the data. Easily apply custom tags, flags, and notes to datasets; they are accessible in the UI, in reports, and via our API.
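
To make the three ingredients concrete, here is a minimal sketch that combines a regular expression, a text-proximity check, and a validation step for US Social Security numbers. It is an illustration under stated assumptions, not the product's rule engine: the keyword list, the 40-character window, and the function names are invented for the example. The validation relies only on the well-known ranges that are never issued (area 000, 666, or 900-999; group 00; serial 0000).

```python
import re

SSN_PATTERN = re.compile(r"\b(\d{3})-(\d{2})-(\d{4})\b")
KEYWORDS = ("ssn", "social security")   # illustrative positive keywords
WINDOW = 40                             # characters of context on each side


def plausible_ssn(area: str, group: str, serial: str) -> bool:
    # Validation step: reject number ranges that are never issued.
    if area in ("000", "666") or area.startswith("9"):
        return False
    return group != "00" and serial != "0000"


def find_ssn_hits(text: str) -> list[str]:
    hits = []
    lowered = text.lower()
    for m in SSN_PATTERN.finditer(text):
        # Proximity step: look for a keyword near the candidate match.
        context = lowered[max(0, m.start() - WINDOW): m.end() + WINDOW]
        near_keyword = any(k in context for k in KEYWORDS)
        if near_keyword and plausible_ssn(*m.groups()):
            hits.append(m.group(0))
    return hits
```

With this sketch, a well-formed number next to the word "SSN" is reported, while the same digits in an unrelated sentence, or a number in a never-issued range, are not.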

Automate remediation

Once you’ve classified your critical data, Varonis helps remediate security vulnerabilities like inconsistent ACLs and overexposed access to sensitive data. With Automation Engine, organizations have remediated petabytes of overexposed sensitive information in weeks, not years.

Actionable data security

Automatically move data according to business policy, quarantine sensitive or regulated data that is overexposed, and archive or delete stale data that’s no longer being used.

Up-to-date patterns and rules

We’re continually adding patterns (including GDPR patterns), regular expressions, positive keywords, negative keywords, and more. Get out-of-the-box classification policies with regular updates.


Protect your critical data with advanced security analytics

Alert on suspicious and abnormal activity on your sensitive data and get risk assessment insights with deep data context so that you know when something’s not right.


How it works

Truly Incremental Scanning

Data Classification Engine leverages its file activity audit trail to incrementally scan new and modified data without starting from scratch each time, giving you a scalable solution that works quickly and efficiently.
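
The general idea of audit-driven incremental scanning can be sketched as follows. This is a simplified model, not the engine's implementation: the `AuditEvent` record, its action names, and the `paths_to_rescan` helper are hypothetical.

```python
from typing import Iterable, NamedTuple


class AuditEvent(NamedTuple):
    """Hypothetical audit-trail record; real event schemas will differ."""
    path: str
    action: str       # "create", "modify", "delete", "read", ...
    timestamp: float  # seconds since the epoch


def paths_to_rescan(events: Iterable[AuditEvent], last_scan: float) -> set[str]:
    pending: set[str] = set()
    # Replay events in time order so a later delete cancels an earlier modify.
    for e in sorted(events, key=lambda e: e.timestamp):
        if e.timestamp <= last_scan:
            continue  # already covered by the previous scan
        if e.action in ("create", "modify"):
            pending.add(e.path)
        elif e.action == "delete":
            pending.discard(e.path)
    return pending
```

Only files created or modified since the last scan end up in the work list; reads and files that no longer exist are skipped, which is what keeps each pass small.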

Prioritize your riskiest data

Varonis prioritizes scans based on permissions exposure, frequency of activity, and other parameters that you can tune to your requirements. This ensures that you uncover your biggest security risks first, not last.
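
One simple way to picture tunable prioritization is a weighted score per folder. The sketch below is an assumption-laden illustration: the 0-to-1 `exposure` and `activity` inputs, the default weights, and the function names are made up for the example and do not describe the actual scoring used by Varonis.

```python
def risk_score(exposure: float, activity: float,
               w_exposure: float = 0.7, w_activity: float = 0.3) -> float:
    # Weighted sum of two signals, each assumed to be normalized to 0..1.
    return w_exposure * exposure + w_activity * activity


def scan_order(folders, **weights):
    """Return folder names, highest score first.

    `folders` is an iterable of (name, exposure, activity) tuples.
    """
    ranked = sorted(
        folders,
        key=lambda f: risk_score(f[1], f[2], **weights),
        reverse=True,
    )
    return [name for name, _, _ in ranked]
```

Changing the weights changes the queue: with the defaults a widely exposed folder is scanned first, while `scan_order(folders, w_exposure=0.1, w_activity=0.9)` favors the most active one.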

Distributed and multi-threaded

Distributed, multi-threaded collectors perform the scanning, so it can run in close network proximity to the monitored nodes.

Robust file type support

Scan over 60 file types out of the box with Oracle Outside In technology, including documents, spreadsheets, and more.


Frequently Asked Questions

  • What file types do you work with?

    Over 60 file types, including: .doc; .xls; .sxc; .vsd; .stc; .csv; .ods; .rtf; .pdf; .ots; .sti; .txt; .xml; .pps; .ppt; .eml; .sub; .rar; .log; .mdb; .sxw; .accdb; .dwg; .zip; and more.

  • How do you reduce false positives?

    The Varonis Data Classification Engine doesn’t simply regurgitate any string of characters that match one of our patterns: we check the data against known correct examples of the patterns to rule out false positives.

    (Not every 16-digit number is a credit card number, for instance, but the first 4 digits of a 16-digit pattern should match a known credit card issuer.)

  • How do you index the content of a file?

    The Varonis Data Classification Engine uses the Oracle Outside In SDK to read the file types listed above.

    As the Data Classification Engine reads the file, it compares the contents to the rules you have selected: any classification hits are saved to the Varonis database for review in the Varonis DatAdvantage UI.

  • Do you support [insert favorite file type here]?

    If your favorite file type isn’t in the list above, please get in touch for the full list.
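
The credit card example from the false-positives answer above can be sketched in a few lines. This is a minimal illustration, not the engine's actual logic: the prefix table is deliberately tiny and incomplete (Visa "4", Mastercard "51" to "55", Discover "6011"), the function names are hypothetical, and the Luhn checksum is an extra check added here as an assumption, since the answer only mentions issuer matching.

```python
import re

CANDIDATE = re.compile(r"\b\d{16}\b")   # contiguous 16-digit runs only
# Illustrative, incomplete issuer prefixes (an assumption for this sketch).
ISSUER_PREFIXES = ("4", "51", "52", "53", "54", "55", "6011")


def luhn_ok(number: str) -> bool:
    # Luhn checksum: double every second digit from the right.
    total = 0
    for i, ch in enumerate(reversed(number)):
        d = int(ch)
        if i % 2 == 1:
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def likely_card_numbers(text: str) -> list[str]:
    return [
        n for n in CANDIDATE.findall(text)
        if n.startswith(ISSUER_PREFIXES) and luhn_ok(n)
    ]
```

A 16-digit ticket or order number that starts with an unknown prefix, or that fails the checksum, is dropped instead of being reported as a card number.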

Interested in seeing Varonis in action?

Request a demo or contact sales at 877-292-8767