Evaluation Guide

How to Run a DSPM POC

Most DSPM POCs evaluate the wrong things. They test whether the platform can find sensitive data at all, a baseline every platform in the market clears, rather than testing the specific capabilities that will determine whether the platform fits the actual requirements. A POC designed around "does this find PII" will produce a tie between every vendor on the shortlist. A POC designed around the specific coverage gaps, accuracy requirements, and integration scenarios that matter to the organization will produce real differentiation.

Before the POC: what you are actually evaluating

The design question for any DSPM POC is what the platform needs to do that another platform might not. The answer is different for every organization, but the categories are consistent.

Coverage scope. Which data sources matter? If the primary data risk is in cloud data stores and SaaS, every agentless platform will cover it. If on-premises file servers or legacy databases are in scope, that immediately constrains the vendor shortlist to platforms with agent or collector architectures. The POC data set should include the sources the final deployment will need to cover, not just the easy cloud buckets.

Classification accuracy on the hard cases. Any platform will find a Social Security Number that looks like 123-45-6789. The test is what happens with data that requires context: a document that describes medical conditions without using structured PHI fields; a spreadsheet that contains sensitive financial projections written in natural language; a code repository that has hardcoded credentials embedded in a configuration file alongside ten thousand lines of benign code. Run the POC against data that matches the actual complexity of the estate, not a sanitized sample.

False positive rate at scale. A platform that finds 10,000 findings in a 30-day POC is not necessarily better than one that finds 3,000. The question is what fraction of those findings require action. Request false positive rates from each vendor, then validate them on a sample of findings from the POC. A platform with high false positive rates creates remediation backlogs that are never cleared, which means the findings that matter get buried.

Integration with the existing security stack. Where do DSPM findings need to go? If SIEM, CSPM, or ticketing integration is required, test it in the POC. A platform that produces excellent findings but requires manual export to get them into the workflow where they will be acted on is a platform that will be underused.

Designing the POC data set

The POC data set is the most important variable in the evaluation. A data set that does not reflect the actual estate will produce results that do not predict production performance.

Include a representative sample of each data source type in scope

If the production environment includes S3, Snowflake, SharePoint, and Salesforce, include all four in the POC. Platforms that perform well on one source type but poorly on another will not reveal that gap unless all source types are in scope. A common POC mistake is connecting only the data source that is easiest to provision, which is typically the one every platform handles well.

Include intentionally difficult content

Create or identify test data that represents the edge cases the platform will face in production: documents with sensitive data embedded in unexpected structures, data that requires domain knowledge to classify correctly, files where the sensitive content is incidental rather than the primary subject. If unstructured text classification is a requirement, test it with actual unstructured text, not with pre-formatted test data that every classifier handles.

Include shadow data scenarios

If shadow data discovery is a requirement, create the conditions under which it occurs: a storage bucket that exists outside the formal governance catalog, a database copy in a development environment that was not cleaned up, a SaaS export in a cloud storage location that is not tracked. Run the POC against an environment that includes these scenarios and evaluate whether the platform surfaces them without being pointed at them.

Size the data set for meaningful scan completion

A POC data set that is too small produces results that are not statistically meaningful. A POC data set that is too large creates scan completion time problems during the POC window. As a baseline, target enough data to generate at least several hundred distinct findings across multiple sensitivity categories, with enough variety in source types to test cross-source classification consistency. Most vendors will provide guidance on data set sizing for POC design; request it.

The evaluation criteria that differentiate platforms

These are the tests that produce differentiation in most DSPM POCs, after the baseline capability tests that every platform passes.

Novel sensitive data category accuracy. Beyond standard PII and PHI, test against data categories specific to the organization. Financial services firms have data types that generic classifiers miss. Healthcare organizations have clinical documentation that requires domain context. Legal teams have privileged communications that look like ordinary documents. Define two or three organization-specific sensitive data categories and test each platform's accuracy before and after any configuration the vendor provides. Platforms that require significant tuning to handle these categories accurately will require significant ongoing tuning in production.

Cross-source consistency. Connect the same logical data, or data of the same type, in multiple source systems and compare classification results across sources. If the same customer PII is in Salesforce and a Snowflake table, do both surfaces show the same classification label and risk level? Platforms with per-source classification logic often show inconsistent results across source types, a problem that compounds as the deployment scales to more sources.

Remediation workflow completeness. Find a finding that requires action. Follow the full remediation workflow to see what happens: is the finding assigned to an owner, can the owner act on it within the platform or does the workflow exit to a ticketing system, is the finding marked resolved after action and removed from the active queue? The gap between a platform surfacing findings and those findings getting acted on at scale is almost always a workflow problem, not a classification problem. Evaluate the full workflow, not just the finding generation.

Permission context quality. For a sample of sensitive data findings, evaluate the quality of the identity and permission context attached to each finding. Does the platform tell you which specific identities have access, or just that access is overly permissive? Can you see whether the identities with access are service accounts, humans, or external parties? Can you filter findings by identity type? Platforms that surface data-only findings without substantive identity context produce findings that are harder to prioritize.

Scan performance at scale. Ask for reference customer data on scan duration and resource consumption at data volumes similar to the production environment. A platform that takes 30 days to complete an initial full scan of a large estate delivers a quarterly snapshot rather than continuous posture visibility. Request the scan performance data and validate it against a reasonable estimate of the production data volume.

Questions to ask during the POC

These are questions the platform should be able to answer from the POC data set. Run them at the end of the POC period and evaluate the completeness of each answer.

What is the total volume of sensitive data found, broken down by sensitivity category?
Which data stores contain the highest concentration of sensitive data?
Which sensitive data is publicly accessible or accessible to identities that should not have access?
What sensitive data exists in locations outside the formal data governance catalog?
Which identities have access to the most sensitive data, and is that access consistent with their role?
What sensitive data was created or modified in the last 30 days?

Platforms that cannot answer all of these from the POC data, or that require manual export and pivot table analysis to answer them, will require that same workaround in production. Evaluate the answer quality, not just the platform's assertion that the question is answerable.

Common POC failure modes

Grading on presentation, not capability. DSPM vendor POC decks and demos are polished. The vendor's solution engineer knows the platform's best workflows and will show them. The test is whether the platform works without the solution engineer running it. Request time with the platform unguided. At minimum, allocate a day for the internal team to explore without vendor support and evaluate what can be found and acted on independently.

Not testing the integration layer. The DSPM platform's value is only realized if the findings reach the people and systems that act on them. The most common post-deployment complaint about DSPM platforms has little to do with finding sensitive data. Teams complain instead that findings never make it into the workflows where they would actually get acted on. Test every integration in the POC that will be required in production.

Accepting vendor-provided test data. Some vendors will offer pre-configured POC environments or test data sets designed to show their platform in the best light. The POC should use actual production data, or a representative sample drawn from production, in a scope the organization controls. Vendor-provided test environments are demos, not evaluations.

Where this fits

The landscape page covers how the market is structured. The comparisons cover specific vendor trade-offs once a shortlist exists. This guide covers the step in between: the evaluation process that determines whether the shortlist's claimed differentiators hold up against the organization's actual data.