I'm curious what types of sensitive data you scan for, besides personally identifiable information? For example, how do you scan for IP that might be custom to an enterprise?
that's a good question. for this use case, we have a custom models feature where the user can define an entity type (types) they're interested in and then a custom ner model is trained to identify the specified type. for this, a combination of llms and user input are used to generate training data for the ner model.