AWS Macie not identifying PII data properly (Names and SSNs)

0

So I have started using the Macie service to do some data classification for a project I am building. I originally tried it with some JSON data that was put into a text file that contained US SSN numbers. I ran the Macie service to have it try and find this data and the scan did not return any findings.

Next I figured that I would try the Macie scan on a an excel file with three columns. The first column had 5 first names, the second column had 5 last names, and the third column had 5 SSN numbers. Nothing else was in the excel file. I ran the Macie scan again and it still failed to find any sensitive data. I tried using the all managed identifiers scan and just the individual SSN scan and neither of them returned any findings.

Does anyone know what I might be doing wrong and why Macie cant find simple SSN numbers? I am happy to provide more context as well as share the files if it will be helpful (all the SSNs are fake numbers for testing).

질문됨 일 년 전526회 조회
2개 답변
0

Hi - I used a simple .xlsx file with some fake first/last name and fake ssn. Created a job and it was able to identify as below

The object contains personal information such as first or last names, addresses, or identification numbers.

Also review some of the requirements mentioned here https://docs.aws.amazon.com/macie/latest/user/data-classification.html

You can also you can monitor and analyze specific events that occur as a job progresses https://docs.aws.amazon.com/macie/latest/user/discovery-jobs-monitor-cw-logs.html

profile pictureAWS
전문가
답변함 일 년 전
  • Hi Nitin thanks for your response. Could you share with me the excel doc you used and what the data looked like? Thank you for those links I have seen those as I have been investigating this problem. The cloudwatch logs just say that the scan was running and was completed but does not give me any insight into why Macie cant pick up on these sensitive data types.

0

Could you provide the sample file in the form of comma separated values in a comment?

It's worth noting that Macie does have some validation built in to filter out fake numbers. For example, if you entered 123-45-6789 or 000-00-0000 as the SSN, it wouldn't trigger.

AWS
답변함 일 년 전

로그인하지 않았습니다. 로그인해야 답변을 게시할 수 있습니다.

좋은 답변은 질문에 명확하게 답하고 건설적인 피드백을 제공하며 질문자의 전문적인 성장을 장려합니다.

질문 답변하기에 대한 가이드라인

관련 콘텐츠