从您的Elasticsearch索引和可搜索快照中删除字段数据
项目描述
你在Elasticsearch索引中找到了不属于那里的PII(个人信息)吗?这个工具就是为你准备的!
elastic-pii-redacter可以帮助你从甚至可搜索快照挂载的索引中删除信息。它还可以处理深层嵌套的字段!
客户端配置
该工具通过使用Python模块es_client连接。
您可以使用命令行选项或YAML配置文件来配置客户端连接。如果希望使用配置文件,则配置文件结构需要在根级别包含elasticsearch,如下所示
--- elasticsearch: client: hosts: https://10.11.12.13:9200 cloud_id: request_timeout: 60 verify_certs: ca_certs: client_cert: client_key: other_settings: username: password: api_key: id: api_key: token: logging: loglevel: INFO logfile: /path/to/file.log logformat: default blacklist: []
REDACTIONS_FILE 配置
注意:如果在forcemerge下,only_expunge_deletes为True,则任何配置的max_num_segments值都将被忽略,并且仅清除标记为删除的文档。注意这个区别非常重要,因为默认行为是将每个分片合并为1个段。
--- redactions: - job_name_20240506_redact_hot: pattern: hot-* query: {'match': {'message': 'message1'}} fields: ['message'] message: REDACTED expected_docs: 1 restore_settings: {'index.routing.allocation.include._tier_preference': 'data_warm,data_hot,data_content'} - job_name_20240506_redact_cold: pattern: restored-cold-* query: {'match': {'nested.key': 'nested19'}} fields: ['nested.key'] message: REDACTED expected_docs: 1 restore_settings: {'index.routing.allocation.include._tier_preference': 'data_warm,data_hot,data_content'} forcemerge: max_num_segments: 1 - job_name_20240506_redact_frozen: pattern: partial-frozen-* query: {'range': {'number': {'gte': 8, 'lte': 11}}} fields: ['deep.l1.l2.l3'] message: REDACTED expected_docs: 4 forcemerge: only_expunge_deletes: True
项目详情
关闭
哈希值 for elasticsearch_pii_redacter-1.11.0-py3-none-any.whl
算法 | 哈希摘要 | |
---|---|---|
SHA256 | 53a61f6fad7d1a6f7b53e0dc126dd011e9649af171488390c5fc1be940853973 |
|
MD5 | c21ac75eeb956d2cacdabdfd7535baac |
|
BLAKE2b-256 | f7b7deef31c72ce05a042868e5c242f8efe0bb5b836cd1632946aee872ceff81 |