Processing CSV Files

How CSV Files Are Processed
Constraints
Additional Options
Support Matrix
Sample Request
Sample Response

Limina supports scanning Comma Separated Value (CSV) files for PII and creating de-identified or redacted copies. Limina’s supported entity types function across each file type, with localized variants of different PII (Personally Identifiable Information) entities, PHI (Protected Health Information) entities, and PCI (Payment Card Industry) entities being detected. Our Supported Languages and Supported Entity Types page provides a more detailed look.

How CSV Files Are Processed

Similar to Excel files, CSV files are processed using the method described for Tabular Data in the Structured Data Guide. The output file retains its original format with labels in place of the detected PII.

Constraints

Please consider writing a handler for your specific application using the Structured Data Guide to get around any of the constraints listed below.

The file processing routes are synchronous, meaning that large files over 10MB in size may take a long time to process.
The data in the CSV file must be row-oriented (i.e. each row represents a separate record) and the headers must be on the first row.
Files must adhere to the csv file standards.

Additional Options

CSV file processing is additionally supported by a specialized pre-processing parameter called csv_options. This parameter enables a sampling mode that provides efficiencies in overall file processing performance. The sampling feature looks at a selection of cells within each column of the file, predicts the entity type, and redacts the entirety of the column using those detected entity types. This can have a drastic improvement in performance when working with very large CSV files.

Support Matrix

	CPU Container	GPU Container	Community API	Professional API
Supported	Yes	Yes	Up to 250 KiB	No

Sample Request

Connect with one of our privacy experts to run this code.

{
  "file": {
    "data": "<file_content_base64>",
    "content_type": "text/csv"
  },
  "entity_detection": {
    "return_entity": true
  }
}

Sample Response

Response

{
  "processed_file": "Base64 Encoded File Content of the Redacted File",
  "processed_text": "string",
  "entities": "List[Entity]",
  "entities_present": true,
  "languages_detected": {"lang_1": 0.67, "lang_2": 0.74}
}

Processing Audio Files

Processing DCM/DICOM Files

Container Quickstart

Demo

Installation

Configuration & Operations

Product Guides

Entities & Languages

Translations

How CSV Files Are Processed

Constraints

Additional Options

Support Matrix

Sample Request

Sample Response

Container Quickstart

Demo

Installation

Configuration & Operations

Product Guides

Entities & Languages

Translations

Documentation Index

​How CSV Files Are Processed

​Constraints

​Additional Options

​Support Matrix

​Sample Request

​Sample Response

How CSV Files Are Processed

Constraints

Additional Options

Support Matrix

Sample Request

Sample Response