The Data Analyst focuses on post-import data enrichment and quality preparation within EDT. Their daily work centres on running and verifying OCR, language detection, PII detection, named entity recognition, deduplication, pattern recognition, and other data quality operations that transform raw imported data into a clean, enriched, and review-ready data set. They bridge the gap between initial data ingestion and the analytical and review workflows downstream.

| EDT Feature | How Do I... |
|---|---|
| Login & Authentication | Log in to the platform to begin my data enrichment and quality work? |
| Terms & Conditions | Accept the platform terms of use required before accessing case data? |
| Site Home Page | Navigate to the platform landing page and locate the cases requiring data quality work? |
| Case List | Find a specific case where I need to run data enrichment operations? |
| My Cases View | Quickly access the cases where I am actively performing data quality tasks? |
| Case Home Page | Navigate to the Prepare tab and data quality views within a case? |
| Tabs | Switch between the Prepare tab, QA views, and Analysis tab during data enrichment work? |
| Workspaces | Set up a workspace tailored to data quality operations with OCR status, language, and PII detection views? |
| Views | Open the views I need for monitoring OCR queues, reviewing language detection, and verifying PII results? |
| Filters Panel | Filter records by OCR status, detected language, or PII detection results to focus my enrichment work? |
| Layout Panel | View the data quality fields for an individual record such as language, OCR status, and entity detections? |
| User Profile Settings | Configure my display preferences for working efficiently with data quality dashboards? |
| Case Profile Settings | Set case-specific preferences for auto-run search and data quality verification workflows? |
| Keyboard Shortcuts | Use keyboard shortcuts to navigate quickly through data quality queues and detection results? |
| Share via Email | Email a link to a specific data quality issue or detection result to a colleague? |
| Notifications | Monitor the progress of OCR, language detection, and other enrichment jobs? |
| About Dialog | Review case summary details to understand the scope and context of the data I am enriching? |
| Help & Links | Access the online help for data quality and AI/ML feature configuration? |

| EDT Feature | How Do I... |
|---|---|
| QA Tab | Navigate the QA tab to review records requiring data quality remediation after enrichment? |
| QA Status Error Messages | Diagnose why specific records failed OCR or other enrichment processing? |
| QA Tab Actions | Remediate records with data quality issues such as re-running OCR or uploading replacement files? |
| QA Reports | Generate a report documenting data quality issues and the status of enrichment operations? |
| Duplicate Detection | Configure and verify the deduplication strategy to ensure optimal duplicate exclusion across the case? |
| Duplicates View | Analyse MD5-based duplicate groupings to verify that the correct originals have been retained? |
| OCR | Configure and run OCR on image-based documents to maximise the searchability of the data set? |
| Noisy Text Detection | Identify and suppress repeated email footers and disclaimers that degrade search quality? |
| Logo Detection | Review and validate ML-based logo detections to remove non-substantive images from the review set? |
| Spam Detection | Review AI-based spam detections and verify that the algorithm has not incorrectly flagged substantive emails? |
| Alias Detection (Name Normalisation) | Run and refine NLP-based name normalisation to group name variants across email correspondence fields? |
| Language Detection | Run and verify language detection across the data set to classify documents and route foreign-language content for translation? |
| PII Detection | Configure and run PII detection patterns to identify personal information such as phone numbers, credit cards, and government identifiers? |
| Named Entity Recognition (NER) | Run NER to extract people, places, and identification numbers from document text for entity investigation? |
| Pattern Recognition | Configure and apply regex-based pattern recognition with validation rules and supportive words to detect structured data? |
| Flagrant Image Detection | Review AI-based flagrant image detections and confirm that sensitive content has been correctly isolated? |
| Object Detection in Images | Review ML-based object detection results to classify image content across the data set? |
| Container Management | Verify that all containers have been expanded and track container-level processing status? |
| Suppression | Suppress irrelevant or immaterial records from the working data set to improve data quality for reviewers? |
| Move to Folder | Organise records into folders based on data quality status or enrichment outcomes? |
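
The Duplicates View row above refers to MD5-based duplicate groupings. As a rough illustration of the general idea only, and not of how EDT implements it, the sketch below hashes each record's bytes with MD5 and groups records that share a digest. The function names, the `DOC-n` identifiers, and the rule of treating the first-seen record as the retained original are all assumptions made for this example.

```python
import hashlib
from collections import defaultdict


def md5_hex(data: bytes) -> str:
    """Return the MD5 digest of the given bytes as a hex string."""
    return hashlib.md5(data).hexdigest()


def group_duplicates(records: dict[str, bytes]) -> dict[str, list[str]]:
    """Group record IDs by the MD5 digest of their content.

    Only digests shared by more than one record are returned. Within each
    group, IDs keep their input order, so the first ID can be treated as
    the retained original (an assumption of this sketch).
    """
    groups: dict[str, list[str]] = defaultdict(list)
    for doc_id, content in records.items():
        groups[md5_hex(content)].append(doc_id)
    return {digest: ids for digest, ids in groups.items() if len(ids) > 1}


if __name__ == "__main__":
    # Hypothetical records: two byte-identical files and one unique file.
    sample = {"DOC-1": b"hello", "DOC-2": b"hello", "DOC-3": b"world"}
    for digest, ids in group_duplicates(sample).items():
        print(digest, "original:", ids[0], "duplicates:", ids[1:])
```

Checking a grouping by hand then comes down to confirming that every member of a group has the same digest and that the intended original is the one kept.
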

| EDT Feature | How Do I... |
|---|---|
| Generate Summaries | Generate AI-powered document summaries to help reviewers quickly understand record content? |
| Continuous Active Learning (CAL) | Understand how data quality enrichment affects CAL model training and relevance ranking? |
| Detect Concepts | Run NLP concept detection to extract key themes and topics from the enriched data set? |
| Detect Sentiment | Run sentiment detection to classify the emotional tone of communications across the data set? |
| Detect Languages | Configure and verify language detection to accurately classify documents across 80+ languages? |
| Named Entity Recognition | Run and validate NER to extract people, places, and personal identifiers from document text? |
| Detect Logos | Run logo detection on email images and verify that non-substantive logos have been correctly identified? |
| Detect Flagrant Images | Run flagrant image detection and review confidence scores to validate the results? |
| Detect Objects in Images | Run object detection on images and review classification labels applied to the data? |
| Record Clustering | Run ML-based record clustering to group textually similar documents for thematic analysis? |
| Transcription | Configure and run speech-to-text transcription on audio and video files in the data set? |
| Translation | Run machine translation on foreign-language documents identified by language detection? |
| Pattern Recognition | Configure regex patterns with Luhn validation and supportive words for high-confidence PII detection? |
| Alias Detection | Run NLP-based alias detection to group name variants and improve people-based search accuracy? |
| Spam Detection | Run and validate the AI spam detection algorithm across imported email data? |
| Intelligent Processing | Configure and enable the full suite of Intelligent Processing capabilities during or after import? |
| OCR | Configure OCR queue routing, PDF character thresholds, and run OCR on image-based documents? |
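
The Pattern Recognition row above combines three ideas: a regex pattern, Luhn validation, and supportive words. The sketch below shows how these can work together to detect card numbers. It is a minimal illustration under stated assumptions, not EDT's configuration or algorithm: the regex, the supportive word list, the 40-character context window, and the "high"/"low" confidence labels are all hypothetical.

```python
import re

# Hypothetical pattern: 13 to 19 digits, optionally separated by spaces or hyphens.
CARD_PATTERN = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")

# Hypothetical supportive words that raise confidence when found near a match.
SUPPORTIVE_WORDS = ("card", "visa", "mastercard", "credit")


def luhn_valid(number: str) -> bool:
    """Return True if the digits in `number` pass the Luhn checksum."""
    digits = [int(ch) for ch in number if ch.isdigit()]
    if not digits:
        return False
    total = 0
    # Walk from the rightmost digit, doubling every second digit.
    for index, digit in enumerate(reversed(digits)):
        if index % 2 == 1:
            digit *= 2
            if digit > 9:
                digit -= 9
        total += digit
    return total % 10 == 0


def find_card_numbers(text: str, window: int = 40) -> list[tuple[str, str]]:
    """Return (match, confidence) pairs for Luhn-valid card-like numbers."""
    results = []
    lowered = text.lower()
    for match in CARD_PATTERN.finditer(text):
        candidate = match.group()
        if not luhn_valid(candidate):
            continue  # The regex matched, but the checksum rejects it.
        start = max(0, match.start() - window)
        end = min(len(text), match.end() + window)
        context = lowered[start:end]
        has_support = any(word in context for word in SUPPORTIVE_WORDS)
        results.append((candidate, "high" if has_support else "low"))
    return results


if __name__ == "__main__":
    print(find_card_numbers("Visa card 4111 1111 1111 1111 on file"))
    print(find_card_numbers("ref 4111 1111 1111 1112"))
```

The checksum filters out digit runs that merely look like card numbers, and the supportive words separate likely hits from coincidental ones. This is why the combination is described as high-confidence detection.
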

| EDT Feature | How Do I... |
|---|---|
| Case Creation | Understand the case creation settings that affect data quality operations? |
| Case Templates | Apply a case template that includes standard data quality and enrichment configurations? |
| Case Lifecycle | Understand the case lifecycle stages and when data enrichment should be performed? |
| Custom Tabs | Request a custom tab layout that supports data quality and enrichment workflows? |
| Workspaces | Configure workspaces for data quality monitoring with OCR, language, PII, and entity views? |
| Fields | Understand which fields are populated by data quality and enrichment operations? |
| Tags | Use tags to classify records by data quality status such as OCR Complete, PII Detected, or Translation Required? |
| Layouts | Review the layout configuration used for data quality coding and enrichment status tracking? |
| Folders | Organise records into folders based on data quality outcomes or enrichment stage? |
| Quarantine Folders | Understand how quarantine folders isolate records flagged during data quality operations? |
| Document Sets | Create document sets to group records by enrichment category or data quality milestone? |
| Groups & Permissions | Understand which permissions I need to run OCR, PII detection, and other enrichment features? |
| Case Settings | Review case settings that affect data quality operations such as time zone and search index configuration? |
| Coding Rules | Understand how coding rules automatically populate fields based on data quality enrichment results? |
| Case Rules | Understand how case rules reflect data quality milestones at the case level? |
| Import Rules | Understand the import validation rules that affect initial data quality during ingestion? |
| Export Rules | Understand export rules that may reference data quality fields? |

| EDT Feature | How Do I... |
|---|---|
| Staging Tab | Review staged data to understand what is queued for import and what enrichment will be needed? |
| Itemisation Logs | Review itemisation reports to anticipate the data quality work required after import? |
| Item Kinds & Types | Assess file type composition to plan OCR, language detection, and other enrichment operations? |
| Assign Custodian | Verify that custodian assignments are correct before running enrichment operations? |
| Import Unprocessed Files | Understand the import options that affect downstream data quality such as OCR settings and Intelligent Processing? |
| Import Load Files | Understand how load file imports affect data quality fields and enrichment workflows? |
| Simplified Load File Import | Understand how simplified imports interact with data quality and enrichment features? |
| Import Cellebrite | Understand data quality considerations specific to Cellebrite mobile data imports? |
| Import QA (Reimport) | Reimport records after resolving data quality issues to update their enrichment status? |
| Create Record | Understand how manually created records are handled by data quality enrichment workflows? |
| Import Templates | Review import templates to verify that Intelligent Processing and data quality options are correctly configured? |
| Container Expansion | Verify that all containers have been expanded so their contents can be enriched? |
| Full-Text Extraction | Verify that text extraction succeeded so that NLP-based enrichment can operate on full text? |
| Metadata Extraction | Verify that metadata extraction provides the fields needed for data quality operations? |
| EXIF Data Extraction | Confirm that EXIF data has been extracted from images for location and device analysis? |
| File Type Detection | Verify file type detection accuracy to ensure the correct enrichment operations are applied? |
| DeNISTing | Verify that DeNISTing has excluded known system files before enrichment operations run? |
| Password Handling | Confirm that encrypted files have been decrypted so their content is available for enrichment? |
| Hard Deleted Recovery | Verify that recovered deleted items are included in the enrichment pipeline? |
| Malware Scanning | Confirm that malware scanning has cleared files before they enter enrichment processing? |
| Automated Import via API | Understand how API-triggered imports interact with data quality and enrichment workflows? |

| EDT Feature | How Do I... |
|---|---|
| Search Bar | Search for records by enrichment status such as OCR pending, language detected, or PII found? |
| Filters Panel | Filter records by language, PII detection results, or entity type to verify enrichment coverage? |
| Advanced Search | Build a query to find all records where a specific PII pattern was detected or OCR has not yet run? |
| Saved Searches | Save a search for records requiring enrichment attention to track outstanding data quality work? |
| Search History | Review my previous data quality verification searches? |
| Word Variations | Test stemming and wildcard coverage to verify that OCR text extraction supports expected search variations? |
| Document ID Filter | Look up a specific record to verify its enrichment status and data quality fields? |
| Document ID Lists | Upload a list of Document IDs to check enrichment status for a batch of records? |
| MD5 Lists | Upload a hash list to verify which duplicate files have been enriched versus excluded? |
| Location Filter | Navigate the source folder hierarchy to verify enrichment coverage by data source? |
| Alias Recipient "Only" | Search for communications between specific normalised aliases to verify alias detection accuracy? |
| Sample from Saved Search | Take a random sample from enriched data to perform quality assurance on detection results? |

| EDT Feature | How Do I... |
|---|---|
| Concepts View | Review the key concepts extracted from the enriched data set to verify NLP quality? |
| Clusters Tab | Review document clusters to assess whether data quality enrichment has improved thematic grouping? |
| Communications View | Analyse communication patterns using normalised aliases to verify alias detection quality? |
| Timeline View | Visualise the date distribution of enriched data to identify any gaps or anomalies? |
| Chronology View | Review or update chronology events related to data quality milestones? |
| Custodians View | Assess enrichment coverage by custodian to ensure all data sources have been processed? |
| File Types View | Analyse file type distribution to verify that OCR and enrichment have been applied to the correct types? |
| Detected Languages View | Review the language distribution chart to verify language detection accuracy and translation needs? |
| File Size View | Identify large files that may require special handling during enrichment operations? |
| Named Entities View | Review the named entities extracted by NER to validate detection accuracy? |
| Detect Sentiment | Review sentiment detection results across the enriched data set? |
| Record Clustering | Verify that clustering results improve after data quality enrichment has been applied? |
| Similar Content View | Use similarity scores to identify near-duplicates that may need additional deduplication analysis? |
| Compare Records View | Compare two near-duplicate documents to assess whether deduplication decisions are correct? |
| Email Threading | Review email threading to verify that enrichment has not disrupted thread integrity? |
| Detect Concepts | Validate that concept detection is producing meaningful results on the enriched data? |

| EDT Feature | How Do I... |
|---|---|
| Markup Tab | Access the markup tools to review or apply redactions based on PII detection results? |
| Markup Sets | Understand how markup sets separate PII redactions from other annotation types? |
| Redaction Tools | Apply manual redactions to PII that was identified during data quality enrichment? |
| Redaction Reasons | Assign a redaction reason such as "Personal Information" when redacting detected PII? |
| Redaction View Mode | Toggle transparent view to verify that PII redactions correctly cover the detected content? |
| 12 Annotation Types | Use annotation tools to highlight or flag data quality issues on specific records? |
| Convert Markup Types | Convert a highlight annotation to a redaction after confirming PII content? |
| Search and Redact | Search for a detected PII pattern within a document and redact all instances? |
| Auto Markup (Pattern Recognition) | Apply automatic redactions based on PII patterns detected by pattern recognition? |
| Auto Markup (Lists) | Apply automatic redactions from keyword lists containing sensitive terms? |
| Comments on Markups | Add a comment to a PII redaction to document the detection method or verification status? |
| Markup Keyboard Navigation | Navigate between PII redactions on a document using keyboard shortcuts? |
| Copy Markup | Copy PII redactions from one record to another similar document? |
| Burn Markup | Permanently render PII redactions into PDF renditions for production? |
| Image Adjustment | Adjust image quality on poorly scanned documents to improve OCR accuracy? |
| Rotate Document/Page | Rotate misaligned pages to improve OCR processing accuracy? |
| Download with Markup | Download a record with PII redactions applied for offline review? |
| Auto-Save Markup | Ensure that PII redaction work is automatically saved during review? |

| EDT Feature | How Do I... |
|---|---|
| Participants View | View and manage the participants associated with the case for alias normalisation context? |
| Participant Types | Understand the different participant types to support alias detection and entity grouping? |
| Roles | Review participant roles to understand how alias detection maps to case participants? |
| Add Participants | Add participants to the case based on entities discovered during NER and alias detection? |
| Representation Tracking | Review participant representation relationships identified through data enrichment? |
| Participant Details | Update participant details based on information discovered through NER and alias detection? |
| Alias Detection | Run and refine NLP-based alias detection to normalise name variants across all correspondence fields? |
| Alias Operations | Create, merge, verify, and manage aliases to maintain accurate people normalisation? |
| Communications View | Verify that communications analysis reflects correctly normalised alias groupings? |
| Persons (Site Level) | Review site-level person records for cross-case alias normalisation consistency? |
| Orgs (Site Level) | Review site-level organisation records to support entity enrichment? |
| Chronology Integration | Link discovered entities and participants to chronology events? |
| Download Participants | Download the participant list to verify alignment with NER and alias detection results? |

| EDT Feature | How Do I... |
|---|---|
| Report Builder View | Build a custom report showing data quality metrics such as OCR completion, language distribution, or PII detection counts? |
| Grid Download | Download a filtered list of records with enrichment status fields to Excel for operational reporting? |
| Chronology View | View or update chronology events related to data enrichment milestones? |
| PDF Presenter | Present enriched data quality findings in a briefing context? |
| Exhibit Lists | Generate an exhibit list from the enriched and quality-verified data set? |
| Case List Report | Download a report of cases to track data quality work across the portfolio? |
| Case Stats | View case-level statistics to monitor data quality and enrichment progress? |
| QA Reports | Generate QA reports documenting data quality enrichment outcomes and outstanding issues? |
| Import Details | Review import details to understand the starting point for data quality enrichment? |
| Export Summary | Review export summaries to verify that enriched data was correctly produced? |
| Participant Download | Download a participant list enriched through alias detection and NER? |
| Audit Reports | Generate audit reports documenting data quality enrichment activities and decisions? |
| Review Pool Reports | Monitor review progress on data that has completed quality enrichment? |
| Record Audit | View the audit trail for a record to trace all enrichment operations applied to it? |
| Download Charts as PNG | Export data quality charts such as language distribution or entity counts as images for reports? |
| Print Records | Print a set of enriched records as a combined PDF for physical review? |