IntuView Ltd.





Related Information


Technical Information
Supported Formats:
textual pdf, htm, html, mht, doc, txt, rtf, xls, pst
Supported Languages:
Arabic, Malay/Indonesian. Planned to be supported soon - Farsi, Urdu, Pashtu
Environment:
Standalone version: Windows XP, Minimum 4Gb RAM, No need for external communication.
Server version: Windows Server 2003, Minimum 8Gb RAM, Multiple cores (as much as needed), No need for external communication.
API is available for Java, C++ and .NET

Home > IntuScan™ Platform
IntuScan™ Platform
IntuScan™ Platform is a unique platform implemented currently for the domain of counter terrorism (CT). It incorporates Natural Language Processing (NLP) technology for processing textual materials in relevant languages (e.g. Arabic, Farsi, Urdu, Pashtu, Malay/Indonesian, etc.) along with relevant domain knowledge bases and domain-specific algorithms and rules for processing the contents of the texts. Hence, IntuScan™ is a "plug and play" platform that enables the user to use it immediately after the initial installation, with minimal training and with almost no necessary customization.


Further more, the IntuScan™ Platform contains -
  • NLP algorithms for neo-classical Arabic, which is mostly used in the relevant domain.
  • Large domain-specific training-set of human analyzed documents.
  • Domain specific lexicons and ontology sets.
  • Statistical models based on the ontological occurrences (not the usual "bag of words") in the training-set.
  • Rules that deal with linguistic or contextual "trumps", which challenge and override the statistical judgment and provide learning capabilities.



Although all those provided knowledge bases are developed by IntuView's domain experts, IntuScan™ Platform includes customization tools, for building user-defined content layers, which allow the user to enrich and enlarge any part of the knowledge bases according to his needs.


The provided knowledge bases are updated on daily basis and are provided to the user periodically according to the contract. The content update affects the internal knowledge bases only and does not affect the integrity of the user-defined content.


In addition, IntuView offers professional services for developing customer tailored knowledge bases on demand. Please contact us to learn more on our services.


IntuScan™ currently analyses documents with CT content, however future modules will be able to deal with additional domains. IntuScan™ can also be installed as a generic platform for processing documents based only on user-defined content.


IntuScan™ supports various digital input formats as well as hard copies, which first must be scanned and processed by an external Optical Character Recognition (OCR) software.



The processing of the documents includes:

  • Triage, categorization and prioritization of the examined material according to relevancy and urgency for the user.
  • Identification of the authorship and date of writing.
  • Detection of topics and events discussed in the text.
  • Recognition of political and ideological leanings.
  • Extraction of named entities and contextual information about them
  • Exegesis of the hermeneutics of religious, cultural or just professional or domain-specific allusions implicit within the text
  • Structured presentation of the information in textual summaries or data-base formats.

Current potential uses of IntuScan™ include:


1. Strategic Intelligence:
  • Cover of OSINT, blogs, websites, radicalization media.
  • Triage and real-time first-tier analysis of intercept.
  • Support for analysts in understanding hermeneutics of collected material.

2. Tactical Intelligence:
  • Immediate triage and extraction of Intel from captured documents in internal security operations.
  • Triage of captured material in military operations.

3. Border-control – warning regarding terrorist-related material brought into the country.
4. Police – first indicator of possible terrorist relevance of material caught on a suspect.
5. Prisons - prevention of infiltration of radicalizing material.