Text data mining (TDM) by text analysis, information extraction, document mining, text comparison, text visualization and topic modelling
The search engine automatically extracts text from different file formats and uses grammar rules (stemming) to index and find different word forms.
Based on this index you can search, review, filter, analyze and mine content with different text mining, analysis, extraction, data mining and clustering methods.
So the search engine is not limited to information retrieval (full-text search to find known items) or to information extraction (getting structured data from unstructured data sources or texts). It can also be used as an integrated text mining toolbox for text data mining (TDM): semi-automated or automated text analysis, document mining, text comparison, text visualization and topic modelling that yield useful analysis results even for unknown data sources.
Search and filter the interesting documents
If you don't want to analyze all indexed documents, you can search and filter the context you want to mine and analyze.
Words: Word list and word cloud
The Words view (an option of the Analyze tab/button) shows the words that occur in the most documents within your search context (the documents matching your search query and filters).
If you enter no search query and use no filters, it shows the words that occur in the most documents of the whole index.
The number next to each word shows how many documents use that word, counted over the documents matching your search query and filters or, if neither is set, over all documents.
If you click on a word, this word will be added as an additional filter.
The words are visualized as a word cloud. The more documents contain a word, the bigger it appears in the visualization.
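The counting behind such a word list can be sketched in a few lines of Python. This is only an illustration of document frequency (the number of documents containing a word), not the search engine's own implementation. The function name `document_frequencies` and the sample texts are made up for this example.

```python
import re
from collections import Counter


def document_frequencies(documents):
    """Count, for each word, the number of documents that contain it."""
    counts = Counter()
    for text in documents:
        # A set makes sure each word is counted only once per document.
        words = set(re.findall(r"\w+", text.lower()))
        counts.update(words)
    return counts


# Hypothetical sample documents standing in for search results.
docs = [
    "The contract was signed in Berlin.",
    "Berlin hosts the annual conference.",
    "The conference report is attached.",
]

df = document_frequencies(docs)
for word, number_of_documents in df.most_common(5):
    print(word, number_of_documents)
```

A word cloud would then scale the font size of each word with its document count.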
Aggregated overviews of extracted structured information, named entities and concepts for exploratory search (faceted search based on thesauri, ontologies and machine learning for automatic classification)
With the faceted search you can see an aggregated overview of the different facets like paths, concepts, persons, locations or organizations, showing how many documents match each named entity.
This structure is generated, and its facets/fields are filled with data, from the following analyses:
- Lists of Named Entities: Listed known named entities like organizations, persons, locations or concepts. They can be managed in plaintext lists, databases, ontologies, thesauri or in the thesaurus user interface for dictionary based or thesaurus based text mining and thesaurus based faceted search
- Annotation & Tagging: Tags from (collaborative) annotations and tagging
- Text patterns (Regular Expressions): Extraction of structured data or data enrichment with text patterns (regular expressions) can extract information like email addresses or amounts of money. The matches are added to facets like Email addresses, From:, To: or money.
- Named entity extraction or Named entity recognition (NER) of even yet unknown entities like persons, organizations or locations by automatic classification of these text parts by machine learning on an annotated training corpus model.
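Once documents carry such facet values, the aggregated overview is a count per facet value. The following is a minimal sketch, assuming each indexed document is represented as a dictionary mapping a facet name to a list of values. The facet names, the sample entities and the function `facet_counts` are hypothetical and do not reflect the search engine's internal data model.

```python
from collections import Counter, defaultdict


def facet_counts(documents):
    """For each facet, count how many documents mention each value."""
    facets = defaultdict(Counter)
    for document in documents:
        for facet, values in document.items():
            # set() counts a value once per document, even if listed twice.
            facets[facet].update(set(values))
    return facets


# Hypothetical enriched documents (facet name -> extracted values).
indexed = [
    {"persons": ["Ada Lovelace"], "locations": ["London"]},
    {"persons": ["Ada Lovelace", "Alan Turing"], "locations": ["London", "Manchester"]},
    {"persons": ["Alan Turing"], "locations": ["Manchester"]},
]

counts = facet_counts(indexed)
for facet, values in counts.items():
    print(facet, dict(values))
```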
Topic modelling (clustering and differences)
Coming soon (please donate so we can implement this sooner):
Topic modelling (clusters of the topics that the documents are about)
What are the contents about? What are the most common topics in the whole, selected or filtered document set?
Co-occurrence (connected words): Which words occur together (bigrams/trigrams/n-grams)?
What is special in comparison with another text or document set? See 'Compare text or part of the corpus with other text or part of corpus'.
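As an illustration of the co-occurrence question, counting n-grams (sequences of n consecutive words) takes only a few lines. This is a generic sketch, not the planned implementation. The function name `ngram_counts` and the sample sentence are assumptions made for this example.

```python
import re
from collections import Counter


def ngram_counts(text, n=2):
    """Count sequences of n consecutive words (n=2: bigrams, n=3: trigrams)."""
    words = re.findall(r"\w+", text.lower())
    # Zipping n shifted copies of the word list yields consecutive n-word tuples.
    grams = zip(*(words[i:] for i in range(n)))
    return Counter(grams)


# Hypothetical sample text.
text = "open source search engine and open source text mining"

bigrams = ngram_counts(text, 2)
trigrams = ngram_counts(text, 3)
print(bigrams.most_common(3))
```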
Similarity ('more like this')
Coming soon (please donate so we can implement this sooner):
Search with a whole document or text as a search query:
If you have not done so yet, index the document that should be used as the search query.
Search for that document (e.g. by filename).
Find similar text or documents about the same topics by clicking on 'more like this'.
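The idea behind 'more like this' can be illustrated with a simple bag-of-words cosine similarity: the document used as the query is compared with every other document, and the most similar ones are listed first. This is a simplified sketch and an assumption about one possible approach, not the ranking the search engine uses. All names (`vectorize`, `cosine_similarity`, `more_like_this`) and the sample corpus are hypothetical.

```python
import math
import re
from collections import Counter


def vectorize(text):
    """Turn a text into a bag-of-words vector (word -> count)."""
    return Counter(re.findall(r"\w+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two word-count vectors (0.0 to 1.0)."""
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def more_like_this(query_name, corpus):
    """Rank all other documents by similarity to the query document."""
    query = vectorize(corpus[query_name])
    scores = [
        (cosine_similarity(query, vectorize(text)), name)
        for name, text in corpus.items()
        if name != query_name
    ]
    return sorted(scores, reverse=True)


# Hypothetical corpus: filename -> extracted text.
corpus = {
    "a.txt": "tax report for the fiscal year",
    "b.txt": "annual tax report and fiscal summary",
    "c.txt": "holiday photos from the beach",
}

ranked = more_like_this("a.txt", corpus)
for score, name in ranked:
    print(f"{name}: {score:.2f}")
```

Real systems usually weight rare words higher than frequent ones (for example with TF-IDF), which this sketch leaves out for brevity.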
Direct text comparison: Differences between two text versions (visualization of added, deleted or copy-pasted parts)
Compare two texts or versions to show differences, identical or copied passages, and deleted or added words.
Coming soon (please donate so we can implement this sooner):
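Until this feature is available, a direct comparison of two text versions can be sketched with `difflib` from the Python standard library. The example below is an independent illustration, not part of the search engine. The function name `word_diff` and the two sample sentences are made up.

```python
import difflib


def word_diff(old_text, new_text):
    """List the word-level changes needed to turn old_text into new_text."""
    old_words = old_text.split()
    new_words = new_text.split()
    matcher = difflib.SequenceMatcher(a=old_words, b=new_words)
    changes = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            continue  # unchanged passage, nothing to report
        # tag is one of "insert", "delete" or "replace"
        changes.append((tag, " ".join(old_words[i1:i2]), " ".join(new_words[j1:j2])))
    return changes


# Hypothetical text versions.
old = "The report was sent on Monday"
new = "The final report was sent on Tuesday"

changes = word_diff(old, new)
for tag, removed, added in changes:
    print(tag, repr(removed), "->", repr(added))
```

The passages tagged `equal`, which are skipped here, are the same or copied parts that the two versions share.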
Document set comparison (show differences like overrepresented terms)
Coming soon (please donate so we can implement this sooner):
Special focus of a text or document set (text corpus) by comparison with another text or document set (text corpus).
Show differences, focal points, core areas and key aspects by comparing word frequencies to find out which concepts or entities are overrepresented in documents in comparison to other documents or another text corpus.
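One simple way to find overrepresented terms is to compare the relative frequency of each word in the focus set with its relative frequency in the reference set. The sketch below uses add-one smoothing so that words missing from the reference set do not cause a division by zero. This scoring is an assumption chosen for illustration, not the method the software will implement. The function names and sample texts are hypothetical.

```python
import re
from collections import Counter


def word_counts(texts):
    """Count all words over a list of texts."""
    counts = Counter()
    for text in texts:
        counts.update(re.findall(r"\w+", text.lower()))
    return counts


def overrepresented_terms(focus_texts, reference_texts, top=5):
    """Rank words by the ratio of their smoothed relative frequencies."""
    focus = word_counts(focus_texts)
    reference = word_counts(reference_texts)
    focus_total = sum(focus.values())
    reference_total = sum(reference.values())
    vocabulary = len(set(focus) | set(reference))
    ratios = {}
    for word, count in focus.items():
        # Add-one smoothing: every word gets one extra virtual occurrence.
        p_focus = (count + 1) / (focus_total + vocabulary)
        p_reference = (reference[word] + 1) / (reference_total + vocabulary)
        ratios[word] = p_focus / p_reference
    return sorted(ratios.items(), key=lambda item: item[1], reverse=True)[:top]


# Hypothetical document sets.
focus_set = ["the merger was approved", "the merger talks continue"]
reference_set = ["the weather was fine", "the talks were short"]

top_terms = overrepresented_terms(focus_set, reference_set)
for word, ratio in top_terms:
    print(f"{word}: {ratio:.2f}")
```

A ratio well above 1 marks a word that is more typical for the focus set than for the reference set. Common words like "the" end up near 1.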
Extract text patterns with Regular Expressions (RegEx)
You can extract structured data from unstructured text, e.g. for aggregated overviews, interactive navigation and interactive filters (faceted search), data analysis and data visualization. To do so, define text patterns with regular expressions (RegEx), or write your own regular expression based enhancer plugins, which extract the interesting text parts into structured fields, properties or facets.
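What such a pattern-based extraction does can be sketched in plain Python. The two patterns below are deliberately simple assumptions (real email addresses and money formats are more varied). The facet names `email` and `money` and the function `extract_facets` are hypothetical and do not reflect the plugin interface of the search engine.

```python
import re

# Simplified illustrative patterns, not the ones shipped with the software.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
MONEY_PATTERN = re.compile(r"(?:\$|€|EUR\s?|USD\s?)\d+(?:[.,]\d{3})*(?:[.,]\d{2})?")


def extract_facets(text):
    """Extract structured values from plain text into named facets."""
    return {
        "email": EMAIL_PATTERN.findall(text),
        "money": MONEY_PATTERN.findall(text),
    }


# Hypothetical document text.
text = (
    "Contact alice@example.org or bob@example.com. "
    "The invoice totals $1,200.50 and the deposit was EUR 300."
)

facets = extract_facets(text)
print(facets)
```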
Advanced text analysis, text mining, document mining and text visualizations
Advanced features like clustering, network analysis and advanced visualizations need more CPU time, more parameters and knowledge, and specialized tools for the different analyses, so you have to start them manually for your documents or for a specific search context.
But many advanced text mining tools support only a few document formats and data formats and do not run optical character recognition (OCR) automatically.
Since this free software is interoperable open source software and uses open standards, you are free to integrate additional data enrichment or data analysis plugins, or to use other specialized tools on top of the (exportable) text extraction, data enrichment, search and filter results of the search engine.
How to explore and analyse a document collection with external text mining tools?
After automatic extraction, indexing, analysis (e.g. optical character recognition by OCR engines) and enrichment (e.g. with named entities or extracted email addresses), you can do advanced text analysis, text mining and document mining with these specialized tools, based on an export of all data, of search results or of filtered results:
- Search and filter/drill down the interesting document set (or do not, if you want to analyze all documents)
- Export these search results to a CSV file. Select the interesting fields like id, title, persons, organizations and, most importantly, the fields content and ocr_t
- Import the CSV into other open source text mining tools and use the extracted text data with natural language processing (NLP), machine learning (ML), named entity recognition (NER) or classification libraries, until some of their advanced machine learning methods for text mining are integrated into the user interface
- Use their advanced features and views, for example different views from Jigsaw
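The import step can be sketched with the `csv` module from the Python standard library. The example assumes that the export has a header row and contains the fields id, title, content and ocr_t named above. The exact column layout of a real export may differ, and the function name `load_export` and the sample rows are made up for this illustration.

```python
import csv
import io


def load_export(csv_file):
    """Read exported search results and merge extracted text and OCR text."""
    documents = []
    for row in csv.DictReader(csv_file):
        # Combine the extracted text (content) and the OCR text (ocr_t),
        # skipping whichever of the two is empty or missing.
        parts = (row.get("content"), row.get("ocr_t"))
        text = " ".join(part for part in parts if part)
        documents.append({"id": row.get("id"), "title": row.get("title"), "text": text})
    return documents


# Hypothetical export; in practice use open("export.csv", newline="", encoding="utf-8").
sample = io.StringIO(
    "id,title,content,ocr_t\n"
    "1,Report,Quarterly figures attached,\n"
    "2,Scan,,Text recognized from the scanned page\n"
)

documents = load_export(sample)
for document in documents:
    print(document["id"], document["text"])
```

The resulting list of texts can then be passed to libraries such as NLTK or Gensim from the list below.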
Free Software and Open Source text analytics and text mining toolkits and platforms or text mining solutions
Alternative Free Software and Open Source text analytics and text mining toolkits or text mining platforms:
Text mining platforms
- Gate - General architecture for text engineering
Open source components for natural language processing (NLP), clustering and classification (machine learning)
Open source frameworks & programming libraries or APIs for natural language processing (NLP), clustering and classification (machine learning):
- Apache Solr (Java based REST-API)
- Elasticsearch
- Apache UIMA - Unstructured Information Management Architecture for information extraction
- DKPro - Text mining framework (Java and UIMA)
- OpenNLP - Command line tools and Java library
- Python Natural Language Toolkit (NLTK) - Natural language processing library (Python)
- Gensim - Topic modelling programming library (Python)
- Mallet (Java)
- Apache Mahout (Java)
- Apache Spark (Scala and Java, but APIs for Python, too)
- Apache Stanbol
More: see the Text Analysis Portal for Research or Wikipedia's list of text mining software