Compare the Top Unstructured Data Analysis Tools in China as of June 2025

What are Unstructured Data Analysis Tools in China?

Unstructured data analysis tools help organizations process and extract insights from data that lacks a predefined format, such as text, images, and audio. Leveraging AI, machine learning, and natural language processing, these tools identify patterns, sentiments, and trends within vast amounts of raw information. They are widely used for tasks like sentiment analysis, document classification, and image recognition, enabling businesses to make data-driven decisions from complex, unstructured datasets. Unstructured data analysis tools can also be used to process unstructured data for use in LLM RAG. Compare and read user reviews of the best Unstructured Data Analysis tools in China currently available using the table below. This list is updated regularly.

  • 1
    MongoDB Atlas
    The most innovative cloud database service on the market, with unmatched data distribution and mobility across AWS, Azure, and Google Cloud, built-in automation for resource and workload optimization, and so much more. MongoDB Atlas is the global cloud database service for modern applications. Deploy fully managed MongoDB across AWS, Google Cloud, and Azure with best-in-class automation and proven practices that guarantee availability, scalability, and compliance with the most demanding data security and privacy standards. The best way to deploy, run, and scale MongoDB in the cloud. MongoDB Atlas offers built-in security controls for all your data. Enable enterprise-grade features to integrate with your existing security protocols and compliance standards. With MongoDB Atlas, your data is protected with preconfigured security features for authentication, authorization, encryption, and more.
    Starting Price: $0.08/hour
    View Tool
    Visit Website
  • 2
    Medallia

    Medallia

    Medallia

    Medallia allows you to thoughtfully and systematically engage your users with targeted, in-the-moment surveys across digital and traditional touchpoints. Our easily implemented survey solutions ensure you're gathering relevant, actionable data to make measurable customer impact. Once the customer survey data is collected, Medallia's AI technology uses machine learning to analyze structured and unstructured data to uncover sentiment, find commonalities, predict behavior, anticipate needs and prescribe actions to improve experiences. Build the most effective surveys for your customer journeys. Rapidly manage change and innovation to every aspect of your experience management program—from design to emails, questions and translations—with sophisticated targeting logic, flexible conditioning and distribution. Medallia surveys allow you to
  • 3
    Dataleyk

    Dataleyk

    Dataleyk

    Dataleyk is the secure, fully-managed cloud data platform for SMBs. Our mission is to make Big Data analytics easy and accessible to all. Dataleyk is the missing link in reaching your data-driven goals. Our platform makes it quick and easy to have a stable, flexible and reliable cloud data lake with near-zero technical knowledge. Bring all of your company data from every single source, explore with SQL and visualize with your favorite BI tool or our advanced built-in graphs. Modernize your data warehousing with Dataleyk. Our state-of-the-art cloud data platform is ready to handle your scalable structured and unstructured data. Data is an asset, Dataleyk is a secure, cloud data platform that encrypts all of your data and offers on-demand data warehousing. Zero maintenance, as an objective, may not be easy to achieve. But as an initiative, it can be a driver for significant delivery improvements and transformational results.
    Starting Price: €0.1 per GB
  • 4
    Dovetail

    Dovetail

    Dovetail Research

    Analyze data, collaborate on insights, and build your research repository. Discover opportunities and become a hero in your team. Discover patterns across a variety of qualitative research methods, unstructured data, and video files. Dovetail is analysis software you’ll love to use. Dovetail is a powerful way to discover patterns across interviews, usability testing, survey responses, and more. Organize tags into a hierarchy with intuitive controls like drag & drop, and extend your project with global tags. Turn qualitative data into quantitative data with highlights, and visualize your work with a variety of beautiful charts. Simply select text and highlight to add tags. Transcribe video recordings, discover patterns across interviews, usability tests, survey responses, and more. Turn qualitative data into quantitative data. Chart, filter, and segment themes across interview notes, transcripts, survey responses, and more.
    Starting Price: $29/user/month
  • 5
    Wolfram Data Science Platform
    Wolfram Data Science Platform lets you use data sources that are structured or unstructured, and static or real-time. Use the power of WDF and the same linguistics as in Wolfram|Alpha to convert unstructured data to structured form, with automated or guided destructuring and disambiguation. Wolfram Data Science Platform uses industry database connection technology to bring database content into its highly flexible internal symbolic representation. Wolfram Data Science Platform can natively read hundreds of data formats, converting them. Wolfram Data Science Platform works with images, text, networks, geometry, sounds, GIS data and much more. Using the breakthrough symbolic data representation in the Wolfram Language, Wolfram Data Science Platform can seamlessly handle both SQL-style and NoSQL data. Wolfram Data Science Platform automatically constructs a sophisticated interactive report, using algorithms to identify interesting features of your data to visualize and highlight.
  • 6
    SAP Data Services
    Maximize the value of all your organization’s structured and unstructured data with exceptional functionalities for data integration, quality, and cleansing. SAP Data Services software improves the quality of data across the enterprise. As part of the information management layer of SAP’s Business Technology Platform, it delivers trusted,relevant, and timely information to drive better business outcomes. Transform your data into a trusted, ever-ready resource for business insight and use it to streamline processes and maximize efficiency. Gain contextual insight and unlock the true value of your data by creating a complete view of your information with access to data of any size and from any source. Improve decision-making and operational efficiency by standardizing and matching data to reduce duplicates, identify relationships, and correct quality issues proactively. Unify critical data on premise, in the cloud, or within Big Data by using intuitive tools.
  • 7
    KlearStack

    KlearStack

    KlearStack

    KlearStack offers template-less, automated invoice processing, and thus removes the drudgery of manual entry from unstructured documents. Our mission is to automate the tedious manual processes and exhausting data entry, so that humans are freed for more intelligent and creative tasks! To help organizations make their unstructured data a competitive advantage by unlocking the useful information from unstructured and free-form semi-structured documents. KlearStack’s artificial intelligence today provides best solutions to automate the following processes that involve unstructured documents: Invoice Automation Purchase Order Automation Receipt Capture Consumer Durable Loans Multi-Vendor Trade Finance Process Automation Two Wheeler Loan Automation Used Cars Loan Process Automation With our proprietary template-less AI/ML technology, you don't need to spend hundreds or thousands of days on designing and maintaining templates anymore! Improve productivity by up-to 200
  • 8
    Tensorlake

    Tensorlake

    Tensorlake

    Tensorlake is the AI data cloud that reliably transforms data from unstructured sources into ingestion-ready formats for AI applications. It seamlessly converts documents, images, and slides into structured JSON or markdown chunks, ready for retrieval and analysis by LLMs. The document ingestion APIs parse any file type, from hand-written notes to PDFs to complex spreadsheets, performing post-processing steps like chunking and preserving the reading order and layout of the documents. Tensorlake's serverless workflows enable lightning-fast, end-to-end data processing, allowing users to build and deploy fully managed Workflow APIs in Python that scale down to zero when idle and scale up when processing data. It supports processing millions of documents at once, maintaining context and relationships between various data formats, and offers secure, role-based access control for effective team collaboration.
    Starting Price: $0.01 per page
  • 9
    i2

    i2

    N. Harris Computer Corporation

    Turn overwhelming and disparate data from multiple sources into actionable intelligence in near-real time to make informed decisions. Quickly find hidden connections and critical patterns buried in internal, external, and open-source data. Experience i2’s world-class intelligence analysis software for yourself. Request an i2 demo and learn how to uncover critical connections and hidden insights faster than ever. Track critical missions across law enforcement, fraud and financial crime, military defense, and national security and intelligence sectors with the i2 intelligence analysis platform. Capture and fuse structured and unstructured data from internal and external sources, including OSINT and dark web data, to provide an expansive data pool to search and discover over. Fuse advanced analytics with sophisticated geospatial, visual, graph, temporal, and social analysis capabilities to give analysts greater situational awareness.
  • 10
    Qubole

    Qubole

    Qubole

    Qubole is a simple, open, and secure Data Lake Platform for machine learning, streaming, and ad-hoc analytics. Our platform provides end-to-end services that reduce the time and effort required to run Data pipelines, Streaming Analytics, and Machine Learning workloads on any cloud. No other platform offers the openness and data workload flexibility of Qubole while lowering cloud data lake costs by over 50 percent. Qubole delivers faster access to petabytes of secure, reliable and trusted datasets of structured and unstructured data for Analytics and Machine Learning. Users conduct ETL, analytics, and AI/ML workloads efficiently in end-to-end fashion across best-of-breed open source engines, multiple formats, libraries, and languages adapted to data volume, variety, SLAs and organizational policies.
  • 11
    BDB Platform

    BDB Platform

    Big Data BizViz

    BDB is a modern data analytics and BI platform which can skillfully dive deep into your data to provide actionable insights. It is deployable on the cloud as well as on-premise. Our exclusive microservices based architecture has the elements of Data Preparation, Predictive, Pipeline and Dashboard designer to provide customized solutions and scalable analytics to different industries. BDB’s strong NLP based search enables the user to unleash the power of data on desktop, tablets and mobile as well. BDB has various ingrained data connectors, and it can connect to multiple commonly used data sources, applications, third party API’s, IoT, social media, etc. in real-time. It lets you connect to RDBMS, Big data, FTP/ SFTP Server, flat files, web services, etc. and manage structured, semi-structured as well as unstructured data. Start your journey to advanced analytics today.
  • 12
    Synomia

    Synomia

    Synomia

    Thanks to AI, transform your semantic data into insights to objectify your strategic decisions and guide your actions. A pioneer in Artificial Intelligence and owner of semantic data processing technologies, Synomia transforms large amounts of unstructured data into insights to enable brands to better objectify their strategies and activation systems. Identify tomorrow's trends based on the massive analysis of strong and weak signals in your market. Find the most impactful angles of attack for your digital strategies. We master all semantic AI technologies, which we activate according to the needs of our customers: supervised or unsupervised machine learning and rule-based systems. Semantic AI makes it possible to analyze a large number of sources and makes it possible to set up methodologies oriented towards discovery and novelty, it is the key to strategies truly aligned with the expectations of its targets.
  • 13
    Acodis

    Acodis

    Acodis

    Intelligent document processing automates the processing of data within documents, contextualizing the document, understanding the information, extracting it, and sending it to the right place. With Acodis, you can do all of this in just a few seconds. The world is full of unstructured data hidden in documents and it will be for a long time to come. That's why we built Acodis so that you can extract data from any document, in any language. Get structured data from any document with machine learning, in seconds. Build and combine document processing workflows with a few clicks, no coding required. Once you capture and automate your document's data, integrate the process into your existing systems. Acodis offers an easy-to-use user interface. This enables your team to automate document-related processes and enables you to make faster decisions based on machine learning. Use the REST client in the programming language that you are using and integrate it with your existing business tools.
  • 14
    Cloudera Data Platform
    Unlock the potential of private and public clouds with the only hybrid data platform for modern data architectures with data anywhere. Cloudera is a hybrid data platform designed for unmatched freedom to choose—any cloud, any analytics, any data. Cloudera delivers faster and easier data management and data analytics for data anywhere, with optimal performance, scalability, and security. With Cloudera you get all the advantages of private cloud and public cloud for faster time to value and increased IT control. Cloudera provides the freedom to securely move data, applications, and users bi-directionally between the data center and multiple data clouds, regardless of where your data lives.
  • 15
    DryvIQ

    DryvIQ

    DryvIQ

    Gain deep and robust insight into your unstructured enterprise data to gauge risk, mitigate threats and vulnerabilities, while enabling better business decisions. Classify, label and organize unstructured data at enterprise scale. Enable rapid, accurate and detailed identification of sensitive and high-risk files and provide deep insight via A.I. Enable continuous visibility across both new and existing unstructured data. Enforce policy, compliance and governance decisions without reliance upon manual input from users. Expose dark data while automatically classifying and organizing sensitive and other content groups at scale—so you can make intelligent decisions on where and how to migrate that data. The platform also enables both simple and advanced file transfers across virtually any cloud service, network file system or legacy ECM platform, at scale.
  • 16
    Adarga

    Adarga

    Adarga

    We are faced with overwhelming volumes of unstructured data, news feeds, reports, presentations, videos, etc. There is a powerful competitive advantage for organizations able to exploit unstructured data, yet only 1% are able to leverage it as a strategic asset. Adarga’s knowledge platform processes unstructured data at a speed simply unachievable by humans alone, presenting it in comprehensible formats. Users can accelerate reporting, analyze complex situations and understand intricate networks with out-of-the-box AI capability that enhances human decision-making. The Adarga knowledge platform transforms productivity and extends human capability by automating time and knowledge-intensive tasks. It uses cutting-edge AI techniques, including natural language processing and network science, to understand and analyze unstructured data at speed, fusing it into a single, secure software platform.
  • 17
    Forcepoint Data Classification
    Forcepoint Data Classification leverages Machine Learning (ML) and Artificial Intelligence (AI) to increase the accuracy of data classification for unstructured data to improve your team’s efficiency, reduce false alerts and better prevent data loss. Insight generated using AI drives an innovative approach to classification so you can accurately and efficiently determine how data should be classified, at scale. Coverage of the broadest range of data types in the industry powers efficiency and streamlines compliance while delivering better protection for organizations’ data. Increase the speed and efficiency of data classification to reduce false positives and spend more time on legitimate data security incidents. Forcepoint enables organizations to discover, classify, monitor, and protect data with a complementary suite of data security products. Gain a panoramic view of unstructured data across your organization.
  • 18
    VoyagerAnalytics

    VoyagerAnalytics

    Voyager Labs

    Every day, an immense amount of publicly available, unstructured data is produced on the open, deep, and dark web. The ability to gain immediate and actionable insights from this vast amount of data is critical for any investigation. VoyagerAnalytics is an AI-based analysis platform, designed to analyze massive amounts of unstructured open, deep, and dark web data, as well as internal data, in order to reveal actionable insights. The platform enables investigators to uncover social whereabouts and hidden connections between entities and focus on the most relevant leads and critical pieces of information from an ocean of unstructured data. Simplify data gathering, analysis and smart visualization that would take months to handle. It presents the most relevant and important information in near real-time, saving resources normally spent retrieving, processing, and analyzing vast amounts of unstructured data.
  • 19
    NovaceneAI

    NovaceneAI

    NovaceneAI

    NovaceneAI offers a platform that automates the transformation of unstructured text data into actionable insights at scale using artificial intelligence. The platform provides data engineers and data scientists with complete control through a flexible RESTful API and a powerful interface, while also offering a user-friendly web-based experience for business analysts. It features theme-based analysis to track theme-specific sentiment, allowing users to extract experience areas from open-ended comments and measure sentiment in context. The platform is designed to reduce the manual effort involved in organizing unstructured data, enabling analysts to focus more on deriving valuable insights. NovaceneAI has been trusted by leading organizations, including KPMG, ArgylePR, Advanced Symbolics, ListedTech, Laval University, and Toronto Metropolitan University, to improve efficiencies and achieve consistent, systematic results.
  • 20
    Unity Catalog

    Unity Catalog

    Databricks

    Databricks Unity Catalog is the industry’s only unified and open governance solution for data and AI, built into the Databricks Data Intelligence Platform. With Unity Catalog, organizations can seamlessly govern both structured and unstructured data in any format, as well as machine learning models, notebooks, dashboards, and files across any cloud or platform. Data scientists, analysts, and engineers can securely discover, access, and collaborate on trusted data and AI assets across platforms, leveraging AI to boost productivity and unlock the full potential of the lakehouse environment. This unified and open approach to governance promotes interoperability and accelerates data and AI initiatives while simplifying regulatory compliance. Easily discover and classify both structured and unstructured data in any format, including machine learning models, notebooks, dashboards, and files across all cloud platforms.
  • 21
    DeepNLP

    DeepNLP

    SparkCognition

    SparkCognition, a leading industrial AI company, has developed a natural language processing solution that automates workflows of unstructured data within organizations so humans can focus on high-value business decisions. The DeepNLP product uses advanced machine learning techniques to automate the retrieval of information, the classification of documents, and content analytics. The DeepNLP product integrates into existing workflows to enable organizations to better respond to changes in their business and quickly get answers to specific queries or analytics that support decision-making.
  • 22
    OpenText Unstructured Data Analytics
    OpenText™ Unstructured Data Analytics products employ AI and machine learning to help organizations uncover and leverage key insights stored deep within their unstructured data, including text, audio, video, and images. Organizations can connect all their data to understand the context and information locked inside high-growth unstructured content—at scale. Discover insights hidden within all types of media with unified text, speech, and video analytics that support more than 1,500 data formats. Use natural language processing, optical character recognition (OCR), and other AI-powered models to understand and track the meaning within unstructured data. Employ the latest innovations in machine learning and deep neural networks to understand written and spoken language in data, revealing greater insights.
  • 23
    Commerce.AI

    Commerce.AI

    Commerce.AI

    Our systems intelligently gather a variety of high quality unstructured data streams across hundreds of sources, in the form of text, voice, images and videos. Our systems clean this data and are trained to extract signals across products, services, attributes, brands, sentiments, customers, markets, and trends. It gets synthesized and contextualized using our proprietary Deep Product Learning ® technology. Use our enterprise-grade integrations to ingest your private data. Assess and benchmark your view of your products and services with the competitive landscape. Our platform delivers powerful AI-driven actions where you need it - dashboard, APIs and integrations - and turn insights into action, across PIMs, CRMs, voice assistants, chatbots, and more.
  • Previous
  • You're on page 1
  • Next