Unstructured Information Article Index for
Unstructured
Website Links For
Data
 

Information About

Unstructured Information




Merrill Lynch estimates that more than 85% of all business information exists as unstructured data.

Data with some form of structure may also be referred to as unstructured data if the structure is not helpful for the desired processing task. For example, an HTML webpage is highly structured, but this structure is often oriented towards formatting, rather than performing more complex tasks with the content of the page.


DEALING WITH UNSTRUCTURED DATA


Data Mining and Text Analytics techniques are different methods used to find patterns in, or otherwise interpret, this information. Common techniques for structuring text usually involve manual Tagging With Metadata or Part-of-speech Tagging for further Text Mining -based structuring. UIMA provides a common framework for processing this information to extract meaning and create structured data about the information.


NOTES