Home >Documents >Text Analytics And Text Mining Best of Text and Data

Text Analytics And Text Mining Best of Text and Data

Date post:02-Jan-2016
Category:
View:40 times
Download:2 times
Share this document with a friend
Description:
Text Analytics And Text Mining Best of Text and Data. Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services http://www.kapsgroup.com. Agenda. Text Analytics Capabilities Text Analytics Applications Text Mining and Text Analytics - PowerPoint PPT Presentation
Transcript:
  • Text Analytics And Text MiningBest of Text and DataTom Reamy Chief Knowledge ArchitectKAPS GroupKnowledge Architecture Professional Serviceshttp://www.kapsgroup.com

  • *AgendaText Analytics CapabilitiesText Analytics ApplicationsText Mining and Text AnalyticsData and Unstructured ContentCase Study Text Mining for Taxonomy DevelopmentConclusion

  • *KAPS Group: GeneralKnowledge Architecture Professional ServicesVirtual Company: Network of consultants 8-10Partners SAS, Smart Logic, Microsoft-FAST, Concept Searching, etc.Consulting, Strategy, Knowledge architecture auditServices:Text Analytics evaluation, development, consulting, customizationKnowledge Representation taxonomy, ontology, PrototypeMetadata standards and implementationKnowledge Management: Collaboration, Expertise, e-learningApplied Theory Faceted taxonomies, complexity theory, natural categories

  • *Introduction to Text AnalyticsText Analytics FeaturesNoun Phrase Extraction Catalogs with variants, rule based dynamicMultiple types, custom classes entities, concepts, eventsFeeds facetsSummarizationCustomizable rules, map to different contentFact ExtractionRelationships of entities people-organizations-activitiesOntologies triples, RDF, etc.Sentiment AnalysisStatistical, rules full categorization set of operators

  • *Introduction to Text AnalyticsText Analytics FeaturesAuto-categorization Training sets Bayesian, Vector spaceTerms literal strings, stemming, dictionary of related termsRules simple position in text (Title, body, url)Semantic Network Predefined relationships, sets of rulesBoolean Full search syntax AND, OR, NOTAdvanced NEAR (#), PARAGRAPH, SENTENCEThis is the most difficult to developBuild on a TaxonomyCombine with Extraction, SentimentFoundation for best text analytics & combination

  • *

  • *

  • *

  • *

  • *

  • *

  • *Varieties of Taxonomy/ Text Analytics SoftwareTaxonomy ManagementSynaptica, SchemaLogicFull PlatformSAS-Teragram, SAP-Inxight, Smart Logic, Data Harmony, Concept Searching, Expert System, IBM, GATE Content Management embedded Embedded SearchFAST, Autonomy, Endeca, Exalead, etc.SpecialtySentiment Analysis , VOC Lexalytics, Attensity / ReportsOntology extraction, plus ontology

  • *Text Analytics ApplicationsPlatform for Multiple ApplicationsContent Aggregation, Duplicate Documents save millions!Business intelligence, Customer IntelligenceSocial Media - sentiment analysis, Voice of the CustomerSocial Hybrid folksonomy / taxonomy / auto-metadataSocial expertise, categorize tweets and blogs, reputationOntology travel assistant, semantic web, etc.eDiscovery, Reputation management, Customer ExperienceExpertise Location, Crowd sourcing Technical support

  • *Text Analytics Applications:Enterprise Search - ElementsText Analytics can solve enterprise searchMultiple Knowledge StructuresFacet orthogonal dimension of metadataTaxonomy - Subject matter / aboutnessSoftware - Search, ECM, auto-categorization, entity extraction, Text Analytics and Text MiningPeople tagging, evaluating tags, fine tune rules and taxonomyRich Search Results context and conversationPlatform for search based applications

  • *

  • *

  • Text Analytics and Text MiningData and Unstructured Content

    80% of content is unstructured adding to semantic web is majorText Analytics content into dataBig Data meets Big ContentReal integration of text and ontologyBeyond hasDescriptionImprove accuracy of extracted entities, facts disambiguationPipeline oil & gas OR research / FordAdd Concepts, not just Things 68% want thisSemantic Web + Text Analytics = real world valueLinked Data + Text Analytics best of both worldsBuild superior foundation elements taxonomies, categorization

    *

  • Text Analytics and Text Mining and Data MiningVaccine Adverse Reaction Combine with Data MiningNew sources of informationNews stories, medical recordsBlogs, social Find new connections, sources of knowledgeVaccine Adverse Effects disease, symptoms, variablesUnstructured text into a data source Some preliminary analysis, content structureFind unknown adverse effects and prevalenceDrug Discovery + search / research 5 year story

    *

  • *Text Analytics ApplicationsExample Vaccine Adverse Effects

  • *Text Analytics ApplicationsExample Vaccine Adverse Effects

  • *Text Analytics ApplicationsExample Vaccine Adverse Effects

  • Text Analytics and Text MiningCase Study Taxonomy Development

    Problem 200,000 new uncategorized documentsOld taxonomy need one that reflects change in corpus Text mining, entity extraction, categorizationBottom Up- terms in documents frequency, date, Clustering suggested categoriesClustering chunking for editorsTime savings only feasible way to scan documentsQuality important terms, co-occurring terms*

  • Text Analytics and Text MiningCase Study Taxonomy Development

    Text into Data: Article, Abstract, Title, Subtitle fields & source of termsAdd Data: PubDate, journalTitle, Taxonomy NodeTerms Map to frequency, date, date ranges, Taxonomy NodeNew Terms, Trends Relevance frequency, Abstract, Title, human judgmentEntity Extraction Authors, Organizations, Products, Categorization build on clusters & taxonomy Combination reports, visualizations, interactive explorations*

  • Case Study Taxonomy Development*

  • *

  • *

  • Case Study Taxonomy Development*

  • Case Study Taxonomy Development*

  • Conclusion

    Text Analytics impact is huge solve information overloadEnterprise Search and Search Based Applications: Save millions and enhance productivityCombination of Text Analytics & Text Mining unlimited range of applicationsMutual Enrichment more data, add structure to unstructuredAdd Ontology = Richer Text Analytics smarter, more usefulText Analytics + Text Mining + Semantic WebMove from theory to new practical applicationsThe best is yet to come!*

  • Questions? Tom Reamy [email protected] GroupKnowledge Architecture Professional Serviceshttp://www.kapsgroup.com

    ***********

Popular Tags:
of 30/30
Text Analytics And Text Mining Best of Text and Data Tom Reamy Chief Knowledge Architect KAPS Group Knowledge Architecture Professional Services http://www.kapsgroup.com
Embed Size (px)
Recommended