Detailed Course Outline
Module 1: Overview of Cloudera Search
- What is Cloudera Search?
- Helpful Features
- Use Cases
- Basic Architecture
Module 2: Performing Basic Queries
- Executing a Query in the Admin UI
- Basic Syntax
- Techniques for Approximate Matching
- Controlling Output
Module 3: Writing More Powerful Queries
- Relevancy and Filters
- Query Parsers
- Functions
- Geospatial Search
- Faceting
Module 4: Preparing to Index Documents
- Overview of the Indexing Process
- Understanding Morphlines
- Generating Configuration Files
- Schema Design
- Collection Management
Module 5: Batch Indexing HDFS Data with MapReduce
- Overview of the HDFS Batch Indexing Process
- Using the MapReduce Indexing Tool
- Testing and Troubleshooting
Module 6: Near-Real-Time Indexing with Flume
- Overview of the Near-Real-Time Indexing Process
- Introduction to Apache Flume
- How to Perform Near-Real-Time Indexing with Flume
- Testing and Troubleshooting
Module 7: Indexing HBase Data with Lily
- What is Apache HBase?
- Batch Indexing for HBase
- Indexing HBase Tables in Near-Real-Time
Module 8: Indexing Data in Other Languages and Formats
- Field Types and Analyzer Chains
- Word Stemming, Character Mapping, and Language Support
- Schema and Analysis Support in the Admin UI
- Metadata and Content Extraction with Apache Tika
- Indexing Binary File Types with SolrCell
Module 9: Improving Search Quality and Performance
- Delivering Relevant Results
- Helping Users Find Information
- Query Performance and Troubleshooting
Module 10: Building User Interfaces for Search
- Search UI Overview
- Building a User Interface with Hue
- Integrating Search into Custom Applications
Module 11: Considerations for Deployment
- Planning for Deployment
- Determining Hardware Needs
- Security Overview
- Collection Aliasing