SlideShare a Scribd company logo
Data as a service to enable compliance 
reporting 
Girish Juneja, CTO 
October 7, 2014 
© 22001144 AAllttiissoouurrccee LLaabbss.. AAllll RRiigghhttss RReesseerrvveedd.. Page | 1
Chairman: William C. Erbey 
CEO: William B. Shepro 
Employees: ~8,000 
NASDAQ: ASPS 
Market Cap: 
$2.2 Billion 
(Sept. 15, 2014) 
Performance since August 2009 
Separation from Ocwen® 
CAGR Share Price: 
(Through Sept. 15, 2014) 
47% 
CAGR Service Revenue: 
(Through Sept. 30, 2013) 
39% 
Altisource Overview 
 Separated from Ocwen in August 2009 
 Created and separated RESI and AAMC in 
December 2012 
 Strong free cash flow 
 Strong growth prospects in very large markets 
© 2014 Altisource Labs. All Rights Reserved. Page | 2
Altisource Vision 
Vision 
To be the premier real estate and mortgage marketplace offering both 
content and distribution to the marketplace participants 
Mission 
To offer homeowners, buyers, sellers, agents, mortgage originators and 
servicers trusted and efficient marketplaces to conduct real estate and 
mortgage transactions, and improve outcomes for market participants 
Real Estate Marketplace Mortgage Marketplace 
 Home Sales 
 Home Rentals 
 Home Maintenance 
 Mortgage Originations 
 Mortgage Servicing 
© 2014 Altisource Labs. All Rights Reserved. Page | 3
State of the Business: Servicing 
COMPLEXITY 
- Meeting borrower 
/customer expectations 
- Elevated scrutiny of 
borrower interactions 
- Proliferation of servicer 
products 
- Reporting requirements 
Increased Risk 
Increased costs 
Increased penalties 
and fines 
Decreased customer 
satisfaction 
COMPLIANCE 
- Velocity of new and 
changing rules 
- Magnitude of financial 
exposure 
- Existing technology limits 
compliance capabilities 
CHANGE 
- Lack of end-to-end visibility 
- Rigid and inflexible systems 
- Volume and nature of data 
interoperability between 
data silos 
Compliance 
© 2014 Altisource Labs. All Rights Reserved. Page | 4
Future of Servicing 
For servicers’ businesses to grow, a modern servicing platform must be: 
Flexible and 
Adaptable 
to easily and cost effectively 
respond to evolving market and 
business dynamics 
Scalable and 
Automated 
to enable cost effective business 
growth 
Interoperable 
to seamlessly interface with third 
party apps and other software 
platforms 
Compliance 
Centric 
to meet ever-changing regulatory 
mandates 
Analytical 
to drive continuous improvement 
and manage risk 
© 2014 Altisource Labs. All Rights Reserved. Page | 5
Common Foundational Layer 
Customer Experience 
Menus & Navigation API Management Caching DMZ Gateway 
Identity Mgmt 
Single Sign-on 
Multi-tenant 
Authorization & RBAC 
Compliance & 
Entitlements 
Security 
Framework 
Encryption 
MFA Authentication 
Access Governance 
User Profile 
Rules Mgmt 
Workflows, 
Business & 
Compliance 
Workflow Mgmt 
Messaging 
Notification & 
Subscription 
Rules, 
Messaging, 
Integrations 
Search 
3rd Party 
Integrations 
Metadata 
Management 
Master Data 
Management 
Reporting 
Compliant 
Auditing 
Data 
Management 
Transactional, 
Reporting & 
Analytics 
Data Archival 
Warehousing 
Data as a Service 
Provisioning 
Monitoring/Alerting 
Backup/Restore 
Configuration & 
Customization 
Elastic Performance 
Multi-tenant 
Operational 
Framework 
Availability/DR 
Metering 
High performance scala based framework 
App Provisioning, isolation, multi-tenancy & Life-cycle 
Service Registry 
Multi-tenant Cloud Provider Independent Rapid Deployment PaaS 
Deployment & Test Automation 
Cloud Abstraction layer 
© 2014 Altisource Labs. All Rights Reserved. Page | 6
Environment Overview 
Financial Industry faces: 
– Increasing regulatory requirements 
– Increasing customer compliance requirements 
– Regulatory & customers’ changing requirements need correlating data across sources 
– Risk Analysis requires correlating internal and non-conforming structured external data sets 
– Existing data stores unable to respond rapidly 
Organizations need solutions that: 
– Enable financial institutions to address changing regulations 
– Enable automated compliance monitoring processes and systems 
– Improve internal controls by retaining data lineage to unmodified source datasets 
– Provide actionable & timely information from data 
– Enable on-demand reporting on massive datasets based on schema defined with the 
reporting request 
© 2014 Altisource Labs. All Rights Reserved. Page | 7
The Traditional Enterprise 
Data Warehouse approach 
– Maintain data integrity by storing the organizational data with regulatory 
information in respective data dimensions. 
– Data modeling and the design of facts and dimensions are very critical to 
the success of compliance data warehouse. 
– The regulatory sufficiency needs to be maintained in the regulation 
dimension of the compliance data warehouse. 
– From data warehouse data is transmitted into a regulatory data mart. 
– Transformation of base data elements and regulation rules are maintained 
in the meta data. 
© 2014 Altisource Labs. All Rights Reserved. Page | 8
Metadata, ETL & Core Model 
Metadata 
– Reflects the business and business 
processes 
– In EDW, all functionality is metadata 
driven 
Data definitions 
• Source and Core model from technical 
perspective 
• business perspective 
– Transformations & Aggregations 
• Transformations to derive cleansed Core 
data from source data 
• Aggregations and de-normalizations to 
create Access model data 
Loading Service 
– Reflects the business and business 
processes 
– In EDW, all functionality is metadata 
driven 
Data definitions 
• Source and Core model from technical 
perspective 
• business perspective 
– Transformations & Aggregations 
• Transformations to derive cleansed Core 
data from source data 
• Aggregations and de-normalizations to 
create Access model data 
Access Model 
– Data in the Access model is directly 
traceable to the Core model 
• De-normalized 
• Aggregated 
• Designed for query and access 
performance. 
• End user requirements 
• Access restrictions and controls 
– As a design decision, access model 
objects can either be physical 
structures, or structures materialized 
on access. 
Core Model 
– Control 
• Only approved, tested, and validated 
processes can update data in the Core 
model. 
– Data Model 
• Highly Normalized 
• No redundancy 
• Subject areas, loans, investors.. 
• Not optimized for apps 
• target data formats post-cleansing 
• It is optimized for efficiency and 
correctness. 
© 2014 Altisource Labs. All Rights Reserved. Page | 9
The Change/Update Process 
Each Change involves the following steps: 
– Update the extraction module 
– Update the staging module 
– Update the Transformation module 
– Update the metadata 
– Update the data repository 
This process is tedious, involved, and brittle. 
© 2014 Altisource Labs. All Rights Reserved. Page | 10
Enhancing with Big Data Technologies 
We were driven to adopt big data technology for many reasons: 
– Demand to analyze new data sources in an ever shorter timeframe 
– Growth in data complexity 
– Variety of data types 
– Volume of data and inability to move it around due to time constraints 
– Velocity of data generation, internal and external 
– Veracity of data from multiple sources 
– Growth in analytical complexity 
– Increasing availability of cost-effective computing and data storage 
The big reason for us was the frequent change of requirements due to changing business & 
regulatory changes. 
Spark is a more flexible platform 
© 2014 Altisource Labs. All Rights Reserved. Page | 11
Data as a Service 
© 22001144 AAllttiissoouurrccee LLaabbss.. AAllll RRiigghhttss RReesseerrvveedd.. Page | 12
Data Mobilization View for Data Lake 
© 2014 Altisource Labs. All Rights Reserved. Page | 13
Borrower Data Service 
Request Details: 
Service Name: Borrower Data 
Context: Current/Cleansed & Conformed/History 
Request Filter: <Borrower Name>, <Date Range> 
Response Details: 
© 2014 Altisource Labs. All Rights Reserved. Page | 14
Borrower Data Service 
© 2014 Altisource Labs. All Rights Reserved. Page | 15
Mortgage Data Service 
Request Details: 
Service Name: Mortgage Data 
Context: Current/Cleansed & Conformed/History 
Request Filter: <Loan Number>, <Date Range> 
Response Details: 
© 2014 Altisource Labs. All Rights Reserved. Page | 16
Mortgage Data Service 
© 2014 Altisource Labs. All Rights Reserved. Page | 17
Loan Default Event 
Event Details: 
Event Name: Loan Default 
Context: Current 
© 2014 Altisource Labs. All Rights Reserved. Page | 18
Loan Default Event 
© 2014 Altisource Labs. All Rights Reserved. Page | 19
Data Lake vs. Data Warehouse 
Feature Data Lake Data warehouse 
Data Volume Extremely large 
(Petabytes) 
Large (Terabytes) 
Access Methods NoSQL SQL 
Schema Schema on read Schema on write 
Scalability Scales horizontally Scales vertically 
Hardware Commodity hardware Specialized hardware/ 
appliances 
Data Structure Structured and 
unstructured 
Structured 
Data Raw Cleansed/Aggregated 
© 2014 Altisource Labs. All Rights Reserved. Page | 20
Data Lake Technology Stack 
GraphX 
Services/ 
Application Portals 
Spark (DAG construct and execute engine) 
RDD Instances/Schemas 
3 rd Party 
Drivers 
Analytics Portals 
Data Storage 
Cassandra /HDFS/ 
Parquet 
BI/ETL tools 
ODBC/ 
JDBC 
Spark 
Streaming 
External 
Data Stores 
In-house 
API 
Interactive 
Mlib/ 
SparkR 
Hive/HQL Spark SQL 
In-house 
Drivers 
YARN 
© 2014 Altisource Labs. All Rights Reserved. Page | 21
Benefits of Apache Spark based Data Lake 
- 
– Load data as its stored in the source system - no transformation needed 
– Build structure on it, apply Hive external tables on this raw data 
– Data sets built with our business logic 
– The intermediate and final results saved back to data storages 
– Working data sets saved as Parquet files 
– Distinction between data view and update view 
– When the data file changes in Hadoop or Cassandra, we have to update 
the Hive or Schema RDD’s: then we are done. 
© 2014 Altisource Labs. All Rights Reserved. Page | 22
Data Storage Access Layer 
– Abstract the details of data accessing through contexts/drivers. 
 Hive/HQL 
 Spark Sql 
 Cassandra driver for Spark 
– Unify the data into RDD interfaces. 
 SchemaRDD 
 HadoopRDD 
 CassandraRDD 
© 2014 Altisource Labs. All Rights Reserved. Page | 23
Code Samples - Apply Hive Schema to Raw Data 
Pour data 
Into HDFS 
Create 
Hive 
Schema 
Use HQL 
inside 
Spark 
SQL 
Save 
result in 
Parquet 
format 
RDBMS’s 
Excel Files 
Documents 
External Sources 
Cluster Details: 
16 VM’s 
128 GB Memory 
126 GB Disk 
© 2014 Altisource Labs. All Rights Reserved. Page | 24
The Spark Cluster 
App App Service Service Tool Tool … … … … 
Spark Driver 
Worker Worker Worker Worker Worker Worker … … … … 
Data Data … … … … Data Data Data Data 
Worker Worker Worker Worker Worker Worker … … … … 
Storage Storage Storage Storage Storage Storage … … … … 
© 2014 Altisource Labs. All Rights Reserved. Page | 25
Performance observations 10 18 Rows 
4.5 hrs 
48 minutes 
1 min 
Engineered 
Solutions 
Cores 128 
Memory 2048 Gb 
Disk 12 Tb 
In-memory 
Databases 
Cores 160 
Memory 2048 Gb 
Disk 12 Tb 
Spark Cluster 
VM’s 
Cores 128 
Memory 2048 Gb 
Disk 12 Tb 
© 2014 Altisource Labs. All Rights Reserved. Page | 26
Challenges 
– Open source Apache Spark, while very promising, has to mature 
– Spark production deployment is complicated 
– Security of data is not enterprise class, needs additional layers 
– Tools eco system is still developing – BI Tools still in development 
But.. 
– Done right has a lot of business value 
– We are hiring engineers! 
© 2014 Altisource Labs. All Rights Reserved. Page | 27
Q & A 
© 2014 Altisource Labs. All Rights Reserved. Page | 28
Ad

More Related Content

What's hot (20)

The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Denodo
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesData Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Denodo
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
Denodo
 
Hadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseHadoop: Extending your Data Warehouse
Hadoop: Extending your Data Warehouse
Cloudera, Inc.
 
Deploying a Governed Data Lake
Deploying a Governed Data LakeDeploying a Governed Data Lake
Deploying a Governed Data Lake
WaterlineData
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Denodo
 
Big Data and Data Virtualization
Big Data and Data VirtualizationBig Data and Data Virtualization
Big Data and Data Virtualization
Kenneth Peeples
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
DataWorks Summit/Hadoop Summit
 
DW 101
DW 101DW 101
DW 101
jeffd00
 
Hadoop and Your Data Warehouse
Hadoop and Your Data WarehouseHadoop and Your Data Warehouse
Hadoop and Your Data Warehouse
Caserta
 
Better Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraBetter Together: The New Data Management Orchestra
Better Together: The New Data Management Orchestra
Cloudera, Inc.
 
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
Eric Javier Espino Man
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera Hadoop
David Yahalom
 
Enterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable DigitalEnterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable Digital
sambiswal
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
Jeffrey T. Pollock
 
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Data Con LA
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
 
Why Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoWhy Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by Denodo
Justo Hidalgo
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
Moacyr Passador
 
The Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data HubThe Future of Data Management: The Enterprise Data Hub
The Future of Data Management: The Enterprise Data Hub
Cloudera, Inc.
 
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)Building a Logical Data Fabric using Data Virtualization (ASEAN)
Building a Logical Data Fabric using Data Virtualization (ASEAN)
Denodo
 
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data LakesData Ninja Webinar Series: Realizing the Promise of Data Lakes
Data Ninja Webinar Series: Realizing the Promise of Data Lakes
Denodo
 
Modern Data Management for Federal Modernization
Modern Data Management for Federal ModernizationModern Data Management for Federal Modernization
Modern Data Management for Federal Modernization
Denodo
 
Hadoop: Extending your Data Warehouse
Hadoop: Extending your Data WarehouseHadoop: Extending your Data Warehouse
Hadoop: Extending your Data Warehouse
Cloudera, Inc.
 
Deploying a Governed Data Lake
Deploying a Governed Data LakeDeploying a Governed Data Lake
Deploying a Governed Data Lake
WaterlineData
 
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Analyst View of Data Virtualization: Conversations with Boulder Business Inte...
Denodo
 
Big Data and Data Virtualization
Big Data and Data VirtualizationBig Data and Data Virtualization
Big Data and Data Virtualization
Kenneth Peeples
 
Hadoop and Your Data Warehouse
Hadoop and Your Data WarehouseHadoop and Your Data Warehouse
Hadoop and Your Data Warehouse
Caserta
 
Better Together: The New Data Management Orchestra
Better Together: The New Data Management OrchestraBetter Together: The New Data Management Orchestra
Better Together: The New Data Management Orchestra
Cloudera, Inc.
 
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...White paper   making an-operational_data_store_(ods)_the_center_of_your_data_...
White paper making an-operational_data_store_(ods)_the_center_of_your_data_...
Eric Javier Espino Man
 
A beginners guide to Cloudera Hadoop
A beginners guide to Cloudera HadoopA beginners guide to Cloudera Hadoop
A beginners guide to Cloudera Hadoop
David Yahalom
 
Enterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable DigitalEnterprise Data Lake - Scalable Digital
Enterprise Data Lake - Scalable Digital
sambiswal
 
Flash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lonFlash session -streaming--ses1243-lon
Flash session -streaming--ses1243-lon
Jeffrey T. Pollock
 
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Big Data Day LA 2015 - Data Lake - Re Birth of Enterprise Data Thinking by Ra...
Data Con LA
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
 
Why Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by DenodoWhy Data Virtualization? An Introduction by Denodo
Why Data Virtualization? An Introduction by Denodo
Justo Hidalgo
 
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)How to Quickly and Easily Draw Value  from Big Data Sources_Q3 symposia(Moa)
How to Quickly and Easily Draw Value from Big Data Sources_Q3 symposia(Moa)
Moacyr Passador
 

Viewers also liked (8)

Security & Compliance for Startups
Security & Compliance for StartupsSecurity & Compliance for Startups
Security & Compliance for Startups
Symosis Security (Previously C-Level Security)
 
On Analyzing and Specifying Concerns for Data as a Service
On Analyzing and Specifying Concerns for Data as a ServiceOn Analyzing and Specifying Concerns for Data as a Service
On Analyzing and Specifying Concerns for Data as a Service
Hong-Linh Truong
 
Data Lake vs. Data Warehouse: Which is Right for Healthcare?
Data Lake vs. Data Warehouse: Which is Right for Healthcare?Data Lake vs. Data Warehouse: Which is Right for Healthcare?
Data Lake vs. Data Warehouse: Which is Right for Healthcare?
Health Catalyst
 
UberTest Quick Guide
UberTest Quick GuideUberTest Quick Guide
UberTest Quick Guide
Amira Elsayed Ismail
 
ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017
Sudhir Tonse
 
Stream Computing & Analytics at Uber
Stream Computing & Analytics at UberStream Computing & Analytics at Uber
Stream Computing & Analytics at Uber
Sudhir Tonse
 
Uber Analytics Test
Uber Analytics TestUber Analytics Test
Uber Analytics Test
Coursetake
 
Uber Real Time Data Analytics
Uber Real Time Data AnalyticsUber Real Time Data Analytics
Uber Real Time Data Analytics
Ankur Bansal
 
On Analyzing and Specifying Concerns for Data as a Service
On Analyzing and Specifying Concerns for Data as a ServiceOn Analyzing and Specifying Concerns for Data as a Service
On Analyzing and Specifying Concerns for Data as a Service
Hong-Linh Truong
 
Data Lake vs. Data Warehouse: Which is Right for Healthcare?
Data Lake vs. Data Warehouse: Which is Right for Healthcare?Data Lake vs. Data Warehouse: Which is Right for Healthcare?
Data Lake vs. Data Warehouse: Which is Right for Healthcare?
Health Catalyst
 
ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017
Sudhir Tonse
 
Stream Computing & Analytics at Uber
Stream Computing & Analytics at UberStream Computing & Analytics at Uber
Stream Computing & Analytics at Uber
Sudhir Tonse
 
Uber Analytics Test
Uber Analytics TestUber Analytics Test
Uber Analytics Test
Coursetake
 
Uber Real Time Data Analytics
Uber Real Time Data AnalyticsUber Real Time Data Analytics
Uber Real Time Data Analytics
Ankur Bansal
 
Ad

Similar to Data-As-A-Service to enable compliance reporting (20)

Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bank
Chungsik Yun
 
AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...
AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...
AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...
AppDynamics
 
Fast Data Overview
Fast Data OverviewFast Data Overview
Fast Data Overview
C. Scyphers
 
Best Practices for Monitoring Cloud Networks
Best Practices for Monitoring Cloud NetworksBest Practices for Monitoring Cloud Networks
Best Practices for Monitoring Cloud Networks
ThousandEyes
 
Approach to Data Management v0.2
Approach to Data Management v0.2Approach to Data Management v0.2
Approach to Data Management v0.2
Simon Baig, FCCA, CGEIT, MSc
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DATAVERSITY
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
Contexti
 
Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...
Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...
Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...
Kellton Tech Solutions Ltd
 
Cloud Services Brokerage Demystified
Cloud Services Brokerage DemystifiedCloud Services Brokerage Demystified
Cloud Services Brokerage Demystified
Zach Gardner
 
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Denodo
 
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
SL Corporation
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
jdijcks
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Cloudera, Inc.
 
Oracle communications data model product overview
Oracle communications data model   product overviewOracle communications data model   product overview
Oracle communications data model product overview
GreenHamster
 
First Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring PentahoFirst Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring Pentaho
ArchipelagoIS
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
DATAVERSITY
 
Realign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-CentricityRealign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-Centricity
Bizagi
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
Hortonworks
 
Data Management Strategy
Data Management StrategyData Management Strategy
Data Management Strategy
Nandeep Nagarkar
 
Big Data and Analytics
Big Data and AnalyticsBig Data and Analytics
Big Data and Analytics
Cameron. A. Bradbury
 
Big Data Case study - caixa bank
Big Data Case study - caixa bankBig Data Case study - caixa bank
Big Data Case study - caixa bank
Chungsik Yun
 
AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...
AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...
AppSphere 15 - Mining the World’s Largest Healthcare Data Warehouse while Ens...
AppDynamics
 
Fast Data Overview
Fast Data OverviewFast Data Overview
Fast Data Overview
C. Scyphers
 
Best Practices for Monitoring Cloud Networks
Best Practices for Monitoring Cloud NetworksBest Practices for Monitoring Cloud Networks
Best Practices for Monitoring Cloud Networks
ThousandEyes
 
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DAS Slides: Metadata Management From Technical Architecture & Business Techni...
DATAVERSITY
 
Contexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to ProductionContexti / Oracle - Big Data : From Pilot to Production
Contexti / Oracle - Big Data : From Pilot to Production
Contexti
 
Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...
Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...
Ensure a Successful SAP Hybris Implementation – Part 2: Architecture and Buil...
Kellton Tech Solutions Ltd
 
Cloud Services Brokerage Demystified
Cloud Services Brokerage DemystifiedCloud Services Brokerage Demystified
Cloud Services Brokerage Demystified
Zach Gardner
 
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Data Virtualization Journey: How to Grow from Single Project and to Enterpris...
Denodo
 
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
Redefining End-to-End Monitoring: The Foundation - High-Performance Architect...
SL Corporation
 
Oracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analyticsOracle Big Data Appliance and Big Data SQL for advanced analytics
Oracle Big Data Appliance and Big Data SQL for advanced analytics
jdijcks
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Cloudera, Inc.
 
Oracle communications data model product overview
Oracle communications data model   product overviewOracle communications data model   product overview
Oracle communications data model product overview
GreenHamster
 
First Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring PentahoFirst Friday Forum December 5th Featuring Pentaho
First Friday Forum December 5th Featuring Pentaho
ArchipelagoIS
 
The Shifting Landscape of Data Integration
The Shifting Landscape of Data IntegrationThe Shifting Landscape of Data Integration
The Shifting Landscape of Data Integration
DATAVERSITY
 
Realign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-CentricityRealign Process & Data To Improve Your Customer-Centricity
Realign Process & Data To Improve Your Customer-Centricity
Bizagi
 
CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014CSC - Presentation at Hortonworks Booth - Strata 2014
CSC - Presentation at Hortonworks Booth - Strata 2014
Hortonworks
 
Ad

More from AnalyticsWeek (8)

Understanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big DataUnderstanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big Data
AnalyticsWeek
 
Making sense of unstructured data by turning strings into things
Making sense of unstructured data by turning strings into thingsMaking sense of unstructured data by turning strings into things
Making sense of unstructured data by turning strings into things
AnalyticsWeek
 
Reimagining the role of data in government
Reimagining the role of data in governmentReimagining the role of data in government
Reimagining the role of data in government
AnalyticsWeek
 
The History and Use of R
The History and Use of RThe History and Use of R
The History and Use of R
AnalyticsWeek
 
Advanced Analytics in Hadoop
Advanced Analytics in HadoopAdvanced Analytics in Hadoop
Advanced Analytics in Hadoop
AnalyticsWeek
 
Rethinking classical approaches to analysis and predictive modeling
Rethinking classical approaches to analysis and predictive modelingRethinking classical approaches to analysis and predictive modeling
Rethinking classical approaches to analysis and predictive modeling
AnalyticsWeek
 
Using Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigDataUsing Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigData
AnalyticsWeek
 
Big Data Introduction to D3
Big Data Introduction to D3Big Data Introduction to D3
Big Data Introduction to D3
AnalyticsWeek
 
Understanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big DataUnderstanding Customer Buying Journey with Big Data
Understanding Customer Buying Journey with Big Data
AnalyticsWeek
 
Making sense of unstructured data by turning strings into things
Making sense of unstructured data by turning strings into thingsMaking sense of unstructured data by turning strings into things
Making sense of unstructured data by turning strings into things
AnalyticsWeek
 
Reimagining the role of data in government
Reimagining the role of data in governmentReimagining the role of data in government
Reimagining the role of data in government
AnalyticsWeek
 
The History and Use of R
The History and Use of RThe History and Use of R
The History and Use of R
AnalyticsWeek
 
Advanced Analytics in Hadoop
Advanced Analytics in HadoopAdvanced Analytics in Hadoop
Advanced Analytics in Hadoop
AnalyticsWeek
 
Rethinking classical approaches to analysis and predictive modeling
Rethinking classical approaches to analysis and predictive modelingRethinking classical approaches to analysis and predictive modeling
Rethinking classical approaches to analysis and predictive modeling
AnalyticsWeek
 
Using Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigDataUsing Topological Data Analysis on your BigData
Using Topological Data Analysis on your BigData
AnalyticsWeek
 
Big Data Introduction to D3
Big Data Introduction to D3Big Data Introduction to D3
Big Data Introduction to D3
AnalyticsWeek
 

Recently uploaded (20)

The Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of HatsThe Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of Hats
nimrabilal030
 
NewBase 28 April 2025 Energy News issue - 1783 by Khaled Al Awadi_compressed...
NewBase 28 April 2025  Energy News issue - 1783 by Khaled Al Awadi_compressed...NewBase 28 April 2025  Energy News issue - 1783 by Khaled Al Awadi_compressed...
NewBase 28 April 2025 Energy News issue - 1783 by Khaled Al Awadi_compressed...
Khaled Al Awadi
 
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
ThiNgc22
 
Treis & Friends One sheet - Portfolio IV
Treis & Friends One sheet - Portfolio IVTreis & Friends One sheet - Portfolio IV
Treis & Friends One sheet - Portfolio IV
aparicioregina7
 
Alan Stalcup - The Enterprising CEO
Alan  Stalcup  -  The  Enterprising  CEOAlan  Stalcup  -  The  Enterprising  CEO
Alan Stalcup - The Enterprising CEO
Alan Stalcup
 
India Advertising Market Size & Growth | Industry Trends
India Advertising Market Size & Growth | Industry TrendsIndia Advertising Market Size & Growth | Industry Trends
India Advertising Market Size & Growth | Industry Trends
Aman Bansal
 
Alec Lawler - A Passion For Building Brand Awareness
Alec Lawler - A Passion For Building Brand AwarenessAlec Lawler - A Passion For Building Brand Awareness
Alec Lawler - A Passion For Building Brand Awareness
Alec Lawler
 
From Sunlight to Savings The Rise of Homegrown Solar Power.pdf
From Sunlight to Savings The Rise of Homegrown Solar Power.pdfFrom Sunlight to Savings The Rise of Homegrown Solar Power.pdf
From Sunlight to Savings The Rise of Homegrown Solar Power.pdf
Insolation Energy
 
TMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptxTMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptx
Marketing847413
 
20250428 CDB Investor Deck_Apr25_vFF.pdf
20250428 CDB Investor Deck_Apr25_vFF.pdf20250428 CDB Investor Deck_Apr25_vFF.pdf
20250428 CDB Investor Deck_Apr25_vFF.pdf
yihong30
 
Top 5 Mistakes to Avoid When Writing a Job Application
Top 5 Mistakes to Avoid When Writing a Job ApplicationTop 5 Mistakes to Avoid When Writing a Job Application
Top 5 Mistakes to Avoid When Writing a Job Application
Red Tape Busters
 
intra-mart Accel series 2025 Spring updates-en.ppt
intra-mart Accel series 2025 Spring updates-en.pptintra-mart Accel series 2025 Spring updates-en.ppt
intra-mart Accel series 2025 Spring updates-en.ppt
NTTDATA INTRAMART
 
www.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptxwww.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptx
Davinder Singh
 
EquariusAI analytics for business water risk
EquariusAI analytics for business water riskEquariusAI analytics for business water risk
EquariusAI analytics for business water risk
Peter Adriaens
 
Solaris Resources Presentation - Corporate April 2025.pdf
Solaris Resources Presentation - Corporate April 2025.pdfSolaris Resources Presentation - Corporate April 2025.pdf
Solaris Resources Presentation - Corporate April 2025.pdf
pchambers2
 
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
QX Accounting Services Ltd
 
From Dreams to Threads: The Story Behind The Chhapai
From Dreams to Threads: The Story Behind The ChhapaiFrom Dreams to Threads: The Story Behind The Chhapai
From Dreams to Threads: The Story Behind The Chhapai
The Chhapai
 
waterBeta white paper - 250202- two-column.docx
waterBeta white paper - 250202- two-column.docxwaterBeta white paper - 250202- two-column.docx
waterBeta white paper - 250202- two-column.docx
Peter Adriaens
 
Kiran Flemish - A Dynamic Musician
Kiran  Flemish  -  A   Dynamic  MusicianKiran  Flemish  -  A   Dynamic  Musician
Kiran Flemish - A Dynamic Musician
Kiran Flemish
 
Harnessing Hyper-Localisation: A New Era in Retail Strategy
Harnessing Hyper-Localisation: A New Era in Retail StrategyHarnessing Hyper-Localisation: A New Era in Retail Strategy
Harnessing Hyper-Localisation: A New Era in Retail Strategy
RUPAL AGARWAL
 
The Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of HatsThe Fascinating World of Hats: A Brief History of Hats
The Fascinating World of Hats: A Brief History of Hats
nimrabilal030
 
NewBase 28 April 2025 Energy News issue - 1783 by Khaled Al Awadi_compressed...
NewBase 28 April 2025  Energy News issue - 1783 by Khaled Al Awadi_compressed...NewBase 28 April 2025  Energy News issue - 1783 by Khaled Al Awadi_compressed...
NewBase 28 April 2025 Energy News issue - 1783 by Khaled Al Awadi_compressed...
Khaled Al Awadi
 
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
2_English_Vocabulary_In_Use_Pre-Intermediate_Cambridge_-_Fourth_Edition (1).pdf
ThiNgc22
 
Treis & Friends One sheet - Portfolio IV
Treis & Friends One sheet - Portfolio IVTreis & Friends One sheet - Portfolio IV
Treis & Friends One sheet - Portfolio IV
aparicioregina7
 
Alan Stalcup - The Enterprising CEO
Alan  Stalcup  -  The  Enterprising  CEOAlan  Stalcup  -  The  Enterprising  CEO
Alan Stalcup - The Enterprising CEO
Alan Stalcup
 
India Advertising Market Size & Growth | Industry Trends
India Advertising Market Size & Growth | Industry TrendsIndia Advertising Market Size & Growth | Industry Trends
India Advertising Market Size & Growth | Industry Trends
Aman Bansal
 
Alec Lawler - A Passion For Building Brand Awareness
Alec Lawler - A Passion For Building Brand AwarenessAlec Lawler - A Passion For Building Brand Awareness
Alec Lawler - A Passion For Building Brand Awareness
Alec Lawler
 
From Sunlight to Savings The Rise of Homegrown Solar Power.pdf
From Sunlight to Savings The Rise of Homegrown Solar Power.pdfFrom Sunlight to Savings The Rise of Homegrown Solar Power.pdf
From Sunlight to Savings The Rise of Homegrown Solar Power.pdf
Insolation Energy
 
TMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptxTMG - Q3 2025 Earnings Call Slides - v4.pptx
TMG - Q3 2025 Earnings Call Slides - v4.pptx
Marketing847413
 
20250428 CDB Investor Deck_Apr25_vFF.pdf
20250428 CDB Investor Deck_Apr25_vFF.pdf20250428 CDB Investor Deck_Apr25_vFF.pdf
20250428 CDB Investor Deck_Apr25_vFF.pdf
yihong30
 
Top 5 Mistakes to Avoid When Writing a Job Application
Top 5 Mistakes to Avoid When Writing a Job ApplicationTop 5 Mistakes to Avoid When Writing a Job Application
Top 5 Mistakes to Avoid When Writing a Job Application
Red Tape Busters
 
intra-mart Accel series 2025 Spring updates-en.ppt
intra-mart Accel series 2025 Spring updates-en.pptintra-mart Accel series 2025 Spring updates-en.ppt
intra-mart Accel series 2025 Spring updates-en.ppt
NTTDATA INTRAMART
 
www.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptxwww.visualmedia.com digital markiting (1).pptx
www.visualmedia.com digital markiting (1).pptx
Davinder Singh
 
EquariusAI analytics for business water risk
EquariusAI analytics for business water riskEquariusAI analytics for business water risk
EquariusAI analytics for business water risk
Peter Adriaens
 
Solaris Resources Presentation - Corporate April 2025.pdf
Solaris Resources Presentation - Corporate April 2025.pdfSolaris Resources Presentation - Corporate April 2025.pdf
Solaris Resources Presentation - Corporate April 2025.pdf
pchambers2
 
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
The Rise of Payroll Outsourcing in the UK: Key Statistics for 2025
QX Accounting Services Ltd
 
From Dreams to Threads: The Story Behind The Chhapai
From Dreams to Threads: The Story Behind The ChhapaiFrom Dreams to Threads: The Story Behind The Chhapai
From Dreams to Threads: The Story Behind The Chhapai
The Chhapai
 
waterBeta white paper - 250202- two-column.docx
waterBeta white paper - 250202- two-column.docxwaterBeta white paper - 250202- two-column.docx
waterBeta white paper - 250202- two-column.docx
Peter Adriaens
 
Kiran Flemish - A Dynamic Musician
Kiran  Flemish  -  A   Dynamic  MusicianKiran  Flemish  -  A   Dynamic  Musician
Kiran Flemish - A Dynamic Musician
Kiran Flemish
 
Harnessing Hyper-Localisation: A New Era in Retail Strategy
Harnessing Hyper-Localisation: A New Era in Retail StrategyHarnessing Hyper-Localisation: A New Era in Retail Strategy
Harnessing Hyper-Localisation: A New Era in Retail Strategy
RUPAL AGARWAL
 

Data-As-A-Service to enable compliance reporting

  • 1. Data as a service to enable compliance reporting Girish Juneja, CTO October 7, 2014 © 22001144 AAllttiissoouurrccee LLaabbss.. AAllll RRiigghhttss RReesseerrvveedd.. Page | 1
  • 2. Chairman: William C. Erbey CEO: William B. Shepro Employees: ~8,000 NASDAQ: ASPS Market Cap: $2.2 Billion (Sept. 15, 2014) Performance since August 2009 Separation from Ocwen® CAGR Share Price: (Through Sept. 15, 2014) 47% CAGR Service Revenue: (Through Sept. 30, 2013) 39% Altisource Overview  Separated from Ocwen in August 2009  Created and separated RESI and AAMC in December 2012  Strong free cash flow  Strong growth prospects in very large markets © 2014 Altisource Labs. All Rights Reserved. Page | 2
  • 3. Altisource Vision Vision To be the premier real estate and mortgage marketplace offering both content and distribution to the marketplace participants Mission To offer homeowners, buyers, sellers, agents, mortgage originators and servicers trusted and efficient marketplaces to conduct real estate and mortgage transactions, and improve outcomes for market participants Real Estate Marketplace Mortgage Marketplace  Home Sales  Home Rentals  Home Maintenance  Mortgage Originations  Mortgage Servicing © 2014 Altisource Labs. All Rights Reserved. Page | 3
  • 4. State of the Business: Servicing COMPLEXITY - Meeting borrower /customer expectations - Elevated scrutiny of borrower interactions - Proliferation of servicer products - Reporting requirements Increased Risk Increased costs Increased penalties and fines Decreased customer satisfaction COMPLIANCE - Velocity of new and changing rules - Magnitude of financial exposure - Existing technology limits compliance capabilities CHANGE - Lack of end-to-end visibility - Rigid and inflexible systems - Volume and nature of data interoperability between data silos Compliance © 2014 Altisource Labs. All Rights Reserved. Page | 4
  • 5. Future of Servicing For servicers’ businesses to grow, a modern servicing platform must be: Flexible and Adaptable to easily and cost effectively respond to evolving market and business dynamics Scalable and Automated to enable cost effective business growth Interoperable to seamlessly interface with third party apps and other software platforms Compliance Centric to meet ever-changing regulatory mandates Analytical to drive continuous improvement and manage risk © 2014 Altisource Labs. All Rights Reserved. Page | 5
  • 6. Common Foundational Layer Customer Experience Menus & Navigation API Management Caching DMZ Gateway Identity Mgmt Single Sign-on Multi-tenant Authorization & RBAC Compliance & Entitlements Security Framework Encryption MFA Authentication Access Governance User Profile Rules Mgmt Workflows, Business & Compliance Workflow Mgmt Messaging Notification & Subscription Rules, Messaging, Integrations Search 3rd Party Integrations Metadata Management Master Data Management Reporting Compliant Auditing Data Management Transactional, Reporting & Analytics Data Archival Warehousing Data as a Service Provisioning Monitoring/Alerting Backup/Restore Configuration & Customization Elastic Performance Multi-tenant Operational Framework Availability/DR Metering High performance scala based framework App Provisioning, isolation, multi-tenancy & Life-cycle Service Registry Multi-tenant Cloud Provider Independent Rapid Deployment PaaS Deployment & Test Automation Cloud Abstraction layer © 2014 Altisource Labs. All Rights Reserved. Page | 6
  • 7. Environment Overview Financial Industry faces: – Increasing regulatory requirements – Increasing customer compliance requirements – Regulatory & customers’ changing requirements need correlating data across sources – Risk Analysis requires correlating internal and non-conforming structured external data sets – Existing data stores unable to respond rapidly Organizations need solutions that: – Enable financial institutions to address changing regulations – Enable automated compliance monitoring processes and systems – Improve internal controls by retaining data lineage to unmodified source datasets – Provide actionable & timely information from data – Enable on-demand reporting on massive datasets based on schema defined with the reporting request © 2014 Altisource Labs. All Rights Reserved. Page | 7
  • 8. The Traditional Enterprise Data Warehouse approach – Maintain data integrity by storing the organizational data with regulatory information in respective data dimensions. – Data modeling and the design of facts and dimensions are very critical to the success of compliance data warehouse. – The regulatory sufficiency needs to be maintained in the regulation dimension of the compliance data warehouse. – From data warehouse data is transmitted into a regulatory data mart. – Transformation of base data elements and regulation rules are maintained in the meta data. © 2014 Altisource Labs. All Rights Reserved. Page | 8
  • 9. Metadata, ETL & Core Model Metadata – Reflects the business and business processes – In EDW, all functionality is metadata driven Data definitions • Source and Core model from technical perspective • business perspective – Transformations & Aggregations • Transformations to derive cleansed Core data from source data • Aggregations and de-normalizations to create Access model data Loading Service – Reflects the business and business processes – In EDW, all functionality is metadata driven Data definitions • Source and Core model from technical perspective • business perspective – Transformations & Aggregations • Transformations to derive cleansed Core data from source data • Aggregations and de-normalizations to create Access model data Access Model – Data in the Access model is directly traceable to the Core model • De-normalized • Aggregated • Designed for query and access performance. • End user requirements • Access restrictions and controls – As a design decision, access model objects can either be physical structures, or structures materialized on access. Core Model – Control • Only approved, tested, and validated processes can update data in the Core model. – Data Model • Highly Normalized • No redundancy • Subject areas, loans, investors.. • Not optimized for apps • target data formats post-cleansing • It is optimized for efficiency and correctness. © 2014 Altisource Labs. All Rights Reserved. Page | 9
  • 10. The Change/Update Process Each Change involves the following steps: – Update the extraction module – Update the staging module – Update the Transformation module – Update the metadata – Update the data repository This process is tedious, involved, and brittle. © 2014 Altisource Labs. All Rights Reserved. Page | 10
  • 11. Enhancing with Big Data Technologies We were driven to adopt big data technology for many reasons: – Demand to analyze new data sources in an ever shorter timeframe – Growth in data complexity – Variety of data types – Volume of data and inability to move it around due to time constraints – Velocity of data generation, internal and external – Veracity of data from multiple sources – Growth in analytical complexity – Increasing availability of cost-effective computing and data storage The big reason for us was the frequent change of requirements due to changing business & regulatory changes. Spark is a more flexible platform © 2014 Altisource Labs. All Rights Reserved. Page | 11
  • 12. Data as a Service © 22001144 AAllttiissoouurrccee LLaabbss.. AAllll RRiigghhttss RReesseerrvveedd.. Page | 12
  • 13. Data Mobilization View for Data Lake © 2014 Altisource Labs. All Rights Reserved. Page | 13
  • 14. Borrower Data Service Request Details: Service Name: Borrower Data Context: Current/Cleansed & Conformed/History Request Filter: <Borrower Name>, <Date Range> Response Details: © 2014 Altisource Labs. All Rights Reserved. Page | 14
  • 15. Borrower Data Service © 2014 Altisource Labs. All Rights Reserved. Page | 15
  • 16. Mortgage Data Service Request Details: Service Name: Mortgage Data Context: Current/Cleansed & Conformed/History Request Filter: <Loan Number>, <Date Range> Response Details: © 2014 Altisource Labs. All Rights Reserved. Page | 16
  • 17. Mortgage Data Service © 2014 Altisource Labs. All Rights Reserved. Page | 17
  • 18. Loan Default Event Event Details: Event Name: Loan Default Context: Current © 2014 Altisource Labs. All Rights Reserved. Page | 18
  • 19. Loan Default Event © 2014 Altisource Labs. All Rights Reserved. Page | 19
  • 20. Data Lake vs. Data Warehouse Feature Data Lake Data warehouse Data Volume Extremely large (Petabytes) Large (Terabytes) Access Methods NoSQL SQL Schema Schema on read Schema on write Scalability Scales horizontally Scales vertically Hardware Commodity hardware Specialized hardware/ appliances Data Structure Structured and unstructured Structured Data Raw Cleansed/Aggregated © 2014 Altisource Labs. All Rights Reserved. Page | 20
  • 21. Data Lake Technology Stack GraphX Services/ Application Portals Spark (DAG construct and execute engine) RDD Instances/Schemas 3 rd Party Drivers Analytics Portals Data Storage Cassandra /HDFS/ Parquet BI/ETL tools ODBC/ JDBC Spark Streaming External Data Stores In-house API Interactive Mlib/ SparkR Hive/HQL Spark SQL In-house Drivers YARN © 2014 Altisource Labs. All Rights Reserved. Page | 21
  • 22. Benefits of Apache Spark based Data Lake - – Load data as its stored in the source system - no transformation needed – Build structure on it, apply Hive external tables on this raw data – Data sets built with our business logic – The intermediate and final results saved back to data storages – Working data sets saved as Parquet files – Distinction between data view and update view – When the data file changes in Hadoop or Cassandra, we have to update the Hive or Schema RDD’s: then we are done. © 2014 Altisource Labs. All Rights Reserved. Page | 22
  • 23. Data Storage Access Layer – Abstract the details of data accessing through contexts/drivers.  Hive/HQL  Spark Sql  Cassandra driver for Spark – Unify the data into RDD interfaces.  SchemaRDD  HadoopRDD  CassandraRDD © 2014 Altisource Labs. All Rights Reserved. Page | 23
  • 24. Code Samples - Apply Hive Schema to Raw Data Pour data Into HDFS Create Hive Schema Use HQL inside Spark SQL Save result in Parquet format RDBMS’s Excel Files Documents External Sources Cluster Details: 16 VM’s 128 GB Memory 126 GB Disk © 2014 Altisource Labs. All Rights Reserved. Page | 24
  • 25. The Spark Cluster App App Service Service Tool Tool … … … … Spark Driver Worker Worker Worker Worker Worker Worker … … … … Data Data … … … … Data Data Data Data Worker Worker Worker Worker Worker Worker … … … … Storage Storage Storage Storage Storage Storage … … … … © 2014 Altisource Labs. All Rights Reserved. Page | 25
  • 26. Performance observations 10 18 Rows 4.5 hrs 48 minutes 1 min Engineered Solutions Cores 128 Memory 2048 Gb Disk 12 Tb In-memory Databases Cores 160 Memory 2048 Gb Disk 12 Tb Spark Cluster VM’s Cores 128 Memory 2048 Gb Disk 12 Tb © 2014 Altisource Labs. All Rights Reserved. Page | 26
  • 27. Challenges – Open source Apache Spark, while very promising, has to mature – Spark production deployment is complicated – Security of data is not enterprise class, needs additional layers – Tools eco system is still developing – BI Tools still in development But.. – Done right has a lot of business value – We are hiring engineers! © 2014 Altisource Labs. All Rights Reserved. Page | 27
  • 28. Q & A © 2014 Altisource Labs. All Rights Reserved. Page | 28

Editor's Notes

  • #3: Thank you and good morning Altisource separated from Ocwen in August 2009 As you can see from slide 7, our market capitalization has grown to $2.6 billion since separation Ocwen is the largest independent residential mortgage servicer in the U.S. Altisource is a provider of services to Ocwen Altisource has fundamentally different investment characteristics from Ocwen Altisource is capital light. Because of our high margins, we are very unique in that the faster we grow our revenue, the faster our net free cash flow grows. Even with our strong cash flow, we are seeing such significant opportunities that we are turning to the senior secured term loan market. We view senior secured term loans as a short to medium term capital extender to take advantage of attractive opportunities. Given our strong cash generating capability, we hope that your greatest complaint will be that we paid the loan back too quickly.