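Silicon Valley x Japan / Tech x Business Meetup #12 (2015/04/17)
"An Introduction to Hadoop, the Parallel Distributed Processing Platform, and Where to Use Hadoop, as Told by a Developer"
NTT DATA, Platform Systems Headquarters
System Architecture Technology Division, OSS Professional Services
Akira Ajisaka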
This document discusses the application of PostgreSQL in a large social infrastructure project involving smart meter management. It describes three main missions: (1) loading 10 million datasets within 10 minutes, (2) saving data for 24 months, and (3) stabilizing performance for large scale SELECT statements. Various optimizations are discussed to achieve these missions, including data modeling, performance tuning, reducing data size, and controlling execution plans. The results showed that all three missions were successfully completed by applying PostgreSQL expertise and customizing it for the large-scale requirements of the project.
IBM Tivoli Storage Manager Fastback is a data protection and recovery software solution for Windows and Linux servers. It provides mission-critical application protection, remote office backup consolidation, and the ability to restore any type of data from any location. The presentation discusses Tivoli Storage Manager Fastback's product suite, architecture, features like block-level incremental capture and deduplication, support for virtual environments, and strengths like resource efficiency and instant recovery compared to weaknesses like lack of integration with VM management platforms. A demo and discussion period followed the presentation.
Big Data Developers Moscow Meetup 1 - SQL on Hadoop
This document summarizes a meetup about Big Data and SQL on Hadoop. The meetup included discussions on what Hadoop is, why SQL on Hadoop is useful, what Hive is, and introduced IBM's BigInsights software for running SQL on Hadoop with improved performance over other solutions. Key topics included HDFS file storage, MapReduce processing, Hive tables and metadata storage, and how BigInsights provides a massively parallel SQL engine instead of relying on MapReduce.
This document discusses filesystems, RPC, HDFS, and I/O schedulers. It provides an overview of Linux kernel I/O schedulers and how they optimize disk access. It then discusses the I/O stack in Linux, including the virtual filesystem (VFS) layer. It describes the NFS client-server model using RPC over TCP/IP and how HDFS uses a similar model with its own APIs. Finally, it outlines the write process in HDFS from the client to data nodes.
This document provides an introduction to IBM Tivoli Storage Manager (TSM), covering basic concepts such as backup, restore, archive, retrieval, versioning, and retention policies. It describes the components of TSM including storage pools, the metadata database, TSM server files, and backup/archive client files. The document is intended as a refresher for system administrators and backup administrators already familiar with TSM.