SlideShare a Scribd company logo
ENERGY-EFFICIENT QUERY PROCESSING IN WEB SEARCH
ENGINES
ABSTRACT:
Web search engines are composed by thousands of query processing nodes, i.e., servers
dedicated to process user queries. Such many servers consume a significant amount of energy,
mostly accountable to their CPUs, but they are necessary to ensure low latencies, since users
expect sub-second response times (e.g., 500 ms). However, users can hardly notice response
times that are faster than their expectations. Hence, we propose the Predictive Energy Saving
Online Scheduling Algorithm (PESOS ) to select the most appropriate CPU frequency to process
a query on a per-core basis. PESOS aims at process queries by their deadlines, and leverage
high-level scheduling information to reduce the CPU energy consumption of a query processing
node. PESOS bases its decision on query efficiency predictors, estimating the processing volume
and processing time of a query. We experimentally evaluate PESOS upon the TREC
ClueWeb09B collection and the MSN2006 query log. Results show that PESOS can reduce the
CPU energy consumption of a query processing node up to ∼ 48 percent compared to a system
running at maximum CPU core frequency. PESOS outperforms also the best state-of-the-art
competitor with a ∼ 20 percent energy saving, while the competitor requires a fine parameter
tuning and it may incurs in uncontrollable latency violations.
EXISTING SYSTEM
In the past, a large part of a datacenter energy consumption was accounted to inefficiencies in its
cooling and power supply systems. However, Barroso et al. [1] report that modern datacenters
have largely reduced the energy wastage of those infrastructures, leaving little room for further
improvement. On the contrary, opportunities exist to reduce the energy consumption of the
servers hosted in a datacenter. In particular, our work focuses on the CPU power management of
query processing nodes, since the CPUs dominate the energy consumption of physical servers
dedicated to search tasks. In fact, CPUs can use up to 66% of the whole energy consumed by a
query processing node at peak utilization [1]. Modern CPUs usually expose two energy saving
mechanism, namely C-states and P-states. C-states represent CPU cores idle states and they are
typically managed by the operating system. C0 is the operative state in which a CPU core can
perform computing tasks.
DISADVANTAGES
YDS is an offline algorithm to schedule generic computing jobs and cannot be used to schedule
online queries.
YDS requires to know in advance the processing volumes of jobs. Conversely, we do not know
how much work a query will require before its completion.
YDS schedules job using processing speeds (defined as units of work per time unit). The speed
value is continuous and unbounded (i.e., the speed can be indefinitely large). However, the
frequencies available to CPU cores are generally discrete and bounded.
PROPOSED SYSTEM
In this paper we propose the Predictive Energy Saving Online Scheduling algorithm (PESOS),
which considers the tail latency requirement of queries as an explicit parameter. Via the DVFS
technology, PESOS selects the most appropriate CPU frequency to process a query on a per-core
basis, so that the CPU energy consumption is reduced while respecting required tail latency. The
algorithm bases its decision on query efficiency predictors rather than core utilization. Query
efficiency predictors are techniques to estimate the processing time of a query before its
processing. They have been proposed to improve the performance of a search engine, for
instance to take decision about query scheduling or query processing parallelization However, to
the best of our knowledge, query efficiency predictor have not been considered for reducing the
energy consumption of query processing nodes. We build upon the approach described in and
propose
ADVANTAGES
YDS have several issues that make unfeasible to use it in a search engine. In the following, we
discuss:
1) A heuristic based on YDS which works in online scenarios without job preemption,
2) A methodology to estimate the processing volume of a query,
3) An algorithm to translate processing speeds into CPU core frequencies. Eventually, we
introduce and discuss our approach to select the most appropriate CPU core frequency to process
a query in a search engine.
OBJECTIVES
A novel Predictive Energy Saving Online Scheduling (PESOS) algorithm. In the context of Web
search engines, PESOS aims to reduce the CPU energy consumption of a query processing node
while imposing a required tail latency on the query response times. For each query, PESOS
selects the lowest possible CPU core frequency such that the energy consumption is reduced and
the deadlines are respected. PESOS selects the right CPU core frequency exploiting two different
kinds of query efficiency predictors (QEPs). The first QEP estimates the processing volume of
queries. The second QEP estimates the query processing times under different core frequencies,
given the number of postings to score. Since QEPs can be inaccurate,during their training we
recorded the root mean square error (RMSE) of the predictions. In this work, we proposing to
sum the RMSE to the actual predictions to compensate prediction errors.We then defined two
possible configuration for PESOS: time conservative, where prediction correction is enforced,
and energy conservative, where QEPs are left unmodified.
SYSTEM CONFIGURATION:
HARDWARE REQUIREMENTS:
Hardware - Pentium
Speed - 1.1 GHz
RAM - 1GB
Hard Disk - 20 GB
Floppy Drive - 1.44 MB
Key Board - Standard Windows Keyboard
Mouse - Two or Three Button Mouse
Monitor - SVGA
SOFTWARE REQUIREMENTS:
Operating System : Windows
Technology : Java and J2EE
Web Technologies : Html, JavaScript, CSS
IDE : My Eclipse
Web Server : Tomcat
Tool kit : Android Phone
Database : My SQL
Java Version : J2SDK1.5

More Related Content

Similar to Energy-efficient Query Processing in Web Search Engines (20)

PDF
Accurately Simulating Energy Consumption of I/O-intensive Scientific Workflows
Rafael Ferreira da Silva
 
PPT
OK Labs - Virtualization as the Nexus of Multicore Power Management
Open Kernel Labs
 
PDF
Performance and Energy evaluation
GIORGOS STAMELOS
 
PDF
Kernel Recipes 2015: Introduction to Kernel Power Management
Anne Nicolas
 
PDF
22). smlevel energy eff-dynamictaskschedng
Poornima_Rajanna
 
PPTX
Briefing - Dynamic Workers for Scheduling
Bernie Chiu
 
PDF
LCU14-410: How to build an Energy Model for your SoC
Linaro
 
PDF
Tesis-Maestria-Presentacion-SIturriaga
Facultad de Ingeniería, UdelaR
 
PDF
ENERGY-AWARE LOAD BALANCING AND APPLICATION SCALING FOR THE CLOUD ECOSYSTEM
Shakas Technologies
 
PPTX
Energy efficient resource management for high-performance clusters
Xiao Qin
 
PPT
Hairong Qi V Swaminathan
FNian
 
PDF
Energy efficient-resource-allocation-in-distributed-computing-systems
Cemal Ardil
 
PDF
A Review of Different Types of Schedulers Used In Energy Management
IRJET Journal
 
PPTX
Intern presentation nicolechiou_2018_vareximaging_sanitized
NicoleChiou3
 
PDF
Parallel and Distributed Computing Chapter 9
AbdullahMunir32
 
PPTX
Hardware-aware thread scheduling: the case of asymmetric multicore processors
Achille Peternier
 
DOCX
ENERGY-AWARE LOAD BALANCING AND APPLICATION SCALING FOR THE CLOUD ECOSYSTEM
Nexgen Technology
 
DOCX
ENERGY-AWARE LOAD BALANCING AND APPLICATION SCALING FOR THE CLOUD ECOSYSTEM
Nexgen Technology
 
PPTX
Optimizing High Performance Computing Applications for Energy
David Lecomber
 
PDF
Energy Efficient Change Management in a Cloud Computing Environment
IRJET Journal
 
Accurately Simulating Energy Consumption of I/O-intensive Scientific Workflows
Rafael Ferreira da Silva
 
OK Labs - Virtualization as the Nexus of Multicore Power Management
Open Kernel Labs
 
Performance and Energy evaluation
GIORGOS STAMELOS
 
Kernel Recipes 2015: Introduction to Kernel Power Management
Anne Nicolas
 
22). smlevel energy eff-dynamictaskschedng
Poornima_Rajanna
 
Briefing - Dynamic Workers for Scheduling
Bernie Chiu
 
LCU14-410: How to build an Energy Model for your SoC
Linaro
 
Tesis-Maestria-Presentacion-SIturriaga
Facultad de Ingeniería, UdelaR
 
ENERGY-AWARE LOAD BALANCING AND APPLICATION SCALING FOR THE CLOUD ECOSYSTEM
Shakas Technologies
 
Energy efficient resource management for high-performance clusters
Xiao Qin
 
Hairong Qi V Swaminathan
FNian
 
Energy efficient-resource-allocation-in-distributed-computing-systems
Cemal Ardil
 
A Review of Different Types of Schedulers Used In Energy Management
IRJET Journal
 
Intern presentation nicolechiou_2018_vareximaging_sanitized
NicoleChiou3
 
Parallel and Distributed Computing Chapter 9
AbdullahMunir32
 
Hardware-aware thread scheduling: the case of asymmetric multicore processors
Achille Peternier
 
ENERGY-AWARE LOAD BALANCING AND APPLICATION SCALING FOR THE CLOUD ECOSYSTEM
Nexgen Technology
 
ENERGY-AWARE LOAD BALANCING AND APPLICATION SCALING FOR THE CLOUD ECOSYSTEM
Nexgen Technology
 
Optimizing High Performance Computing Applications for Energy
David Lecomber
 
Energy Efficient Change Management in a Cloud Computing Environment
IRJET Journal
 

More from Prasadu Peddi (17)

PDF
Pointers
Prasadu Peddi
 
PDF
String notes
Prasadu Peddi
 
DOCX
B.Com 1year Lab programs
Prasadu Peddi
 
DOCX
COMPUTING SEMANTIC SIMILARITY OF CONCEPTS IN KNOWLEDGE GRAPHS
Prasadu Peddi
 
DOCX
MINING COMPETITORS FROM LARGE UNSTRUCTURED DATASETS
Prasadu Peddi
 
DOCX
GENERATING QUERY FACETS USING KNOWLEDGE BASES
Prasadu Peddi
 
DOCX
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
Prasadu Peddi
 
DOCX
SOCIRANK: IDENTIFYING AND RANKING PREVALENT NEWS TOPICS USING SOCIAL MEDIA FA...
Prasadu Peddi
 
DOCX
QUERY EXPANSION WITH ENRICHED USER PROFILES FOR PERSONALIZED SEARCH UTILIZING...
Prasadu Peddi
 
DOCX
COLLABORATIVE FILTERING-BASED RECOMMENDATION OF ONLINE SOCIAL VOTING
Prasadu Peddi
 
DOCX
DYNAMIC FACET ORDERING FOR FACETED PRODUCT SEARCH ENGINES
Prasadu Peddi
 
PPTX
A Cross Tenant Access Control (CTAC) Model for Cloud Computing: Formal Specif...
Prasadu Peddi
 
PPTX
Time and Attribute Factors Combined Access Control on Time-Sensitive Data in ...
Prasadu Peddi
 
PPTX
Attribute Based Storage Supporting Secure Deduplication of Encrypted D...
Prasadu Peddi
 
PPTX
RAAC: Robust and Auditable Access Control with Multiple Attribute Authorities...
Prasadu Peddi
 
PPTX
Provably Secure Key-Aggregate Cryptosystems with Broadcast Aggregate Keys for...
Prasadu Peddi
 
PPTX
Identity-Based Remote Data Integrity Checking With Perfect Data Privacy Prese...
Prasadu Peddi
 
Pointers
Prasadu Peddi
 
String notes
Prasadu Peddi
 
B.Com 1year Lab programs
Prasadu Peddi
 
COMPUTING SEMANTIC SIMILARITY OF CONCEPTS IN KNOWLEDGE GRAPHS
Prasadu Peddi
 
MINING COMPETITORS FROM LARGE UNSTRUCTURED DATASETS
Prasadu Peddi
 
GENERATING QUERY FACETS USING KNOWLEDGE BASES
Prasadu Peddi
 
UNDERSTAND SHORTTEXTS BY HARVESTING & ANALYZING SEMANTIKNOWLEDGE
Prasadu Peddi
 
SOCIRANK: IDENTIFYING AND RANKING PREVALENT NEWS TOPICS USING SOCIAL MEDIA FA...
Prasadu Peddi
 
QUERY EXPANSION WITH ENRICHED USER PROFILES FOR PERSONALIZED SEARCH UTILIZING...
Prasadu Peddi
 
COLLABORATIVE FILTERING-BASED RECOMMENDATION OF ONLINE SOCIAL VOTING
Prasadu Peddi
 
DYNAMIC FACET ORDERING FOR FACETED PRODUCT SEARCH ENGINES
Prasadu Peddi
 
A Cross Tenant Access Control (CTAC) Model for Cloud Computing: Formal Specif...
Prasadu Peddi
 
Time and Attribute Factors Combined Access Control on Time-Sensitive Data in ...
Prasadu Peddi
 
Attribute Based Storage Supporting Secure Deduplication of Encrypted D...
Prasadu Peddi
 
RAAC: Robust and Auditable Access Control with Multiple Attribute Authorities...
Prasadu Peddi
 
Provably Secure Key-Aggregate Cryptosystems with Broadcast Aggregate Keys for...
Prasadu Peddi
 
Identity-Based Remote Data Integrity Checking With Perfect Data Privacy Prese...
Prasadu Peddi
 
Ad

Recently uploaded (20)

PDF
Digital water marking system project report
Kamal Acharya
 
PPTX
Introduction to File Transfer Protocol with commands in FTP
BeulahS2
 
PPTX
Unit_I Functional Units, Instruction Sets.pptx
logaprakash9
 
PDF
WD2(I)-RFQ-GW-1415_ Shifting and Filling of Sand in the Pond at the WD5 Area_...
ShahadathHossain23
 
PPTX
OCS353 DATA SCIENCE FUNDAMENTALS- Unit 1 Introduction to Data Science
A R SIVANESH M.E., (Ph.D)
 
PPTX
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
PDF
Clustering Algorithms - Kmeans,Min ALgorithm
Sharmila Chidaravalli
 
PDF
A Brief Introduction About Robert Paul Hardee
Robert Paul Hardee
 
PDF
NFPA 10 - Estandar para extintores de incendios portatiles (ed.22 ENG).pdf
Oscar Orozco
 
PPTX
Precooling and Refrigerated storage.pptx
ThongamSunita
 
PDF
NTPC PATRATU Summer internship report.pdf
hemant03701
 
PPTX
Engineering Quiz ShowEngineering Quiz Show
CalvinLabial
 
PPTX
Biosensors, BioDevices, Biomediccal.pptx
AsimovRiyaz
 
PPTX
Introduction to Internal Combustion Engines - Types, Working and Camparison.pptx
UtkarshPatil98
 
PDF
this idjfk sgfdhgdhgdbhgbgrbdrwhrgbbhtgdt
WaleedAziz7
 
PPTX
Distribution reservoir and service storage pptx
dhanashree78
 
PPT
Footbinding.pptmnmkjkjkknmnnjkkkkkkkkkkkkkk
mamadoundiaye42742
 
PDF
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
PPTX
Functions in Python Programming Language
BeulahS2
 
PPTX
Explore USA’s Best Structural And Non Structural Steel Detailing
Silicon Engineering Consultants LLC
 
Digital water marking system project report
Kamal Acharya
 
Introduction to File Transfer Protocol with commands in FTP
BeulahS2
 
Unit_I Functional Units, Instruction Sets.pptx
logaprakash9
 
WD2(I)-RFQ-GW-1415_ Shifting and Filling of Sand in the Pond at the WD5 Area_...
ShahadathHossain23
 
OCS353 DATA SCIENCE FUNDAMENTALS- Unit 1 Introduction to Data Science
A R SIVANESH M.E., (Ph.D)
 
UNIT 1 - INTRODUCTION TO AI and AI tools and basic concept
gokuld13012005
 
Clustering Algorithms - Kmeans,Min ALgorithm
Sharmila Chidaravalli
 
A Brief Introduction About Robert Paul Hardee
Robert Paul Hardee
 
NFPA 10 - Estandar para extintores de incendios portatiles (ed.22 ENG).pdf
Oscar Orozco
 
Precooling and Refrigerated storage.pptx
ThongamSunita
 
NTPC PATRATU Summer internship report.pdf
hemant03701
 
Engineering Quiz ShowEngineering Quiz Show
CalvinLabial
 
Biosensors, BioDevices, Biomediccal.pptx
AsimovRiyaz
 
Introduction to Internal Combustion Engines - Types, Working and Camparison.pptx
UtkarshPatil98
 
this idjfk sgfdhgdhgdbhgbgrbdrwhrgbbhtgdt
WaleedAziz7
 
Distribution reservoir and service storage pptx
dhanashree78
 
Footbinding.pptmnmkjkjkknmnnjkkkkkkkkkkkkkk
mamadoundiaye42742
 
Module - 4 Machine Learning -22ISE62.pdf
Dr. Shivashankar
 
Functions in Python Programming Language
BeulahS2
 
Explore USA’s Best Structural And Non Structural Steel Detailing
Silicon Engineering Consultants LLC
 
Ad

Energy-efficient Query Processing in Web Search Engines

  • 1. ENERGY-EFFICIENT QUERY PROCESSING IN WEB SEARCH ENGINES ABSTRACT: Web search engines are composed by thousands of query processing nodes, i.e., servers dedicated to process user queries. Such many servers consume a significant amount of energy, mostly accountable to their CPUs, but they are necessary to ensure low latencies, since users expect sub-second response times (e.g., 500 ms). However, users can hardly notice response times that are faster than their expectations. Hence, we propose the Predictive Energy Saving Online Scheduling Algorithm (PESOS ) to select the most appropriate CPU frequency to process a query on a per-core basis. PESOS aims at process queries by their deadlines, and leverage high-level scheduling information to reduce the CPU energy consumption of a query processing node. PESOS bases its decision on query efficiency predictors, estimating the processing volume and processing time of a query. We experimentally evaluate PESOS upon the TREC ClueWeb09B collection and the MSN2006 query log. Results show that PESOS can reduce the CPU energy consumption of a query processing node up to ∼ 48 percent compared to a system running at maximum CPU core frequency. PESOS outperforms also the best state-of-the-art competitor with a ∼ 20 percent energy saving, while the competitor requires a fine parameter tuning and it may incurs in uncontrollable latency violations. EXISTING SYSTEM In the past, a large part of a datacenter energy consumption was accounted to inefficiencies in its cooling and power supply systems. However, Barroso et al. [1] report that modern datacenters have largely reduced the energy wastage of those infrastructures, leaving little room for further improvement. On the contrary, opportunities exist to reduce the energy consumption of the servers hosted in a datacenter. In particular, our work focuses on the CPU power management of query processing nodes, since the CPUs dominate the energy consumption of physical servers dedicated to search tasks. In fact, CPUs can use up to 66% of the whole energy consumed by a query processing node at peak utilization [1]. Modern CPUs usually expose two energy saving mechanism, namely C-states and P-states. C-states represent CPU cores idle states and they are
  • 2. typically managed by the operating system. C0 is the operative state in which a CPU core can perform computing tasks. DISADVANTAGES YDS is an offline algorithm to schedule generic computing jobs and cannot be used to schedule online queries. YDS requires to know in advance the processing volumes of jobs. Conversely, we do not know how much work a query will require before its completion. YDS schedules job using processing speeds (defined as units of work per time unit). The speed value is continuous and unbounded (i.e., the speed can be indefinitely large). However, the frequencies available to CPU cores are generally discrete and bounded. PROPOSED SYSTEM In this paper we propose the Predictive Energy Saving Online Scheduling algorithm (PESOS), which considers the tail latency requirement of queries as an explicit parameter. Via the DVFS technology, PESOS selects the most appropriate CPU frequency to process a query on a per-core basis, so that the CPU energy consumption is reduced while respecting required tail latency. The algorithm bases its decision on query efficiency predictors rather than core utilization. Query efficiency predictors are techniques to estimate the processing time of a query before its processing. They have been proposed to improve the performance of a search engine, for instance to take decision about query scheduling or query processing parallelization However, to the best of our knowledge, query efficiency predictor have not been considered for reducing the energy consumption of query processing nodes. We build upon the approach described in and propose ADVANTAGES
  • 3. YDS have several issues that make unfeasible to use it in a search engine. In the following, we discuss: 1) A heuristic based on YDS which works in online scenarios without job preemption, 2) A methodology to estimate the processing volume of a query, 3) An algorithm to translate processing speeds into CPU core frequencies. Eventually, we introduce and discuss our approach to select the most appropriate CPU core frequency to process a query in a search engine. OBJECTIVES A novel Predictive Energy Saving Online Scheduling (PESOS) algorithm. In the context of Web search engines, PESOS aims to reduce the CPU energy consumption of a query processing node while imposing a required tail latency on the query response times. For each query, PESOS selects the lowest possible CPU core frequency such that the energy consumption is reduced and the deadlines are respected. PESOS selects the right CPU core frequency exploiting two different kinds of query efficiency predictors (QEPs). The first QEP estimates the processing volume of queries. The second QEP estimates the query processing times under different core frequencies, given the number of postings to score. Since QEPs can be inaccurate,during their training we recorded the root mean square error (RMSE) of the predictions. In this work, we proposing to sum the RMSE to the actual predictions to compensate prediction errors.We then defined two possible configuration for PESOS: time conservative, where prediction correction is enforced, and energy conservative, where QEPs are left unmodified. SYSTEM CONFIGURATION: HARDWARE REQUIREMENTS: Hardware - Pentium
  • 4. Speed - 1.1 GHz RAM - 1GB Hard Disk - 20 GB Floppy Drive - 1.44 MB Key Board - Standard Windows Keyboard Mouse - Two or Three Button Mouse Monitor - SVGA SOFTWARE REQUIREMENTS: Operating System : Windows Technology : Java and J2EE Web Technologies : Html, JavaScript, CSS IDE : My Eclipse Web Server : Tomcat Tool kit : Android Phone Database : My SQL Java Version : J2SDK1.5