Part II: Applications of GAs: GA and the Internet; Genetic Search Based on Multiple Mutation Approaches

This document discusses using genetic algorithms for intelligent internet search. It describes a system designed at a university that uses genetic algorithms with phases including input, spidering, agent, generator, topic, space, and time to iteratively evolve search results. The system begins with an input set, spiders links to generate the first generation, evaluates fitness, and performs crossover, mutation and reproduction over multiple iterations to obtain satisfactory results. The document also discusses applications of genetic algorithms, innovations needed at different levels, and simulation results showing combined topic, spatial and temporal mutation improves search quality.

Uploaded by

Srikar Chintala


Part II: Applications of GAs

GA and the Internet

Genetic search based on multiple mutation approaches

GAs are useful and efficient when

- The search space is large, complex, or poorly understood
- Domain knowledge is scarce, or expert knowledge is difficult to encode to narrow the search space
- No mathematical analysis is available
- Traditional search methods fail

GAs are used both for problem solving and for modeling.

Applications
GAs are applied to many scientific and engineering problems, as well as in business and entertainment, including:
1. Optimization: GAs are used in a wide variety of optimization tasks, including numerical optimization and combinatorial problems such as the Traveling Salesman Problem and job scheduling, as well as video and sound quality optimization.
2. Automatic programming: GAs are used to evolve or generate computer programs for specific tasks automatically.
3. Machine and robot learning
4. Models of social systems
5. Interactions between evolution and learning

Some Applications of GAs

Control systems design

Software guided circuit design

Optimization

Internet search

Path finding

Mobile robots

Data mining

Trend spotting

Stock price prediction

Algorithm Phases
1. Process the set of URLs given by the user
2. Select all links from the input set
3. Evaluate the fitness function for all genomes
4. Perform crossover, mutation, and reproduction
5. Satisfactory solution obtained? If not, iterate again; if so, the end.
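
The phases listed here can be sketched as a loop. This is a minimal illustration under stated assumptions, not the code of the system described in these slides: a genome is taken to be a URL, the `links_of`, `fitness`, and `mutate` callables are hypothetical stand-ins supplied by the caller, following the links of surviving pages stands in for crossover, and the `target`, `population_size`, and `max_generations` parameters are invented for the example.

```python
from typing import Callable, Iterable


def genetic_search(
    input_urls: Iterable[str],
    links_of: Callable[[str], list[str]],  # hypothetical: outgoing links of a page
    fitness: Callable[[str], float],       # hypothetical: score of a page, higher is better
    mutate: Callable[[], list[str]],       # hypothetical: URLs injected by mutation
    target: float = 0.8,
    population_size: int = 4,
    max_generations: int = 10,
) -> list[str]:
    # Phases 1-2: process the user's URLs and select all links from them.
    population = {link for url in input_urls for link in links_of(url)}
    for _ in range(max_generations):
        # Phase 3: evaluate fitness for every genome (URL) and keep the fittest.
        ranked = sorted(population, key=lambda u: (-fitness(u), u))
        survivors = ranked[:population_size]
        # Stop as soon as the best genome is judged satisfactory.
        if survivors and fitness(survivors[0]) >= target:
            return survivors
        # Phase 4: reproduction keeps the survivors, following their links
        # stands in for crossover, and mutation injects outside URLs.
        offspring = {link for url in survivors for link in links_of(url)}
        population = set(survivors) | offspring | set(mutate())
    return sorted(population, key=lambda u: (-fitness(u), u))[:population_size]
```

Passing the fetching and scoring steps in as callables keeps the loop itself independent of any network access, so it can be exercised against a small in-memory link graph.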

Introduction

GAs can be used for intelligent Internet search. A GA is used when the search space is relatively large. A GA is an adaptive search method. A GA is a heuristic search method.

System for GA Internet Search

Designed at the Faculty of Electrical Engineering, University of Belgrade

[Diagram: the control program together with the Generator, Agent, Spider, Topic, Space, and Time modules, the input set, current set, and output set, and the topdata and netdata databases.]

Spider

Spider is a software package that picks up Internet documents starting from user-supplied input, to a depth specified by the user. Spider takes one URL and fetches all links, and the documents they contain, down to the predefined depth. The fetched documents are stored on the local hard disk with the same structure as at the original location. Spider's task is to produce the first generation. Spider is also used during crossover and mutation.
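
The depth-limited fetching described for Spider can be sketched as a breadth-first traversal. This is only an illustration: the `fetch_links` callable is a hypothetical stand-in for downloading a page and extracting its links, and the sketch returns hop counts instead of writing documents to disk.

```python
from collections import deque
from typing import Callable


def spider(start_url: str, depth: int,
           fetch_links: Callable[[str], list[str]]) -> dict[str, int]:
    """Map every URL reachable within `depth` link hops to the hop count
    at which it was first seen."""
    seen = {start_url: 0}
    queue = deque([start_url])
    while queue:
        url = queue.popleft()
        if seen[url] == depth:
            continue  # do not follow links beyond the requested depth
        for link in fetch_links(url):
            if link not in seen:
                seen[link] = seen[url] + 1
                queue.append(link)
    return seen
```

With `depth=1` this reproduces the way Agent is said to call Spider: the start page plus the pages it links to directly.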

Agent

Agent takes a set of URLs as input and calls Spider for each of them, with depth 1. Agent then extracts keywords from each document and stores them on the local hard disk.
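
The slides do not say how Agent extracts keywords. As one plausible sketch, assuming simple term-frequency ranking with a small stop-word list (both are assumptions made for this example, as is the `extract_keywords` name):

```python
import re
from collections import Counter

# Illustrative stop-word list, deliberately short and not exhaustive.
STOP_WORDS = {"the", "a", "an", "and", "of", "to", "in", "is", "for", "on"}


def extract_keywords(text: str, top_n: int = 5) -> list[str]:
    """Return the `top_n` most frequent non-stop-words in `text`."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in STOP_WORDS and len(w) > 2)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:top_n]]
```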

Generator

Generator generates a set of URLs from given keywords using a conventional search engine. It takes the desired topic as input, calls the Yahoo search engine, and submits a query for all documents covering that topic. Generator stores the URL and topic of each web page found in a database called topdata.

Topic

Topic uses the topdata database to insert random URLs from the database into the current set. Topic performs mutation.
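
Topic mutation as described here can be sketched in a few lines. The representation of topdata as a list of `(url, topic)` pairs, the `rate` parameter (the fraction of the current set's size to inject), and the explicit random generator are all assumptions made for the example.

```python
import random


def topic_mutation(current_set, topdata, topic, rate, rng):
    """Return a copy of `current_set` with random URLs on `topic` added.

    `topdata` is assumed to be an iterable of (url, topic) pairs.
    """
    candidates = sorted(
        url for url, t in topdata if t == topic and url not in current_set
    )
    # Never ask for more URLs than the database can supply.
    count = min(len(candidates), round(rate * len(current_set)))
    return set(current_set) | set(rng.sample(candidates, count))
```

Passing a seeded `random.Random` instance as `rng` makes a run repeatable, which helps when comparing mutation rates.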

Space

Space takes as input the current set from the Agent application and injects into it those URLs from the netdata database that appeared with the greatest frequency in the output sets of previous searches.

Time

Time takes a set of URLs from Agent and inserts those with the greatest frequency into the netdata database. The netdata database consists of three fields: URL, topic, and count number. The database is updated in each iteration of the algorithm.
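
How Space and Time cooperate through netdata can be sketched with an in-memory counter. This is a simplification under stated assumptions: the `NetData` class and `space_step` function are hypothetical names, and the sketch records every URL of an output set and applies the frequency cut when reading, whereas the slides describe Time as inserting only the most frequent URLs.

```python
from collections import Counter


class NetData:
    """In-memory stand-in for the netdata table: (URL, topic) -> count."""

    def __init__(self):
        self.counts = Counter()

    def record(self, output_set, topic):
        # Time: bump the count of every URL that appeared in this output set.
        for url in output_set:
            self.counts[(url, topic)] += 1

    def most_frequent(self, topic, n):
        rows = [(url, c) for (url, t), c in self.counts.items() if t == topic]
        rows.sort(key=lambda row: (-row[1], row[0]))  # by count, then URL
        return [url for url, _ in rows[:n]]


def space_step(current_set, netdata, topic, n):
    # Space: inject the n most frequent URLs for the topic into the current set.
    return set(current_set) | set(netdata.most_frequent(topic, n))
```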

How Does The System Work?

[Diagram: command flow and data flow among the input set, the control program, Generator, Agent, Spider, Topic, Space, Time, the current set, the output set, and the topdata and netdata databases.]

GA and the Internet: Conclusion

GA for Internet search, in contrast to other GAs, is much faster and more efficient than conventional solutions, such as standard Internet search engines.

INTERNET

Genetic Search Based on Multiple Mutation Approaches

Concept and its improvements adapted to specific applications in e-business, and a concrete software package

Main problems in finding information on the Internet:
- How to quickly find and efficiently retrieve potentially useful information, given the fast growth in the quantity and variety of Internet sites
- Searching with indexing engines returns a huge number of documents, many of which are completely unrelated to what the user originally attempted to find
- Documents placed at the top of the result list are often less acceptable than those lower down
- The indexing process may take days, weeks, or even longer, because of the volume of new information being created daily

Links Based Approach

The question is: How to locate and retrieve the needed information before it gets indexed?

An efficient way to locate new, not-yet-indexed information is to use links-based approaches:
- genetic search
- simulated annealing

Best result: indexing-based approaches + links-based approaches

Genetic Search Algorithm

GENETIC ALGORITHM OF ZERO ORDER, with no mutation
- Start: a model Web presentation that contains all the needed types of information (the fitness function is evaluated).
- It is assumed that the model includes URL pointers to other similar Web presentations, and these are downloaded.
- The Web presentations that survive the fitness function are assumed to include additional URL pointers, and their related Web presentations are downloaded next.
- After the end-of-search condition is met, the Web presentations are ranked according to their fitness value.
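
The zero-order algorithm (links only, no mutation) can be sketched as follows. The `links_of` and `fitness` callables are hypothetical stand-ins for downloading and scoring a Web presentation, and the survival `threshold` and the `max_pages` end-of-search condition are assumptions; the slides do not specify either.

```python
def zero_order_search(model_url, links_of, fitness, threshold, max_pages):
    """Follow links from pages that survive the fitness test; no mutation."""
    ranked = {}  # surviving URL -> fitness value
    frontier = [model_url]
    while frontier and len(ranked) < max_pages:
        url = frontier.pop(0)
        if url in ranked:
            continue  # already accepted and expanded
        score = fitness(url)
        if score < threshold:
            continue  # did not survive, so its links are never followed
        ranked[url] = score
        frontier.extend(links_of(url))
    # After the search ends, rank the survivors by fitness value.
    return sorted(ranked, key=lambda u: (-ranked[u], u))
```

Because a page that fails the fitness test is never expanded, anything reachable only through it stays undiscovered. That is the gap the mutation strategies are meant to close.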

Genetic Search Algorithm

Types of mutation:

- Topic-oriented database mutation
- Semantic mutations
  - based on the principles of spatial locality
  - based on the principles of temporal locality

Logical reasoning and semantic considerations are involved in picking out URLs for mutation.

Innovations Required by Domain Area

APPLICATION LEVEL

LEVEL OF THE GENERAL PROJECT APPROACH AND PRODUCT ARCHITECTURE

ALGORITHMIC LEVEL

IMPLEMENTATION LEVEL

Application Level

- Statistical analysis and data mining have to be performed in order to figure out the common and typical patterns of behavior and need
- The state of the art of mutual referencing has to be determined
- The trends and asymptotic situations foreseen for the time of project finalization have to be determined

Level of the General Project Approach and Product Architecture

Decisions have to be made about the most important goals to be achieved:

Maximizing the speed of search

Maximizing the sophistication of search

Maximizing specific effects of interest for a given institution or a customer

Maximizing a combination of the above

Decisions at this level affect the applicability of the final product / tool.

Algorithmic Level
Develop an efficient mutation algorithm of interest for the application:

- in the direction of database architecture and design
- in introducing the elements of semantics-based mutation

Semantics-based mutations are especially of interest for chaotic markets, typical of new markets in developed countries or traditional markets in under-developed countries.

Semantics-based Mutation
Mutation based on spatial localities

After a fruitful Web presentation is reached (using a traditional algorithm with mutation), the site of the same Internet service provider is searched for other presentations on the same or similar topic.

Explanation: In chaotic markets, it is very unlikely that service/product offers from the same small geographic area reference each other on their Web presentations.

After a successful side trip based on spatial mutation, one continues with the traditional database mutation.
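
A spatial mutation step needs some way to decide that two presentations live with the same provider. One simple sketch, assuming that "same Internet service provider" can be approximated by "same host name" (an assumption that will not hold for every hosting arrangement), uses the standard `urllib.parse` module; the `spatial_candidates` name is invented for the example.

```python
from urllib.parse import urlsplit


def spatial_candidates(fruitful_url, known_urls):
    """Return the other known URLs served from the same host as `fruitful_url`."""
    host = urlsplit(fruitful_url).hostname
    return sorted(
        url for url in known_urls
        if url != fruitful_url and urlsplit(url).hostname == host
    )
```

The candidates returned this way would still have to pass the fitness function before joining the current set.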

Semantics-based Mutation
Mutation based on temporal localities

- One comes back periodically to a Web presentation which was fruitful in the past
- One comes back periodically to other Web presentations developed by the author who created some fruitful Web presentations in the past
- Temporal mutation can use direct revisits or a number of indirect forms of revisit

Implementation Level

- Utilization of novel technologies, for maximal performance and minimal implementation complexity. Important for:
  - good flexibility
  - extendibility
  - reliability
  - availability
- Utilization of mobile platforms and mobile agents

Implementation Level

Static agents
- one has to download megabytes of information
- treat that information with a decision-making code of size measured in kilobytes
- derive the final business-related decision, which is binary in size (one bit: yes or no)

A huge amount of data is transferred through the network in vain, because only a small percentage of the fetched documents will turn out to be useful.

Mobile agents
- they would browse through the network and perform the search locally, on the remote servers, transferring only the needed documents and data
- they load the network only with kilobytes and a single bit
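
The difference in network load can be made concrete with a back-of-the-envelope calculation. The two cost functions below are a simplified model, and every number in the example is invented purely for illustration; none of them comes from the slides or from any measurement.

```python
def static_traffic(total_docs: int, doc_bytes: int) -> int:
    # Static agent: every candidate document crosses the network before it is judged.
    return total_docs * doc_bytes


def mobile_traffic(servers: int, agent_bytes: int,
                   useful_docs: int, doc_bytes: int) -> int:
    # Mobile agent: its code travels to each server, and only the useful
    # documents travel back.
    return servers * agent_bytes + useful_docs * doc_bytes


# Hypothetical scenario: 1000 documents of 50 kB spread over 10 servers,
# of which 50 turn out to be useful, and a 20 kB agent.
static_bytes = static_traffic(1000, 50_000)          # 50,000,000 bytes
mobile_bytes = mobile_traffic(10, 20_000, 50, 50_000)  # 2,700,000 bytes
```

Under these made-up figures the mobile agent moves roughly one eighteenth of the data. The advantage shrinks as the share of useful documents grows.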

Simulation Result

- Links-based approach in the static domain
- How various mutation strategies can affect the search efficiency
- A set of software packages has been developed that performs Internet search using genetic algorithms (by Veljko Milutinovic, Dragana Cvetkovic, and Jelena Mirkovic)
- As the fitness function, they measured the average Jaccard score for the output documents while changing the type and rate of mutation
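
The Jaccard score of two sets is the size of their intersection divided by the size of their union. The slides do not state which sets were compared, so the sketch below assumes that each output document is represented by its keyword set and compared against a reference keyword set; that representation and the function names are assumptions.

```python
def jaccard(a: set, b: set) -> float:
    """Jaccard score |a & b| / |a | b|, defined here as 0.0 for two empty sets."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def average_jaccard(reference_keywords: set, output_docs: list) -> float:
    """Mean Jaccard score of each output document's keyword set against the reference."""
    if not output_docs:
        return 0.0
    return sum(jaccard(reference_keywords, doc) for doc in output_docs) / len(output_docs)
```

For example, `jaccard({"a", "b", "c"}, {"b", "c", "d"})` shares two of four distinct elements and evaluates to 0.5.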

Simulation Result

The simulation result for topic mutation

The simulation result for temporal and spatial mutation combined with topic mutation

Simulation Result
The simulation result for topic, spatial and temporal mutation combined.
Constant increase in the quality of pages found.

Conclusion: Evolution

Tutorial download: galeb.etf.bg.ac.yu/~vm Option:Tutorials
