Data Engineering for the Analysis of Semiconductor Manufacturing Data
Peter Turney
Knowledge Systems Laboratory
Institute for Information Technology
National Research Council Canada
Ottawa, Ontario, Canada
K1A 0R6
613-993-8564 (voice)
613-952-7151 (fax)
[email protected]
Abstract
We have analyzed manufacturing data from several different semiconductor manufacturing plants, using decision tree induction software called Q-YIELD. The software generates rules for predicting when a given product should be rejected. The rules are intended to help the process engineers improve the yield of the product, by helping them to discover the causes of rejection. Experience with Q-YIELD has taught us the importance of data engineering — preprocessing the data to enable or facilitate decision tree induction. This paper discusses some of the data engineering problems we have encountered with semiconductor manufacturing data. The paper deals with two broad classes of problems: engineering the features in a feature vector representation and engineering the definition of the target concept (the classes). Manufacturing process data present special problems for feature engineering, since the data have multiple levels of granularity (detail, resolution). Engineering the target concept is important, due to our focus on understanding the past, as opposed to the more common focus in machine learning on predicting the future.
1. Introduction
We define data engineering as the transformation of raw data into a form useful as input to algorithms for inductive learning.[1] This paper is concerned with the transformation of semiconductor manufacturing data for input to a decision tree induction algorithm. We have been using decision tree induction for process optimization. The optimization task that we address is to discover what aspects of a manufacturing process are responsible for a given class of rejected products. A product in semiconductor manufacturing may be an integrated circuit, a wafer (a disk of silicon, usually holding about 100 to 1,000 integrated circuits), or a batch of wafers (usually about 20 to 30 wafers). A product is usually accepted or rejected on the basis of electrical measurements. (See Van Zant (1986) for a good introduction to semiconductor manufacturing.)
We have analyzed data from several different semiconductor manufacturing plants. The data were analyzed using Q-YIELD, which generates rules for predicting when a given product will be rejected, given certain process measurements (Famili & Turney, 1991; 1992).[2] The rules are intended to help process engineers improve the yield of the product, by helping them discover the causes of rejection.
In general, there are two types of applications for inductive learning algorithms:
they may be used to predict the future or to understand the past. Our emphasis has
been on understanding the past. This places certain constraints on the software that
we use. For example, it is very important that the induced model should be readily
understandable by process engineers, which excludes neural network models.
This real-world application of machine learning has presented us with some interesting technical problems, which do not seem to have been considered in the machine learning literature. Section 2 discusses the problems of engineering the features. Manufacturing process data have multiple levels of granularity (levels of detail; levels of resolution), which makes the data difficult to represent in the standard feature vector notation. Section 3 discusses the problems of engineering the classes. Most papers on inductive learning assume that the definition of the target concept is given. In our work, determining the best definition of the target concept is a large part of the task. For each problem we describe, we outline the solutions we have adopted and the open questions. In general, we have not had the resources to validate our solutions by comparing them with alternative approaches.
The conclusion is that data engineering is essential to the successful application of decision tree induction to semiconductor manufacturing. Data engineering is currently much more an art than a science. We present here a list of recipes for data engineering with semiconductor manufacturing data, but what we need is a unifying scientific foundation for this diverse collection of recipes.
2. Engineering the Features

A batch-level measurement, such as the temperature of a furnace, applies to every wafer in the batch; there is no reason to record this information at the site-level (for example), since all the site-level temperature measurements would be (approximately) identical within a given batch. Site-level and IC-level measurements are typically electrical measurements that are used for quality control. A wafer may be rejected if certain site-level measurements are not within bounds. An IC may be rejected if certain IC-level measurements are not within bounds. Part of the reason that granularity is important is that decisions (accept or reject) are made at different levels of granularity (reject a whole wafer or reject a single IC).
The data at these different levels of granularity are frequently stored in separate databases. Data from the manufacturing process are recorded at the batch-level in one database, while data from electrical testing are recorded at the IC-level in a second database. This introduces the mundane difficulty of extracting data from two or more databases, but there is also a challenging data engineering problem: the data have a structure that is not naturally represented in the feature vector format required by most decision tree induction algorithms.
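To make the merge step concrete, here is a minimal sketch in Python (using pandas), assuming two hypothetical tables: a batch-level process database and an IC-level test database. The column names (batch_id, furnace_temp, ic_passed) are illustrative, not from the original data, and this is one way to realize the transformation, not the Q-YIELD implementation.

    import pandas as pd

    # Hypothetical batch-level process database: one row per batch.
    process_df = pd.DataFrame({
        "batch_id": ["B01", "B02"],
        "furnace_temp": [987.0, 1012.0],
    })

    # Hypothetical IC-level test database: one row per integrated circuit.
    test_df = pd.DataFrame({
        "batch_id": ["B01", "B01", "B02", "B02"],
        "ic_passed": [1, 0, 1, 1],
    })

    # Aggregate the IC-level data up to the batch-level (here, the yield),
    # then merge, giving one feature vector per batch.
    ic_yield = (
        test_df.groupby("batch_id")["ic_passed"]
        .mean()
        .rename("yield")
        .reset_index()
    )
    batch_df = process_df.merge(ic_yield, on="batch_id")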
So far, our practice has been to convert the data to a feature vector format by
moving all measurements up to the highest level of granularity that is relevant for
the given manufacturing problem (often the batch-level). We have experimented
with two methods for transforming lower-level data to higher-level data:
Method A: Suppose that we wish to move site-level data up to the batch-level. Let X be an electrical parameter (the voltage drop across a diode, for example) that is measured at five test sites on a wafer. In a batch of 24 wafers, there will be 120 (24 × 5 = 120) measurements of X. To bring X up from the site-level to the batch-level, we can introduce five new batch-level features (a code sketch follows the list):
1. the average of X in the 120 measurements in the given batch
2. the standard deviation of X in the 120 measurements in the given batch
3. the median of X in the 120 measurements in the given batch
4. the minimum of X in the 120 measurements in the given batch
5. the maximum of X in the 120 measurements in the given batch
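A minimal sketch of Method A in Python (pandas), assuming a hypothetical table site_df with one row per site-level measurement and columns batch_id and X; the names are illustrative.

    import pandas as pd

    def site_to_batch(site_df, param="X"):
        """Summarize one site-level parameter as the five batch-level
        features listed above (average, standard deviation, median,
        minimum, maximum)."""
        stats = site_df.groupby("batch_id")[param].agg(
            ["mean", "std", "median", "min", "max"]
        )
        stats.columns = [param + "_" + s for s in stats.columns]
        return stats.reset_index()

For a batch of 24 wafers with five test sites each, the 120 rows for that batch collapse to a single row with five features.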
This can result in a large feature vector, since every lower-level measurement results in five higher-level features. It can also result in a shortage of cases. A database with 1,200 records (cases, examples), where each field (attribute, feature, measurement) is measured at the site-level, yields 10 batch-level feature vectors (1,200 / 120 = 10). Thus an abundance of data is transformed into a shortage of data. However, if the manufacturing problem is due to fluctuations in the process at the batch-level, then the apparent abundance of data was an illusion, since the database only has a small amount of information about batch-level fluctuations.
When absolute time stamps are not available, we can often extract sequential order from the batch ID. Most plants assign an ID to each batch, and the IDs are often a combination of digits that are assigned sequentially. This ordering information is sufficient to detect unique events — it may be unnecessary to know the absolute time of an event.
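A hedged sketch of this idea, assuming a hypothetical ID format in which a sequentially assigned run of digits is embedded in the batch ID:

    import re

    def sequence_key(batch_id):
        """Extract the first run of digits in a batch ID as an ordering key."""
        match = re.search(r"\d+", batch_id)
        return int(match.group()) if match else -1

    # Sort batches into (approximate) processing order.
    batch_ids = ["LOT0459A", "LOT0457A", "LOT0458B"]
    batch_ids.sort(key=sequence_key)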
3. Engineering the Classes

Suppose that the yield of a process is usually above 90% but sometimes dips below 90%, and the process engineer wants to understand what is causing the dip. In the simplest case, there is a batch-level measurement called yield and the target class is “yield is less than 90%”. We can define the target class as a symbolic variable with the value 1 when the yield is below 90% and the value 0 when the yield is above 90%.
We convert the continuous yield variable to a discrete variable using a threshold, such as 90%. There are (at least) three ways to set a threshold for the yield. (1) We may use external factors (economic factors, management decisions, pressure from competition) to determine the desired minimum yield for the process; (2) we can choose the median yield, so that we have a balance of examples and counter-examples; or (3) we can look at the data to see whether there is a natural threshold, based on clusters in the data. We find that we tend to get better results with approaches (2) and (3), rather than (1). We often experiment with several different thresholds. We use visual aids to suggest possible thresholds. One aid is a histogram of the yield (the x axis is the yield and the y axis is the number of batches with the given yield). Sometimes there will be a valley in the histogram that suggests a natural place for a threshold. Another aid is a plot of the yield over time (the x axis is time and the y axis is yield). Sometimes there are recurrent dips in the plot that can readily be isolated with the right threshold.
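A minimal sketch of the thresholding step in Python (pandas), assuming a hypothetical Series of batch yields (here expressed as fractions); the default uses the median, as in approach (2), while a fixed threshold such as 90% corresponds to approach (1).

    import pandas as pd

    def target_class(yields, threshold=None):
        """Value 1 when the yield is below the threshold (the target
        class), value 0 otherwise."""
        if threshold is None:
            # Approach (2): the median balances examples and counter-examples.
            threshold = yields.median()
        return (yields < threshold).astype(int)

    yields = pd.Series([0.95, 0.88, 0.93, 0.86],
                       index=["B01", "B02", "B03", "B04"])
    labels = target_class(yields, threshold=0.90)  # B02 and B04 are in the class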
The yield of a process is a composite variable, since there are many different reasons for rejection of a wafer or IC. In a process with a yield of 90%, there may be 30 different classes of problems in the 10% of parts that are rejected. Treating each problem separately can make the task simpler for the induction algorithm. Suppose that electrical measurements are made at five test sites on a wafer and a wafer is rejected when two or more of the five measurements of electrical parameter X are above a threshold T. This electrical measurement X is one way that a wafer can be rejected and we can focus on X instead of examining yield. To bring X up from the wafer-level to the batch-level, we can introduce a new variable Y, defined as the percentage of the wafers for which two or more of the five measurements of electrical parameter X are above the threshold T (as we discussed in Section 2.1). We can then define two classes of batches, those for which Y is below some threshold U and those for which Y is above U. The target class is “Y is below U”. The same issues arise in setting the threshold U as arose in setting the threshold on the yield.
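A sketch of this wafer-to-batch transformation in Python (pandas), assuming a hypothetical table site_df with columns batch_id, wafer_id, and X, with five rows (test sites) per wafer; the column names and thresholds are illustrative.

    import pandas as pd

    def reject_rate_Y(site_df, T):
        """Y: the percentage of wafers in each batch for which two or
        more of the five site measurements of X are above T."""
        flagged = site_df.assign(over=site_df["X"] > T)
        per_wafer = (
            flagged.groupby(["batch_id", "wafer_id"])["over"].sum() >= 2
        )
        return per_wafer.groupby(level="batch_id").mean() * 100

    def target_class_Y(Y, U):
        """The target class from the text: 1 when Y is below U, else 0."""
        return (Y < U).astype(int)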
Some open questions are:
1. Can we automate the selection of a threshold in the definition of a target class?
2. Should we use regression trees instead of classification trees (Breiman et al., 1984)? Is there a way to make regression trees easier to understand? (A sketch appears below.)
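As an illustration of open question 2, here is a hedged sketch using scikit-learn (an assumption of this example; it was not part of the original work) that fits a shallow regression tree to the continuous yield and prints it in a readable form:

    import numpy as np
    from sklearn.tree import DecisionTreeRegressor, export_text

    # Hypothetical process measurements (rows = batches) and continuous yields.
    X = np.array([[987.0, 5.1], [1012.0, 4.8], [995.0, 5.3], [1020.0, 4.6]])
    y = np.array([0.95, 0.88, 0.93, 0.86])

    # A shallow tree keeps the model understandable to a process engineer.
    tree = DecisionTreeRegressor(max_depth=2).fit(X, y)
    print(export_text(tree, feature_names=["furnace_temp", "etch_time"]))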
4. Conclusions
The above examples show that a significant amount of data engineering is involved
in the application of decision tree induction to semiconductor manufacturing data.
There are many open questions raised by our data engineering methods and many
assumptions that have not yet been investigated. We believe that it is possible and
worthwhile to build a firm theoretical foundation for data engineering. We are
hopeful that the recipes and open questions raised here can contribute to such a
foundation.
Notes
1. This definition suggests that data engineering is always done by hand. We do
not mean to exclude the possibility of automatic data engineering, but we have
not been able to invent a more satisfying definition of data engineering.
2. Q-YIELD is a commercial product, available from Quadrillion Corporation,
380 Pinhey Point Road, Dunrobin, Ontario, Canada, K0A 1T0. The software
is based on a prototype that was developed at the NRC.
3. These issues were raised by Joel Martin, in conversation.
4. For some tasks, it is reasonable to consider a lower level of granularity, such
as the components (flip flops, transistors, gates) within an IC. The four levels
listed here are not meant to be exhaustive.
5. There are higher levels of granularity, such as a production run, but we do not
usually analyze the data at this level of granularity.
Acknowledgments
Thanks to Michael Weider and Joel Martin for their very helpful comments on earlier versions of this paper.
References
Breiman, L., Friedman, J., Olshen, R., & Stone, C. (1984). Classification and regression trees. California: Wadsworth.
Famili, A., & Turney, P.D. (1991). Intelligently helping the human planner in industrial process planning. Artificial Intelligence for Engineering Design, Analysis, and Manufacturing, 5(2), 109-124.
Famili, A., & Turney, P.D. (1992). Application of machine learning to industrial planning and decision making. In A. Famili, S. Kim, & D. Nau (Eds.), Artificial Intelligence Applications in Manufacturing (pp. 1-16). Cambridge, MA: MIT Press.
Lavrac, N., & Dzeroski, S. (1994). Inductive logic programming: Techniques and applications. New York: Ellis Horwood.
Van Zant, P. (1986). Microchip fabrication: A practical guide to semiconductor processing. California: Semiconductor Services.