Geek Interview Question

Interview questions

Uploaded by

Shwetha M N
Copyright
© All Rights Reserved

Geek interview question

1. How do you separate duplicate records from a grouped input file without using
Dedup Sorted?
Ans:- We can achieve that through the rollup ...
(or)
Rollup can remove the duplicates without using the Dedup component.
For each key group it takes the first record and rejects the rest.
2. 3-way join with different column names
Ans:- SELECT C.DEPTNAME, SUM(B.SALARY)
FROM FILE1 A, FILE2 B, FILE3 C
WHERE A.EMPLID = B.EMPLID AND A.DEPTID = C.DEPTID
GROUP BY C.DEPTNAME
(or)
B. Set up the Join component. Fine, I will answer this myself. You don't
need to specify a key here, neither a key nor an override key. So in the transform you
have to find the columns you need to match and put a...
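The SQL in the first answer can be tried outside Ab Initio. The following is a minimal sketch, assuming an in-memory SQLite database stands in for the three files; the sample rows and the ORDER BY clause are assumptions added only to make the result easy to check.

```python
import sqlite3


def build_sample_db():
    """Create an in-memory database with made-up rows for FILE1, FILE2 and FILE3."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE FILE1 (EMPLID INTEGER, DEPTID INTEGER);
        CREATE TABLE FILE2 (EMPLID INTEGER, SALARY INTEGER);
        CREATE TABLE FILE3 (DEPTID INTEGER, DEPTNAME TEXT);
        INSERT INTO FILE1 VALUES (1, 10), (2, 10), (3, 20);
        INSERT INTO FILE2 VALUES (1, 100), (2, 200), (3, 300);
        INSERT INTO FILE3 VALUES (10, 'SALES'), (20, 'HR');
    """)
    return conn


def salary_by_department(conn):
    """Join the three tables on their differently named roles and sum salary per department."""
    query = """
        SELECT C.DEPTNAME, SUM(B.SALARY)
        FROM FILE1 A, FILE2 B, FILE3 C
        WHERE A.EMPLID = B.EMPLID AND A.DEPTID = C.DEPTID
        GROUP BY C.DEPTNAME
        ORDER BY C.DEPTNAME  -- assumption: added for a stable row order
    """
    return conn.execute(query).fetchall()
```

FILE1 acts as the bridge: it carries both EMPLID (to reach the salaries) and DEPTID (to reach the department names), which is why the query needs two join conditions.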
3. Removing header and trailer records
Ans:- A. Use UNIX sed.
# This line removes both the header and the trailer record
...
(or)
B. You can use the Redefine Format, Filter by Expression and Run Program components.
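The sed command itself is cut off in the answer above, so it is not reproduced here. As a language-neutral illustration of the same idea, here is a minimal Python sketch, assuming the header is the first line and the trailer is the last line of the file; the function name is hypothetical.

```python
def strip_header_and_trailer(lines):
    """Return only the detail records, dropping the first (header) and last (trailer) line."""
    if len(lines) < 2:
        # A file with fewer than two lines has no detail records to keep.
        return []
    return lines[1:-1]
```

For example, strip_header_and_trailer(["HDR", "rec1", "rec2", "TRL"]) keeps only the two detail records.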
4. Ab Initio DML usage in Informatica source creation
Ans:- Can we create an Informatica source from the DML of the corresponding
Ab Initio source, by importing the DML, without using the manual create-source
option?
5. What is a control file in Ab Initio?
Ans:- In Ab Initio we have the multifile system. Multifiles are like different
partitions of an original file. For example, if we take a 4-way partitioned file, which
implies that the data of an original file is stored i...
6. Which file should we keep as a lookup file, the large file or the file with fewer
records, and why?
Ans:- When we say lookup, it can be serial or it can be in MFS, and generally
we decide what kind of lookup it needs to be based on the application logic. And
again we need to think about the nature of the loo...
(or)
B:- The lookup file should be small, with few records. This is because,
when the graph is invoked, all the data in the lookup file is loaded into
memory. This means, if the data in the...
7. Ab Initio sorted component
Ans:- You can use the Sort component to get sorted output.
8. What are the two types of .dbc files?
Ans:- A .dbc file is a database configuration file. It holds the information needed
to connect to the database.
Types of files:
1. Fixed length file
2. Delimited file
B:- We don't have different types of .dbc file. There is only one kind of .dbc file, which
is used to connect a component in the GDE to a particular database.
9. Package editor
Ans:- What are "package" and "package editor", and how and why are they used?
Once you are in the transform editor, click on View and go to the package editor. It is
used to get extra transform abilities like initialize, finalize etc.
10. Wrapper and .ksh in Ab Initio
Ans:- Why are wrapper scripts used in a job scheduler to run graphs in Ab Initio?
Why can't we use the .ksh script in the job scheduler to schedule the job?
(or)
A wrapper script executes multiple graphs (or jobs), which means multiple .ksh jobs
are executed from it,
whereas a .ksh executes a single component process; that
means every component process has its own .ksh process.
11. What is a vector field? Explain.
Ans:- Hi, can you please explain it with an example? I need to build a vector for
comparing the input records; if the records match, then I take the output.
(or)
A vector field is like an array as used in C/C++.
It is mainly used in the Normalize and Denormalize components.
12. Factless fact (What is a factless fact?)
Ans:- A fact table that does not have any measures (such as quantity) is called a
factless fact.
13. Config variables in Ab Initio
Ans:- The names of all Ab Initio config variables start with AB_.
14. Sort component
Ans:- It can read both. (or) Sort reads records in parallel, because all groups
of records must be read and sorted at the same time.
15. What is a lookup file?
Ans:- A lookup file is a .dat file which contains a small amount of data with a
specified key or keys. (or)
A lookup file is basically a specific dataset which is keyed. It can be used to
map values according to the data present in a particular file (serial or multifile). The
dataset can be static as well dyna...
16. Pset (What is a .pset?)
Ans:- A .pset is an input parameter file. (or)
Above version 1.15, the inputs which we pass as graph parameters are stored in a
file called a pset. It is stored in the pset folder. Generally the pset file is saved with the
same name as the graph, with the extension .pset.
17. Graph process
Ans:- After developing the graph, save it. It will automatically generate .ksh, .db,
.mp, .xfr and .dml files.
If you want to send the saved or latest version of the graph, you can use the check-in
option in the GDE, or from UNIX you can use EXPORT.
18. If I want to run the graph in UNIX, what command do I need to use?
Correct me if I'm wrong: run
Ans:- You can deploy the graph and generate the .ksh file, which you can execute
directly, or use the following command to execute the graph:
air sandbox run .mp
(or)
To run a graph in UNIX, follow these simple steps.
1. Deploy the graph to a wrapper script (.ksh).
2. In UNIX, run the wrapper script as described below.
script_path/xyz.ksh --> if no parameters are defined.
scr...

19. Invalid parameter in layout URL
Ans:- What will happen when we pass a dot or invalid parameters in the input
component layout URL? (or)
I found the answer myself.
If the environment parameters are not set, we get this error.
20. Version control in Ab Initio
Ans:- Generally we do version control using the EME (Enterprise Meta
Environment).
You check the graph files (mp, ksh, dml, dbc, xfr etc.) in or out.
You can check in or check out using the GDE or the command prompt.
21. Records filter (How do you filter records by using Reformat?)
Ans:- Through the select parameter. (or) We can also use the Reformat features
output_index and output_indexes. If you have more filter conditions, increase the
count and use if-else conditions with priority assignment.
22. How do you execute a wrapper script?
Ans:- ./.ksh (or)
Different methods to run UNIX wrapper scripts:
. ./script_name.sh
sh script_name.sh
ksh script_name.sh
sh -x script_name.sh
ksh -x script_name.sh
Running...
23. How do you insert header and trailer records?
Ans:- Keep the header in a separate file and the trailer in a separate file.
flow 1 - header
flow 2 - content
flow 3 - trailer
Use Concatenate with flows 1, 2 and 3 in that order and redirect it to the output file.
(or)
Use the CONCATENATE component in your graph.
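The three-flow arrangement described above can be pictured with a minimal Python sketch. It only mimics the ordering behaviour of a concatenation (all of flow 1, then all of flow 2, then all of flow 3); the function name and the sample records are assumptions.

```python
def concatenate_flows(header, content, trailer):
    """Append the flows in port order: header first, then content, then trailer."""
    output = []
    for flow in (header, content, trailer):
        output.extend(flow)
    return output
```

The order of the inputs matters: swapping flow 1 and flow 3 would put the trailer at the top of the output file.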
24. Replicate and Broadcast components
Ans:- When can we use Replicate, and when can we not use Broadcast?
What are the advantages and disadvantages of Replicate? What are the
advantages and disadvantages of Broadcast?
Broadcast - Takes data from multiple inputs, combines it and sends it to all the
output ports. For example, you have 2 incoming flows (this can be data
parallelism or component parallelism) on Bro...
25. What is ABLOCAL?
Ans:- The ABLOCAL() construct is used when some complex SQL statements
contain grammar that is not recognized by the Ab Initio parser and data is
being unloaded in parallel. In that case the ABLOCAL() construct ...
26. DB lookup (How do you use a DB lookup?)
Ans:- For a DB lookup you can use the Join with DB component. This is a very
expensive operation in Ab Initio. If the join table contains little data, first
create a serial file and use it as a lookup, or use the Ab Initio Join.
27. Broadcast vs Replicate (What is the functional difference between the Replicate
and Broadcast components?)
Ans:- Broadcast: data parallelism.
Replicate: component parallelism.
1) The basic difference is that Broadcast is a partition component and Replicate is a
miscellaneous component.
2) As Broadcast is a partition component, by default it always
works in fan-out mode only, where ...
27. What is a pset file in Ab Initio?
Ans:- A .pset file is a set of input values or input parameters. We can run the .pset
file with the air sandbox run command. It internally calls the .mp file and
passes all the input parameters at run time.
28. What is air sandbox in ab initio?

29. What is the structure of parameter sets?


30. Graph parameters(How can you set the parameters to call the graphs)?
31. Graph deployment (Explain the deployment of a graph, with start and end
scripts)
Ans:- Thanks for the response. My actual question is about when there is a start script
and an end script. Execution goes start script > graph > end script. But
the deployed ksh in UNIX is not of ...
(or)
Basically, when we execute the graph in the GDE,
the start script is executed first, before the graph, and the end script is executed
afterwards.
32. What are wrapper scripts? What do they contain?
Ans:- Wrapper scripts are UNIX scripts, also known as calling scripts. They are used
to run the deployed scripts in the order we specify in the UNIX script.
33. How do we use SCD types in Ab Initio graphs?
Ans:- Sorry, the previous answer is wrong. We can work out the SCDs by redefining
the formats.
34. How does component folding work in Ab Initio?
Ans:- It reduces the number of processes created by the graph and can enhance
the performance of the graph. Several graph components are combined into a
group at run time. Each group runs as a single process and uses less memory during
runtime.
35. Convert 4-way MFS to 8-way MFS (How do you convert 4-way to 8-way MFS files
without throwing away the data in the file?)
Ans:- 1. To convert a 4-way to an 8-way partition we need to change the layout in the
partitioning component. 2. Departition the data, then repartition.
36. How do you do production support for a graph? How do you check whether a
graph completed successfully or not (is it $? of UNIX?) What are the different
return values?
Ans:- If you are running the Ab Initio ksh file, you have to check the status using
$?.
$mpjret is used in the end script of the Ab Initio graph to check whether the mp ran
successfully or not.
(or)
How do you do production support for a graph? If the graph fails in
production, usually we get emergency access to see the failure, then analyse the
failure. If it is a code bug, then we go back to...
37. How can we count the number of records in a flat file using Ab Initio?
Ans:- Two answers are already there for your question. One more method is to
use the next_in_sequence()
function. (or)
Use the Rollup component to count the number of records in the flat file.
Use {} as the key in the key specifier. It will treat all the records as one group and
count the total number of records.
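The effect of the empty key {} can be illustrated with a minimal Python sketch. This is not Ab Initio code: it only mimics a rollup in which every record falls under the same (empty) key, so the single resulting group carries the total count. The function name is hypothetical.

```python
def count_records(records):
    """Roll up all records under one empty key so a single group holds the total count."""
    groups = {}
    for _ in records:
        key = ()  # stands in for the empty key specifier {}
        groups[key] = groups.get(key, 0) + 1
    # With an empty key there is at most one group; an empty input yields 0.
    return groups.get((), 0)
```

Because the key never changes, the rollup produces one output value no matter how many records arrive.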
38. How do you create SCDs (slowly changing dimensions) in Ab Initio?
Ans:- SCD means a slowly changing dimension, with changes in sizes
such as int to long data types.
We work out SCDs with delta processing, or by redefining the formats with Redefine
Format. That's it; don't worry and don't get confused. (or)
If you want to implement SCDs in Ab Initio, you should do delta
processing.
39. How do you call force_error with a condition?
Ans:- You can write the force_error function in the following fashion:
out :: reformat(in) =
begin
  if (in.cust_name == "XYZ") force_error("Invalid Customer");
  out.* :: in.*;
end;
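The control flow of the transform above can be mirrored in a minimal Python sketch: raise an error with a message when the condition holds, otherwise copy the record through unchanged. The exception class and the dictionary record layout are assumptions made for illustration, not part of Ab Initio.

```python
class ForcedError(Exception):
    """Hypothetical stand-in for the failure that force_error raises."""


def reformat(record):
    """Fail with a message when cust_name is "XYZ"; otherwise copy the record."""
    if record["cust_name"] == "XYZ":
        raise ForcedError("Invalid Customer")
    return dict(record)
```

The check comes before the copy, so a failing record never reaches the output.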
40. Which components do not work in pipeline parallelism?
Ans:- Any component which has a max-core value.
Sort, Sort within Groups and Rollup will break pipeline parallelism.
41. How do we append records to an already existing file using an Ab Initio graph?
Ans:- If you want to append records to the same output file, go to Access in the
properties window. There you will find the options: uncheck the option "delete file
if it exists" and check the...
(or)
Create a graph with the existing file as the output file, and keep the mode of
the output file in Append mode. Pass the new records from the input file to
this output file through a Reformat. This will append the new records to the existing
file.
42. How do you identify the ASCII and EBCDIC formats? How do you write the DML
for them? Please explain.
Ans:- For ASCII formats write the DML as below.

cust_id decimal(8)...

For EBCDIC formats write the DML as below.

ebcdic cust_id decimal(8)...

The delimiters vary between the EBCDIC formats sent from the source files,
but they are generally embedded hexadecimal values.
(or)
ASCII vs. EBCDIC
When you view your data, does it have different characters
in between that do not look like ASCII? If so, then in your record format type the
following:
ebcdic record record details...
43. Record counts by Reformat component (How do you calculate the total number of
records in the file using Reformat instead of Rollup?)
Ans:- Connect a Reformat to the log port and use this code, and in the select parameter specify
event_type="finish"
type reformat_final_msg = record decimal("records")
read_count; string("readn&... (OR)
Via its log port.
44. Open Ab Initio in UNIX (How do you open Ab Initio in UNIX?)
Ans:- We cannot open Ab Initio in UNIX. We can only run graphs in UNIX
using the ksh.
45. Why and when do we get the "pipeline broken" error in Ab Initio?
Ans:- A pipeline broken error actually indicates the failure of a downstream
component.
It normally occurs when the database is running out of memory, which
makes the database components in the graph unavailable.
There are various scenarios where you can get this error. How did you encounter
it? You need to elaborate so that the group can give you a specific answer.
46. What is the order of execution of a graph when it runs?
Ans:- 1. Initialisation of parameters
2. Start script execution
3. Graph execution
4. End script execution
(or)
1. All its parameters are initialized
2. Start script call
3. Graph start
4. End script executed.
47. How do you avoid duplicates without using the Dedup component?
Ans:- We can avoid duplicates by using the key_change function of the Rollup
component.
The code will be like below.
out :: key_change(prev, curr) =
begin
  out :: curr != prev;
end;
out :: rollup(in) =
begin
  out :: in;
end;
Answered by: biyyanimv on: Feb 2nd, 2009
Use dedup sort to avoid duplicates
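The key-change idea from the first answer can be sketched in Python: start a new group whenever the key differs from the previous record's key, and emit one record per group. This is only an analogy using itertools.groupby, which likewise requires the input to be grouped by key; the function name and sample data are assumptions.

```python
from itertools import groupby


def first_per_key(grouped_records, key):
    """Keep the first record of each run of equal keys (input must be grouped by key)."""
    return [next(group) for _, group in groupby(grouped_records, key=key)]
```

For example, with records keyed on their first field, first_per_key([(1, "a"), (1, "b"), (2, "c")], key=lambda r: r[0]) keeps (1, "a") and (2, "c").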
48. Sort within Groups (What is the advantage of Sort within Groups?)
Ans:- Sort within Groups refines the sorting of data records that are already sorted
according to one key: it sorts the records within the groups formed by the first
sort according to a second key.
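A minimal Python sketch of the behaviour described above, assuming the input is already ordered by the major key: each run of equal major keys is sorted by the minor key, and the order of the groups themselves is left alone. The function and parameter names are hypothetical.

```python
from itertools import groupby


def sort_within_groups(records, major_key, minor_key):
    """Sort each run of equal major keys by the minor key, keeping group order unchanged."""
    result = []
    for _, group in groupby(records, key=major_key):
        # Only one group is held in memory at a time, not the whole input.
        result.extend(sorted(group, key=minor_key))
    return result
```

Since only one group at a time has to be sorted, this needs less memory than re-sorting the full dataset on both keys.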
49. What is output index? How does it work in Reformat? Does the function below
show output index in use? output:1:if(in.Emp.Sal
Ans:- The output index function is used in a Reformat with multiple output ports to
direct which record goes to which out port. For example, for a Reformat with 3 out ports
such a function could be like if (value==...
50. Ab Initio environment variables (What are environment variables? Why
are they required?)
Ans:- Environment variables are otherwise known as Ab Initio environment
variables. They are set in stdenv, under which the private and public projects
sit.
Parameters like $AB_HOME and $AB_AIR_ROOT are present as environment
variables, and each links to its respective path.
51. Input table component layout - Ab Initio (My question is related to the
Input Table component layout. If we select the layout to be Database
(database default), where exactly is it pulling that layout from? When should
this option be used? Within the database layout we also see the serial, table
and explicit args options; when do we use those?)
Ans:- I would suggest you not use the database default. Use either ai_serial or
ai_mfs.
The database default will pick the data from the database in the same structure as it is
in the db.
The data in the db might be serial or partitioned.
52. What is the use of the AB_JOB command in Ab Initio?
Ans:- When you want to run the same instance of a graph many times from
one place, we go for AB_JOB. It should be defined as a sandbox
parameter. If you don't give a value for it, it will take
AB_JOB as the default. The AB_JOB parameter is set when we want to run the same graph
at the same time under different job names.
53. How do you join two files with different layouts?
Ans:- I think your question is how to join a serial file and a multifile. If that is the case,
use a Broadcast component after the serial file and before the Join. (or)
If the layout is totally different, use the Fuse component.
54. What is the usage of the .mfctl and .mdir files in the MFS directory of Ab Initio?
Ans:- .mfctl and .mdir are both related to the multifile system. .mfctl is the extension of
the control file created when we use the MFS. The file with the extension .mfctl
contains the URLs of all the data partitions. The file with the extension .mdir
contains the URL of the control file used by the MFS.
55. How to rerun a graph in UNIX
Ans:- Whenever a graph fails, it creates a .rec file in the working directory, which
may be the directory where your deployed graph script is stored. So remove that .rec file
and then run the deployed script of the graph from UNIX. You may use m_rollback -d.
(or)
You can run the graph by giving the following command in UNIX:
dtm run <recovery file name> -continue
56. How do you pass parameters to a graph in Ab Initio?
Ans:- Yes, you can declare parameters with the Edit Parameters option in the GDE, and while
running the ksh you can pass the values on the command line.
If you want to pass a parameter to your graph, declare a formal parameter in the
Edit Parameters region.
57. How to use a normal batch graph as a sub graph in continuous graph?
Ans:-
58. How many sandboxes can there be for a project?
Ans:- There can be numerous sandboxes for a project, but there should be only one
sandbox associated with the EME for a project.
We can have any number of sandboxes. A sandbox is nothing but a user's work area, where
each user gets a copy of the project and makes modifications accordingly.
59. How does metadata management take place in Ab Initio?
Ans:- It is possible with the help of the EME. It follows the UNIX file structure.
60. How will you connect two servers?
Ans:- Connecting two different servers in Ab Initio is done through a file called
abinitio.rc. This is used for remote connectivity. This file contains information
such as the server IP (or name), the user name and the password required to connect.
61. Installation of Ab Initio on Windows
Ans:- Hi, I've installed GDE 2.8 on Windows XP, and our database is
Oracle9i. How do we install the Co>Operating System? The Informatica server
and client are also running on the same machine. Does it matter if Informatica software
exists where Ab Initio is going to be installed? Please let me know the steps of
installation...(or)
Hi vasu, I installed Ab Initio 2.10 Co>Op Sys and GDE version 1.10.8 on XP
Professional. I created a DML and data in a serial file. The DML is simple, as is the graph, with
FBE and an out file. I want to see data from file ...(or)
It doesn't matter what you have installed on your PC (either Informatica or
DataStage), but it is better to stop the Informatica services when you are
going to work with Ab Initio. When you want to work with...
62. Call Web Service component - Ab Initio (How does one make use of the
"Call Web Service" component in the $ab_home/connectors/internet directory
of the component selector window of the Ab Initio console? Explain with
sample code.)
Ans:-
63. Implementing a file listener in Ab Initio
Ans:- Is there a way of implementing a file listener in Ab Initio? It should
continuously scan a given directory and, as soon as a file is placed in that directory,
copy that file to a working directory and trigger the corresponding Ab Initio
graph. (or)
You can use the continuous components to build this. It requires an
environment setup, though. You can read through the Ab Initio help by searching
on 'Continuous graphs'.
64. How can you extract and load without transforming?
Ans:- Provided the DML is the same, you can directly connect the input and output
datasets and perform an extract and load operation. For example, if the input
dataset is a table and the output is a file, you can connect the two directly, making
sure the DML of the file is propagated from the table.
65. What does a .rec file contain?
Ans:- The .rec file contains the information required to roll back the graph
when it fails.
Generally it will have some meta char information with hld and nld extensions.
On failure we use m_rollback to roll the job back to the last
committed checkpoint.
(or)
.rec is a recovery file. Recovery is done by using checkpoints.
66. What is the max-core value? What is the use of max-core?
Ans:- Max-core is nothing but the memory allocated to the component.
Rollup, Sort etc. are some of the components that use it.
67. How will you convert an 8-way MFS to a 10-way MFS for multifiles?
Ans:- We can use a repartitioning component such as Partition by Key and Sort. In its
layout we can change the depth. (or)
We can simply use any partition component (Partition by Key, Round-robin or
anything else) to achieve this. Just specify the target multifile system location in the
layout of the partition component. There you go!
68. Ab Initio EME (In my sandbox I have 10 graphs, and I checked those
graphs in to the EME. I checked a graph out again and made modifications, then
found that the modifications were wrong. What do I have to do to get the
original graph back?)
Ans:- Check out the older version, which is the correct one, and check it in
again. Make sure you use the correct version number while checking out. (or)
Could we check out using the version number?
69. Inner join (What will be the output of an inner join with no key when the inputs
on the two input ports are: input1 = 1,1 (one record with two fields) and
input2 = 1,1,1 (one record with three fields)?)
Ans:- Yes, this is a wonderful scenario.
in0: 1,1
in1: 1,1,2
When we use an inner join we will get 4 records:
(1,1)(1,1)(1,1)(1,1).
When we use a Cartesian join (without a key) we will get 6 records,
i.e. 2 x 3 records: (1,1)(1,1)(1,2)(1,1)(1,1)(1,2).
Thank you (or)
Yes, you will get only one record. When you have not given any key in the join, it
gives you the Cartesian product. Since the inputs are in0 with 1 record and
in1 with 1 record, 1 cross 1 is 1. The output ...
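The second answer's reasoning (no key means a Cartesian product, and 1 record x 1 record = 1 record) can be checked with a minimal Python sketch. It models each input as a list of records and pairs every record of one input with every record of the other; the function name and the choice to concatenate the fields of each pair are assumptions.

```python
from itertools import product


def join_without_key(in0, in1):
    """With no join key, every in0 record pairs with every in1 record."""
    return [left + right for left, right in product(in0, in1)]
```

The number of output records is len(in0) * len(in1), so it depends on the record counts of the two inputs, not on how many fields each record has.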
70. What is m_dump?
Ans:- cat file.dml will work fine, but it won't display the record format in a
formatted way. You can think of m_dump as "view data in a formatted manner".
(or)
Can't we use an easier way to see the DML of a file, by just running
cat in the directory where the DML is present? It will use less CPU time and memory
compared to m_dump.
71. How can we create a job sequencer in Ab Initio, i.e. run a number of
graphs at a time?
Ans:- Yes, by using Plans we can make the jobs run in sequence, meaning the
output of the abc.mp graph is the input for xyz.mp. Also, by using Autosys or
Control-M we can schedule the jobs with proper dependencies. (or)
There is something called Conduct>It... Plans, where you can schedule multiple
graphs in Ab Initio...
72. How to get the source from OLTP systems to the staging area in real time
Ans:- Continuous flows can be used to enable real-time processing in Ab Initio. (or)
I think it depends upon the business logic, or whether it is a one-to-one
mapping. ODS to staging somewhat depends upon the various logics.
73. What is a semi-join?
Ans:- A semi-join returns rows from one table that would join with another table,
without performing a complete join. It doesn't have explicit syntax. E.g.: semi-join
to evaluate an EXISTS subquery {gesh...(or)
In Ab Initio there are 3 types of join: 1. inner join, 2. outer join and 3. semi-join.
For inner join 'record...
74. Frequent compilation errors (Give the 5 most frequent compilation
errors in your graph)
Ans:- Reject record should be a runtime error. (or)
There can be many. Some are:
File/source cannot be read.
Data out of order.
Metadata mismatch.
Reject record.
etc...
75. Describe how you would ensure a T-SQL script ran on a particular
database where there are multiple databases on a single server.
Ans:- At the start of the script, you can insert the statement below to make sure that
the script will run on a particular database ...
76. How can we log in to Ab Initio as a developer?
Ans:- If you have access to the development server provided by the firm you are working
with, you can log in as a developer by giving the host name (dev server name),
host type, connection method, your login ID and password from the Settings >
Manage connection menu in the Ab Initio tool.
77. How many input files? (When we open a graph, how do we know how
many input files are used in that graph?)
Ans:- There is one air command that can help you: air project get-required-file.
This command finds and displays the files in a project that the specified object(s)
depend on. This command is branch-sp...
78. How do we run sequences of jobs, where the output of job A is the input to job B?
How do we coordinate the jobs?
Ans:- You can also use a Plan. Create a plan with graph execution components. (or)
In the script (.ksh) we redirect the output of job A as the input to job B.
79. What is phasing and checkpointing? What is the use?
Ans:- Phasing: Phasing in Ab Initio means dividing a complex graph into pieces in
order to improve performance by reducing the resource utilization of the different
phases within the same graph. Check... (OR)
Phases are used to divide the graph into small pieces and reduce the usage of
resources like CPU and memory.
Checkpoints are the commit points from which we can recover the saved data.
80. What is the importance of the EME in Ab Initio?
Ans:- The EME is a repository in Ab Initio. It is used for check-in and check-out of
graphs and also maintains graph versions. A lock is put on an object once it is accessed by
any user. (OR)
By using the command line prompt. We have air commands for the check-in and
check-out process.
Check In : air project import /project// -basedir /ai/src//users/dev//sand/ -files <<'EOF'
Check Out : air project export /project// -basedir /ai/src//users/dev//sand/ -files <<'EOF'
81. Header details and trailer validation for a multifile?
My requirement: the header contains the business date, the details contain the records, and the
trailer contains the total number of records in the detail. 1) Header, detail and trailer separation with
an indicator field (conditional DML with the indicator field). 2) Compare the header with the
business date; if they do not match, fail. (using...(OR)
In a Reformat, use output index to separate the header, trailer and data. Based on it, in
transform 0 write an xfr that matches the business date, using a lookup expression for that
use(OR)
Follow the steps: 1) Define a param in the graph and store the record count of the
multifile in it. 2) Create your graph like this: file --> Filter having
next_in_sequence() == 1 --> Gather --> dedup{} w...
82. Ab Initio dependency analysis (Which parameter must be enabled to turn on the
dependency analysis feature?)
Changes to the above answer ..... In the GDE choose File > Preferences to open
the Preferences dialog. Click the Advanced tab, select the Documentation
Transforms checkbox and click OK. To set th...(OR)
On the GDE menu bar, select File > Preferences.
In the Parameters group of the GDE Preferences dialog, select Document
Function.
Open the Properties dialog; the Document Function parameter should be visible.
83. How can we use a function defined in $ai_xfr as PDL without adding
the path of the .xfr file in ab_air_defs?
Give the path of the .xfr file in AB_DML_DEFS as a graph input value parameter. (OR)
You can include the transform in test. transer using AB_DML_DEFS in Input
Value parameter of Graph..
84. Wrapper script (How do you create a wrapper script in Ab Initio?)
Wrapper scripts are used to run graphs from the command prompt.
Example:
abc.sh
Within abc.sh, we call the graph's ksh like
$/xyz.ksh -param1 -param2.....

Then we use the wrapper script (abc.sh) to run the graph from the command
prompt (UNIX) using
./abc.sh

I hope this will help you.
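The wrapper idea (call the deployed scripts one after another and look at each exit status) can be sketched in Python. This is a minimal sketch under assumptions: the commands are placeholders built with the Python interpreter itself, standing in for deployed .ksh scripts, and the helper names are hypothetical.

```python
import subprocess
import sys


def python_command(code):
    """Build a placeholder command; in practice this would be a deployed .ksh script."""
    return [sys.executable, "-c", code]


def run_in_order(commands):
    """Run each command in turn; stop at the first non-zero exit status and return it."""
    for command in commands:
        status = subprocess.run(command).returncode
        if status != 0:
            # A later job should not start if an earlier one failed.
            return status
    return 0
```

Stopping at the first failure keeps a downstream job from running on incomplete output, which is the usual reason for putting the ordering in a wrapper rather than scheduling each script separately.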


85. Ab Initio: display records between 50-75
In the input dataset I have 100 records. I want the records between 50 and 75 and I
don't want to read 5th record? Which component do I have to use?
Answered by: pramod on: Sep 21st, 2012
Use the LEADING RECORDS component
with the condition
next_in_sequence()>=50 && next_in_sequence()<=75
Answered by: harpreet singh on: Sep 20th, 2012
We cannot use the next_in_sequence() function more than once in a transform, so
next_in_sequence()>=50 && next_in_sequence()<=75 won't give the correct answer.

One way to do it:

Use a Reformat, add another field record_number and assign next_in_sequence()
to it. Then in Filter by Expression specify record_number >= 50 &&
record_number <= 75.
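The two-step approach in the second answer (number every record once, then filter on the stored number) looks like this in a minimal Python sketch. The enumerate call plays the role of assigning next_in_sequence() to a record_number field; the function name and default bounds are assumptions.

```python
def records_in_range(records, first=50, last=75):
    """Number each record once, then keep those whose number lies in [first, last]."""
    numbered = enumerate(records, start=1)  # one sequence number per record
    return [record for number, record in numbered if first <= number <= last]
```

Each record gets exactly one number, so both comparisons test the same value; that is the property the single-expression version with two next_in_sequence() calls lacks.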
86. What is the syntax of the m_dump command?
Answered by: Ashutosh Prasad on: Sep 8th, 2012
m_dump is an Ab Initio utility for viewing the content of a file (serial or MFS) in a formatted
way.
Syntax:
m_dump
There are other options such as:
-select
-no-print-data
-print-data
-start # -end #
-record
etc.
m_dump [] [ ...]
87. Using partition components?
answers:-I have 1000 records in the in0 file. I want the first 250 records in the out0 file, the next
250 (251 to 500) records in the out1 file, the next 250 (501 to 750) records in the out2 file, and the last
250 (751 to 1000) records in the out3 file, using partition components.
Answered by: Debasmita on: Aug 21st, 2012
Use output_index in the Reformat component and set count to 4 for four outputs.
Answered by: vSudheer on: Mar 2nd, 2012
You can use Partition by Percentage and specify 25% for each port,
or you can use Partition by Range and provide the range of keys for each port.
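The arithmetic behind the output-index approach can be sketched as follows. This is a Python illustration, not DML; it assumes records are numbered from 1 and that the function name `output_index` is only borrowed from the answer for readability.

```python
def output_index(record_number: int, chunk_size: int = 250) -> int:
    """Map a 1-based record number to a 0-based output port index."""
    return (record_number - 1) // chunk_size


outputs = [[] for _ in range(4)]          # stand-ins for out0 .. out3
for record_number in range(1, 1001):      # 1000 sample record numbers
    outputs[output_index(record_number)].append(record_number)

print([len(port) for port in outputs])
```

Record 250 still lands on out0 and record 251 is the first one on out1, which is why the subtraction of 1 is needed before the integer division.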
88. How do you find out which partition a particular record goes to when using partition by key?
answers:-I have 400 records in my in0 file, and it's a 4-way multifile system. How can I find
out which partition the 150th record will go to using partition by key?
Answered by: RAM on: Aug 17th, 2012
There is no guarantee that a particular partition will get a particular record, but we can
say one thing: suppose we have records like 1,1,1,2,2,3,4,4,5,5,6,7,7,8,9,10 in
this records reach to first p...
Answered by: karan_88 on: Jun 27th, 2012
@manishekhar: even if the key is unique, you still can't say which partition it will go
to...
89. How to reuse a transform function in another component?
Asked By: Emanisreddy | Asked On: Apr 19th, 2012
2 answers
Answered by: RAM on: Aug 17th, 2012
By using a common .xfr file and the include option.
Answered by: mangal on: May 17th, 2012
We have to save the Reformat transformations as an .xfr file, which can be reused across
graphs.
90. Difference between m_mkfs and m_touch?
2 answers
What is the difference between m_mkfs and m_touch? Both create an MFS, so how do
they differ? Also, what is the syntax for creating an MFS using m_touch?
(OR)
m_mkfs creates a multifile system.
m_touch creates an empty multifile.
Answered by: mukesh on: Jul 9th, 2012
m_mkfs creates a multifile system.
m_touch creates just a multifile.
91. What is the latest version that is available in ab-initio?
Ans:- Latest version of Co>operating system is 3-1-2
92. Creating a dynamic .pset
3 answers
Hi guys, I have a requirement: we need to design a graph for the transition of some
files. The details, like filename and parameter_id (unique), need to come from a table
in the DB, and these should be included in the graph pset one at a time. That means
the graph should run once for every unique parameter_id. like...
(OR)
You can use the air commands to create a pset for the graph and add parameters to
the pset. These commands can be executed in the wrapper script inside a while
loop.
Example/syntax:
air sandbox pset graph1.pset -prototype $PROJECT_DIR/mp/mygraph.mp
air sandbox parameter -path
(OR)
What I understood from your question is: you have a pset with the parameters filename and
parameter_id (unique), and you need to populate these parameters with the values
present in a table. assuming ur table hav...
93. Broadcasting partition
1 answer:-Based on which parameter does Broadcast distribute the records to
the outputs?
(OR)
There is no such parameter in Broadcast. It simply combines all the records from its input
flow partitions into a single flow and copies that flow to each of its output flow partitions.
94. Can anyone please explain, with an example, how the m_mkfs command
works to create an MFS? What is its syntax?
Hi, in Ab Initio we can create an MFS with this command: press F8 and type m_mkfs
pathnamefilename
(OR)
Syntax: m_mkfs <name of the control partition> <URL of the first partition> <URL
of the second partition> <URL of the third partition>....... Eg: m_mkfs controlpartion
c:abin...
95. How to get the DML using utilities in UNIX?
4 answers:-We can use the gendml command to generate the DML file. Please see the
syntax of the command below:
m_db gendml ur_dbc_file -table -select sql statement > dml file name
Hope this helps :-) :-)
(OR)
96. Anybody plz reply that answer?
Ans:- ?
96. 3-way join component?
1 answer:-Hi, I have 3 files, each having 2 columns, e.g.
file1 (empid deptid): 1 100, 2 200, 3 200, 4 100, 5 300
file2 (empid salary): 1 10, 2 20, 3 30, 4 40, 5 ...
(OR)
1. Perform an inner join on file1 and file2.
2. Sort on deptid, then do a Rollup.
3. Perform an inner join on the Rollup output and file3.
4. Finally you will get the required result.
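The join, rollup, join sequence in the steps above can be sketched in Python to show the data flow. This is not Ab Initio code. The file1 rows come from the question; the last salary in file2 and all of file3 are truncated in the question, so those values are made up for illustration.

```python
from collections import defaultdict

file1 = [(1, 100), (2, 200), (3, 200), (4, 100), (5, 300)]   # empid, deptid
file2 = [(1, 10), (2, 20), (3, 30), (4, 40), (5, 50)]        # empid, salary (50 is assumed)
file3 = [(100, "SALES"), (200, "HR"), (300, "IT")]           # deptid, deptname (assumed)

# Step 1: inner join file1 and file2 on empid.
salary_by_emp = dict(file2)
joined = [(deptid, salary_by_emp[empid])
          for empid, deptid in file1 if empid in salary_by_emp]

# Step 2: roll up salary per deptid (a dict makes an explicit sort unnecessary here).
total_by_dept = defaultdict(int)
for deptid, salary in joined:
    total_by_dept[deptid] += salary

# Step 3: inner join the rollup output with file3 on deptid.
result = [(deptname, total_by_dept[deptid])
          for deptid, deptname in file3 if deptid in total_by_dept]
print(result)
```

Rolling up before the second join keeps that join small: it sees one record per department instead of one per employee.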
97. What are $ai parameters in abinitio?
1 answer:- Answered by: Sachin on: Apr 26th, 2012
$AI parameters are sandbox level parameters like $AI_XFR, $AI_XML
98. Why go for Sort within Groups?
9 answers:-We have the Sort and Sort within Groups components. We can achieve the Sort
within Groups functionality by placing two keys in Sort. Then why do we have to go for
Sort within Groups?
(OR)
Sort within Groups sorts the data on a major and a minor key in a single pass.

The Sort component sorts the data on a single key at a time. If we have to sort on some
other key again, we have to use the Sort component again, which takes extra time.
In that case Sort within Groups is the best way.
(OR)
Use Sort within Groups only if records within a group are processed before being
sorted on a minor key i.e. if processing "A" needs records to be sorted on field
{"1"} and later in...
99. Please let me know whether we have Ab Initio GDE version 1.14, and what are
the latest GDE version and Co>Op version?
6 answers
The latest GDE version is 3.0.5 and the latest Co>Op version is 3.0.4.
(OR)
The latest GDE version is now 1.15.6, with Co>Operating System version 2.14.
100. Find the size of an MFS file?
13 answers:-How do you find the size of an MFS file?
(OR)
Use m_ls -ltr file name; you will get the record count and also its size.
(OR)
Using m_expand you can find the number of partitions of the file.
101. How to recover a control file that was removed using rm?
1 answer:-If we do rm filename by mistake instead of m_rm filename and want to
recover that control file, how do we do it? Can it be done using a checkpoint? If yes, how?
And is there another way?
(OR)
If the data files are present, simply do m_touch control file name.
102. What is skew and skew measurement?
8 answers:-Answered by: vSudheer on: Mar 2nd, 2012
Statistically, skew represents the distribution of data. When all partitions share an equal
amount of data, partitioning is being used at its best. This can be achieved by Partition
by Round-robin or by using equ...
(OR)
Skew is the measure of the data flow on a particular partition. Take an example,
4-way partitioned:
flow 1: 200 recs
flow 2: 600 recs
flow 3: 400 recs
flow 4: 800 recs
Take average = (200+600+400+800)/4 =...
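The answer above is cut off before it shows a formula. One commonly quoted definition is skew = (partition size - average size) / size of the largest partition, expressed as a percentage. The Python sketch below assumes that definition (it is not confirmed by the truncated answer) and applies it to the four flows from the example.

```python
def skew_percentages(partition_sizes):
    """Per-partition skew in percent, using the assumed formula
    (size - average) / largest * 100. Expects a non-empty list
    with at least one non-empty partition."""
    average = sum(partition_sizes) / len(partition_sizes)
    largest = max(partition_sizes)
    return [100.0 * (size - average) / largest for size in partition_sizes]


sizes = [200, 600, 400, 800]              # record counts from the example above
average = sum(sizes) / len(sizes)
print(average)
print(skew_percentages(sizes))
```

A perfectly balanced layout gives 0 for every partition; the further a value is from 0, the more unevenly that partition is loaded.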
103. How do you insert a break in a .dat output file using Ab Initio?
2 answers:-I have already created the graph and need to modify it so that the output file gets
page breaks. Any idea how we can implement page breaks in Ab Initio?
(OR)
As far as I know there are no page breaks in files. (Does anyone know if there are .doc or
.pdf creation components in AI?)

If you are trying to insert a line break, just insert a blank where you need it.
104. What do you mean by break in output file? Please elaborate.
Informatica source creation from abinitio dml
Hi, can we create an Informatica source by using the Ab Initio DML directly
rather than manually creating the source? It would be just like the way I tried creating an
Informatica source from an Informatica target by making the respective changes in
the XML structure and importing it.
105. Speedy process of verification?
What are the important concepts required for speedy process of verification. . .?
106. I am working on Informatica. I am interested in learning Ab Initio. Please
help me find some material to get started. I don't understand
where to start... Please guide me. I have got abinitio gde...
14 answers:-The "version it" institute is not teaching an Ab Initio course;
By jakirajam
(OR)
Hello, 2.18.32 is the new version of the Co>Op and 1.18 is the new version of the GDE.
Ok keep it up .
107. XML reformat?
1 answer:-Hi, can anyone explain how XML Reformat is used to generate an XML
file? Please let me know with one example. Suppose we have three records in the input
file as below:
xxx,001,company_1
yyy,008,company_2
zzz,002,company_3
ppp,006,company_2
where xxx - name, 001 - emp id, company_1 - company name. can anyone help...
(OR)
Hi, let the input file data be as below.
PPP,006,Company_2
XXX,001,Company_1
YYY,008,Company_2
zzz,002,Company_3
Let the record format for reading this data be as below.
record
string(",") Name;
deci...
108. What is the use of dependency analysis?
1 answer:- Data Lineage
109. Metadata level programming?
How can we do metadata level programming in dml itself to remove blank lines
from a file
110. What is the relation between the EME, the GDE and the Co>Operating System?
6 answers:-Answered by: girish.v on: Dec 19th, 2010
GDE is the Graphical Development Environment, where the user creates graphs (it
has a GUI and is installed on Windows). The Co>Op system is used for
running the developed graphs either in Unix or in Wind...
(OR)
EME is the Enterprise Meta Environment, which is a repository (stores data). GDE is the
Graphical Development Environment, which is the user's work area (develops graphs),
and the Co>Op is used to run the developed ga...
111. What is the use of Aggregate when we have Rollup? As we know, the Rollup
component in Ab Initio is used to summarize groups of data records. Then where
will we use Aggregate?
15 answers:-Aggregate and Rollup both generate records that summarize groups of
records, but Rollup provides more control over record selection, grouping, and
aggregation.
(OR)
Rollup is a multistage component. It works in a text mode and a package mode.
In package mode the user has to develop their own logic. In text mode it works as an
Aggregate component.
It is one type of facility in Rollup.
112. How do you improve the performance of a graph?
6 answers:-Adding to all the points mentioned by others: the rule of thumb is to prefer a
component solution to a problem over a logical solution. Use Oracle hints and Ab
Initio hints (ABLOCAL utility for lower ora...
(OR)
I would like to add a couple more points to the explanations already given: 1. Try
to use lookups instead of a join if you have a huge no. of records in one
flow and relatively less no. of rec...
113. Explain how the graph can be aborted at runtime from within the xfr.
1 answer:-The force_error() function can be used within the xfr to abort the graph.
Remember to set the transform component to 'Abort on first reject'.
Also, specific error messages can be captured from the error port (Never abort).
114. Unit testing?
2 answers
How do you perform unit testing?
(OR)
Divide the graph into its main components that have the required transformations and do
data validation for each of them separately.
(OR)

Test the developed graphs with sample data (ie. our sample data).
115. What is mpjret?
1 answer:-
What is mpjret? Explain the uses of mpjret. How do you use mpjret?
(OR)
The variable $mpjret holds the return status of an Ab Initio graph.
echo $mpjret will tell you whether the graph succeeded or failed.
0 means success and any other number means failure.
116. Hi! I have worked on DataStage and Informatica. They have an option called
report with which we can generate a report of the complete job, right from components to code.
I want to know whether Ab Initio has a job report facility.
2 answers:-There is a log port on every component. You can use this port to
generate reports.
(OR)
There is no option to prepare a report, but if you go to
View>Tracking Details in the GDE you can clearly see how much CPU time, memory, etc.
each component is using.
117. Reformat component?
1 answer:-
What is the use of the index in the Reformat component?
(OR)
output_indexes in Reformat works like a Partition component.
118. Most common runtime errors?
1 answer:-Give some runtime errors that commonly occur during your graph
execution.
(OR)
Never abort
119. Ab Initio partition component?
2 answers:-What is the optimistic partitioning component in Ab Initio? How can we say
that it is an optimistic partition component?
(OR)
Partition by Round-robin is the optimistic component; looking at the data skew, we can
say that it is the optimistic component.
(OR)
Round Robin is the optimistic partition.
120. Size of the file?
1 answer:-Which component will be used to reduce the size of the file?
(OR)
You can use the Compress or Deflate component to reduce the size of the file.
121. Loading 1 million records?
2 answers
If a graph fails in the middle of loading 1 million records into a target table, what is the
alternative solution? I.e., will you run the graph again? (The record count is very
huge.)
(OR)
We can commit intermediate results in the target table by creating a commit table
in API mode. When we rerun the graph, it will skip over the previously committed
records. Use m_db creat...
(OR)
Yes, we need to run the graph again. 1) If we use a checkpoint in the graph, it
will recover the data from that point. 2) If we set the rows_per_commit parameter
on the target table (the committed data = rows_per_commit), then the data will be safe
in the target table.
122. What is the function you would use to convert a string into a decimal?
8 answers:-If the string contains alphanumeric data, decimal_strip will remove the
alphanumeric data, but decimal(length) will not remove alphanumeric data. Please let
me know if I am wrong.
(OR)
decimal_lrepad(string name, decimal data type length)
decimal_lpad(string name, decimal data type length)

123. What is the difference between a db config and a cfg file?
6 answers:-The .dbc file contains all the connectivity info the user needs to connect
to the database. dbc means database configuration file. It will contain the server
name userid password type of db stuffs l...
(OR)
The .dbc file basically helps the Ab Initio Co>Op connect to the database. It includes all the
required database parameters and their default values, which you can
modify. However .cfg is file is u...
124. force_error function?
5 answers:-How does the force_error function work? If we set Never Abort in a Reformat,
will force_error stop the graph, or will it continue to process the next set of records?
(OR)
It will not stop the execution of the graph; it will continue with the next records.
It is used especially to send data to the error port, with the error message given in this
function, when a record does not meet the specified condition. To abort
the graph use force_abort function.
(OR)
The force_error() function will not stop the graph; it will write the error message to the
error port for that record and process the next record.
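The "reject the record, keep going" behaviour described in these answers can be modelled in a few lines of Python. This is only an analogy: `ForcedError`, `transform` and `run_never_abort` are made-up names, and the record layout is invented for the example.

```python
class ForcedError(Exception):
    """Stand-in for the error raised by a force_error()-style call."""


def transform(record: dict) -> dict:
    # Hypothetical rule: negative amounts are rejected with a message.
    if record["amount"] < 0:
        raise ForcedError(f"negative amount for id {record['id']}")
    return {"id": record["id"], "amount_cents": record["amount"] * 100}


def run_never_abort(records):
    """Process every record; failures go to reject/error lists instead of stopping."""
    out, rejects, errors = [], [], []
    for record in records:
        try:
            out.append(transform(record))
        except ForcedError as exc:
            rejects.append(record)      # the offending record (reject port)
            errors.append(str(exc))     # the message (error port)
    return out, rejects, errors


sample = [{"id": 1, "amount": 5}, {"id": 2, "amount": -1}, {"id": 3, "amount": 7}]
out, rejects, errors = run_never_abort(sample)
print(len(out), rejects, errors)
```

With an abort-on-first-reject setting, the equivalent would be to let the exception propagate out of the loop at record 2 instead of catching it.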
125. Explain the differences between API and utility mode?
6 answers:-API mode is more for diagnostic purposes, for retrieving data from the DB,
such as selecting particular fields or retrieving the data according to
constraints. Whenever we select...
(OR)
In API mode the data processing (load/update/insert/delete) is slow; however, other
processes can access the database tables during the update. Compared to the above,
Utility mode processing(load/update/insert...
126. Compilation errors?
2 answers:-What are the compilation errors you came across while executing your
graphs?
(OR)
If a fan-in is used on the input flow of a Rollup component, you will get a compilation
error, even though sort required is set to false on the Rollup.
Correct me if I'm wrong.
(OR)
Scenario 1: When you build a graph with Ab Initio components but the output
DML of one component is not the same as the input port DML of the next component, you will get
compilation errors. Scenario 2: Suppose...
127. Explain what a lookup is.
2 answers:-A lookup file consists of data records that can be held in main memory.
A lookup is basically a specific dataset which is keyed. This lets the transform function
retrieve the records much faster th...
(OR)
A lookup is basically a specific dataset which is keyed. It can be used for
mapping values based on the data present in a particular file (serial or multifile). The
dataset can be static as well as dynamic ( ...
128. Range lookup?
1 answer:-What is a range lookup?
It returns the first data record that matches the range indicated by the lower
and upper bound arguments.
129. What is the difference between a Scan component and a Rollup
component?
4 answers:-
The finalize function of the Scan component is called for each record in a group,
whereas for the Rollup component it is called only once per group.
(OR)
Rollup generates data records that summarize groups of data records. Rollup
can be used to perform data aggregation such as sum, avg, max, min, etc. The Scan
component generates a series of cumulative summary records, such as successive
year-to-date totals, for groups of data records.
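The difference is easy to see on a tiny dataset. The Python sketch below only mimics the two behaviours (one output per group versus one output per input record); the function names and the sample rows are made up, and the input is assumed to be sorted on the key.

```python
from itertools import accumulate, groupby
from operator import itemgetter

# (key, amount) rows, already sorted on key
rows = [("A", 10), ("A", 20), ("B", 5), ("B", 15), ("B", 30)]


def rollup(sorted_rows):
    """One summary record per key group (here: the group total)."""
    return [(key, sum(amount for _, amount in group))
            for key, group in groupby(sorted_rows, key=itemgetter(0))]


def scan(sorted_rows):
    """One running-total record per input record, restarting at each new key."""
    result = []
    for key, group in groupby(sorted_rows, key=itemgetter(0)):
        for running_total in accumulate(amount for _, amount in group):
            result.append((key, running_total))
    return result


print(rollup(rows))
print(scan(rows))
```

Note that the last scan record of each group equals that group's rollup record, which matches the "finalize once per group versus once per record" description above.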
130. What is fan-in and fan-out in AI (what's the difference, and explain the
usage)?
3 answers:-
A fan-in flow is mainly used for sending data from partitions to a single file
(departitioning), and vice versa a fan-out flow is used for partitioning.
(OR)
Fan-in: a fan-in flow connects a component with a greater depth of parallelism to
one with a lesser depth, that is, it follows a many-to-one pattern. Example: a departition
component.

Fan-out: fan-out is the opposite of fan-in. Example: a partition component.
131. Changing to MFS (How do you change a serial file into an MFS? Explain)?
4 answers:-Go to the run command or press F8. You will get a text box; in it, type
m_mkfs [path0] [path1].. This creates the multifile system. After that, create a file
in that path by using m_touch [path0]...
(OR)
1. Make sure you have a multifile system in your Ab Initio environment.
2. Then use a partition component to distribute your records to the MF.
132. Cumulative summary in Ab Initio?
5 answers:-
How can I achieve a cumulative summary in Ab Initio other than by using the Scan
component? Is there any built-in function available for that?
(OR)
We can use the Rollup component.
(OR)
Other than Scan, we can use Rollup to do the cumulative summary.
133. Ab Initio graphs (How will you test the Ab Initio graphs)?
134. Ab_constructs
What are ab_constructs? What is their use?
134. Behaviour of components in different versions?
2 answers:-How will you ensure that components created in one version do not
malfunction or cease functioning in another version?
(OR)
Components should be compatible to run in previous versions of the GDE. The
deprecated components would run in new versions.
(OR)
The runtime behaviour of components remains the same in all versions unless a version
requires an additional parameter to be defined. The evolution
of new versions of the ETL tool comes with some changes in component-level parameters
(observation as of now).
135. Ab Initio lookup?
4 answers:-Can we process 1 GB of data (1 million records) by using a lookup? How?
(OR)

Processing 1 GB of data with a small lookup file is fine, but making the 1 GB of data
itself a lookup file is a mistake. Use MFS lookups instead if it is really needed.
(OR)
I would suggest using Join instead of Lookup for such voluminous data.
136. What's the best way in an AI graph to read the header (first line) and trailer
(last line) of an input file, compare record counts and checksum (available in the
header) and error out if actual recd count or checksum...
6 answers:-anujaja, use only one Dedup Sorted. In the Dedup Sorted, use the select expression
"next_in_sequence() > 1" and a keyless dedup key "{}", with the keep parameter set to "last". This
will remove the trailer, and you will get all the body records from the "dup" port.

(OR)
You can also use a keyless Dedup with the keep parameter set to first.
This puts the header on the output port and the rest of the records on the dup port. Then connect
the dup port to another Dedup and set the keep parameter to last; this removes the
trailer as well.
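The overall check (peel off the first and last lines, then compare the header's count with the number of body records) can be sketched in Python. The header layout "HDR|<count>" is purely an assumption for the example, the checksum comparison is left out, and the function names are made up.

```python
def split_file(lines):
    """Return (header, body, trailer) for a file with one header and one trailer line."""
    if len(lines) < 2:
        raise ValueError("file must contain at least a header and a trailer")
    return lines[0], lines[1:-1], lines[-1]


def validate_count(lines):
    """Raise ValueError if the count in the header does not match the body."""
    header, body, _trailer = split_file(lines)
    expected = int(header.split("|")[1])   # assumed layout: HDR|<record count>
    if expected != len(body):
        raise ValueError(f"header says {expected} records, found {len(body)}")
    return body


good = ["HDR|3", "a", "b", "c", "TRL"]
print(validate_count(good))
```

Raising an exception here plays the part of erroring out the graph when the counts disagree.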
137. Where do we keep the control file (control partition)?
2 answers:-Ab Initio multifiles reside in parallel directories called multidirectories,
which reside in multifile systems.
The multidirectories contain the control file (control partition) and the data directories.
(OR)
The control file should be present in the multifile directory; it contains the addresses of the serial
files.
138. Parameterization of MFS?
3 answers:-I want to use an Ab Initio parameter to control whether I am in serial or MFS
mode. Basically, I will have an $ai_mfs_parts variable. If it is set to 1, then we are
running serial. If it is set to 4, then we are running 4-way MFS. Will the data need to be
located in physically different locations...
(OR)
I think we can have two parameters, like AI_SERIAL and AI_MFS, to determine
whether we are in the serial location or the MFS location. This is the best way of finding out
whether the file is serial or MFS. Hope this helps.
(OR)
It's always better to use a multifile system instead of ad hoc
multifiles. So, physical allocation is a good idea for separating MFS and
serial. Say, i have a physical allocation to path1 for mfs and h...
139. What is the function of xfr in Ab Initio?
3 answers
It would be great if one of you could explain briefly what the function of an xfr is
(what it does, where it is stored, how it affects things), as I am very confused
about this. Thank you.
(OR)
An XFR is used for externalizing a transform function. A function that needs to be
reused in multiple places can be created as a common XFR instead of writing the
same code again and again.
(OR)
The XFR usually stores the mappings in the text view. Usually these mappings
are generalised. In the transform of any component these XFRs can be
called and Ab Initio will automatically do the s...
140. Data modelling?
What data modelling do you follow while loading data into tables? Also, does the DB
you are inserting the data into have a star schema or a snowflake schema?
141. What is a wrapper script in Ab Initio?
3 answers:-When you deploy an Ab Initio graph, it becomes a ksh script saved in the run
directory.
A wrapper is another Unix script that calls the deployed script mentioned above.
It might also be used to handle other functionality together with the execution of the graph.
(OR)
A wrapper script is a Unix script that is helpful for running graphs directly
from Unix and running them automatically. This is useful when we have Unix as
the operating system.
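Wrappers are normally written in ksh, but the core idea (call the deployed script, then act on its exit status, where 0 means success) is the same in any language. Here is a minimal Python sketch of that idea; `run_deployed_graph` is a made-up name, and a tiny Python one-liner stands in for the deployed script so that the example runs anywhere.

```python
import subprocess
import sys


def run_deployed_graph(command):
    """Run a command (e.g. a deployed graph script) and return its exit status."""
    completed = subprocess.run(command)
    return completed.returncode


# Stand-in for something like ["ksh", "./xyz.ksh"]; it simply exits with status 0.
stand_in_command = [sys.executable, "-c", "import sys; sys.exit(0)"]

status = run_deployed_graph(stand_in_command)
if status == 0:
    print("graph succeeded")
else:
    print(f"graph failed with status {status}")
```

A real wrapper would typically add logging, parameter handling and notification around this call.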
142. What command would you use to inspect the users currently logged into
a SQL Server database?
3 answers:-The SQL query below can serve the purpose (note that V$SESSION is an Oracle view, not a SQL Server one):
select username from V$SESSION;
(OR)
sp_who and sp_who2
143. Ab Initio MFS?
4 answers:-There is one multifile. While loading data, I deleted one partition of the multifile. Will it
show an error, or will it process the data?
(OR)
Yes, the above point is also valid: if the file does not exist, Ab Initio will throw an
error. If it is truncated, you will lose the data but Ab Initio will not throw an error.
(OR)
Did you delete the data in the partition or the partition itself?
If the data in a partition is deleted (or truncated), Ab Initio does not throw an error.
If a partition itself is deleted, Ab Initio will throw an error.
144. Describe which system/process elements you would review when
troubleshooting general server slowness.
1 answer:-
1) Space
2) Server utilisation
3) Any running script that is taking a lot of time
145. Join on partitioned flow?
7 answers:-If I have 2 files containing the fields file1(a,b,c) and file2(a,b,d), and we partition
both files on key a using Partition by Key and pass the output to a Join component,
will it join or not if the join key is (a,b), and why?
(OR)
The partition key and join key do NOT have to be exactly the same. In order to join
properly, you just have to make sure the records being compared are in the same
partition. So if the parti...
(OR)
I do not think the join output would be correct. The partition key fields for the two
input streams should be the same as the join key fields in the Join component,
otherwise the data from stream 1 would ...
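The two answers disagree, so a small experiment helps. The Python sketch below supports the first answer for this particular case: records that agree on (a,b) necessarily agree on a, so partitioning on a puts every matching pair in the same partition. Python's built-in hash() stands in for the real partitioning function, and all data and helper names are made up.

```python
from collections import defaultdict


def partition_by_key(rows, key_fn, ways):
    """Distribute rows over `ways` partitions by hashing the key."""
    parts = [[] for _ in range(ways)]
    for row in rows:
        parts[hash(key_fn(row)) % ways].append(row)
    return parts


def inner_join(left, right, key_fn):
    """Plain inner join on key_fn, returning (left_row, right_row) pairs."""
    index = defaultdict(list)
    for r in right:
        index[key_fn(r)].append(r)
    return [(l, r) for l in left for r in index.get(key_fn(l), [])]


file1 = [(1, "x", "c1"), (1, "y", "c2"), (2, "x", "c3"), (3, "z", "c4")]   # a, b, c
file2 = [(1, "x", "d1"), (2, "x", "d2"), (2, "q", "d3"), (3, "z", "d4")]   # a, b, d

key_a = lambda row: row[0]
key_ab = lambda row: (row[0], row[1])

parts1 = partition_by_key(file1, key_a, ways=2)
parts2 = partition_by_key(file2, key_a, ways=2)

# Join each pair of partitions on (a, b), then compare with a serial join.
partitioned = sorted(pair for p1, p2 in zip(parts1, parts2)
                     for pair in inner_join(p1, p2, key_ab))
serial = sorted(inner_join(file1, file2, key_ab))
print(partitioned == serial, len(serial))
```

The reverse does not hold: partitioning on (a,b) and joining on a alone could separate records that should match.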
146. What is the name of the SQL language used in SQL server stored
procedures?
4 answers:-The name of the SQL language is T-SQL (Transact-SQL)
(OR)
SQL stands for Structured Query Language.
147. What is the lookup function used to retrieve particular duplicate
data records in the lookup file?
3 answers:-Use lookup_count to find the duplicates and lookup_next to retrieve
them.
If lookup_count (string file_label, [ expression [ , expression ... ] ] )>0
lookup_next ( lookup_identifier_type lookup_id, string lookup_template )
Njoi!!
Abhi - fresh as dew!
(OR)
The lookup_next function is used for retrieving the duplicate records for that particular
record.
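The count-then-iterate pattern can be pictured with a keyed index that stores a list of records per key. The Python class below is only an analogy; `KeyedLookup`, `count` and `matches` are made-up names and do not correspond to Ab Initio signatures, and the sample records are invented.

```python
from collections import defaultdict


class KeyedLookup:
    """In-memory keyed dataset that keeps every record for a key, duplicates included."""

    def __init__(self, records, key_field):
        self._index = defaultdict(list)
        for record in records:
            self._index[record[key_field]].append(record)

    def count(self, key):
        """How many records share this key (the role lookup_count plays)."""
        return len(self._index.get(key, []))

    def matches(self, key):
        """All records for the key, in file order (what repeated next-calls would walk)."""
        return list(self._index.get(key, []))


customers = [
    {"cust_id": 1, "city": "Pune"},
    {"cust_id": 2, "city": "Delhi"},
    {"cust_id": 1, "city": "Mumbai"},
]
lookup = KeyedLookup(customers, "cust_id")
if lookup.count(1) > 0:
    print(lookup.matches(1))
```

Checking the count first mirrors the `lookup_count(...) > 0` guard in the answer above.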
148. What is the difference between the flows of the 3 parallelisms?
4 answers:-I think I misunderstood the question. There are 3 types of parallelism in Ab
Initio; the details of the 3 types are explained correctly above, along with the differences.
(OR)
Ab Initio supports parallelism. There are 3 types of parallelism: 1. Component parallelism:
graph running the data simultaneously on separate data uses. every component
graph creates one component process ..this is ...
149. In a process which reads 15,000 rows from one table and either updates
or inserts them into another table, and given the scenario where the process
may abnormally terminate, describe how you track progress...?
1 answer:-The following are some possible strategies for dealing with this problem: 1)
If a job does a bulk load into a table and you know the table was empty before the job
began, you can truncate the table be...
150. What are cartesian joins?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
5 answers:-When there are duplicate keys in either or all of the joined datasets, the
result is a cartesian product. This is termed a Cartesian join in Ab Initio. Eg: If there are
two input datasets for a join func...
(OR)
Joining two tables without a join key, that is, a cross product.
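Both answers can be illustrated with a few made-up rows in Python: duplicate keys multiply within a key group, while a keyless join multiplies everything.

```python
from itertools import product

left = [("k1", "L1"), ("k1", "L2"), ("k2", "L3")]     # key, payload (made up)
right = [("k1", "R1"), ("k1", "R2"), ("k1", "R3")]


def join_on_key(left_rows, right_rows):
    """Inner join on the first field; duplicate keys on both sides multiply."""
    return [(l, r) for l, r in product(left_rows, right_rows) if l[0] == r[0]]


keyed = join_on_key(left, right)        # 2 left "k1" rows x 3 right "k1" rows
keyless = list(product(left, right))    # every left row paired with every right row

print(len(keyed), len(keyless))
```

The keyed join grows with the product of the duplicate counts per key, which is why unexpected duplicates in a join key can inflate the output so quickly.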
151. Phase versus checkpoint?
6 answers:-The difference between a phase and a checkpoint, as far as I know, is something
related to how the temporary files containing the data landed to disk are handled. That is,
phases are used to break up a graph so that it does not use up all the memory; it reduces
the no of component running in parallel hence improves...
(OR)
A phase breaks the graph into different blocks. It creates some temp files while
running and deletes them once it completes. A checkpoint is used for
recovery purposes. when the graph is interrupt...
(OR)
Above GDE version 1.8, checkpointing is automatic. Phasing means that when you design the
graph, it should not have more than 20 components per phase. In a particular phase, only the
components in that phase will run. Chec...
152. How to handle a DML that changes dynamically in Ab Initio?
4 answers:-Define the parameter using Edit Parameters and make it a formal parameter,
that is, a run-time parameter.
Then reference it in the DML path, e.g. $AI_DML/abc_$parameter1.dml
Depending on the value of the parameter, that particular DML will be used.
Answered by: Guru on: Sep 4th, 2007
If the DML changes dynamically in the source, we can handle it using "conditional DML".
But how can it be achieved through "parameterisation" and using "metadata"? Please
explain.
153. How to run the graph without the GDE?
6 answers:-
From UNIX, go to the pset folder in your sandbox, then type
air sandbox run abc.pset
(with whatever your pset's name is), or just call the graph's deployed .ksh script.
(OR)
Hi, it is possible in Unix. First you have to go to the directory where the
deployed script for that particular graph is saved, then execute the shell script by
using the following command.../.ksh
I hope it will help you. Regards, Mahendra
154. How to improve the performance of graphs in Ab Initio? Give some
examples or tips. Thanks.
5 answers:-You can improve the performance of the graph by using the parallelism
concept in your graph.
(OR)
Performance can be improved in several ways; I list some of those I
remember... 1. Use Sort after the partition component instead of before. 2. Partition
the data as early as possible and departiti...
155. One of the performance tuning techniques is keeping Join with DB to a
minimum. Why?
2 answers:-
Join with Database (Join with DB) should not be used if the number of records to
be joined with a table is very high, as it degrades performance. A better result
can be achieved by retrieving the data from...
(OR)
Performing the join in the database will improve performance a lot. Take the scenario of
joining two tables. Usually we take two database components and
place a Join component to perform a join be...
156. How might you quantitatively measure an improvement made to a
query?
1 answer:-By checking the execution plan of the query after each modification
and determining the cost of the query.
157. What is the purpose of having stored procedures in a database?
4 answers:-1. They are used to increase the performance of the application, because when
we create stored procedures, they are compiled and stored in the database. Later when
applications call them, they are general...
(OR)
A stored procedure is a pre-compiled execution block, so there is no need to wait
for compilation. It is called from the server. It reduces the execution time.
158. Ab Initio: merging graphs?
4 answers
How do we merge graphs in Ab Initio?
(OR)
Use a .plan in Ab Initio, in which we can encapsulate different graphs.
fresh as dew!!
(OR)
Any graph in Ab Initio is a separate and independent entity in itself. First of all,
there is no such concept as graph merging in Ab Initio, since it does not make any
sense, and two graphs that are s...
159. Indexes in Ab Initio?
Asked By: shylender | Asked On: Feb 10th, 2008
3 answers:-Which types of indexes are used in Ab Initio?
(OR)
There is no concept of indexing as such in Ab Initio, since it is a code-based ETL
tool. Indexes come into the picture on the database table side. But there is something
called "output indexing" in Ab Ini...
(OR)
I think we don't have any concept of indexes in Ab Initio; it exists only in
databases, as table indexing to improve the query performance of the tables.
In Ab Initio there is only the massively parallel processing concept.
160. Ab Initio interview questions compiled. Someone please answer them. It
will help all of us here. 1) What is the difference between dbc and cfg? When
do you use these two? 2) What are the compilation errors...?
1 answer:-I am sending some answers which I know. Hope this is helpful to you. 1 a)
cfg is the database table configuration file, dbc is the database configuration file. 2.1
database component have the ...
161. Output for Sort and Dedup Sorted with a null key?
2 answers:-
I have a file containing 5 unique rows, and I am passing them through a Sort
component using a null key and passing the output of the Sort to Dedup Sorted. What will
happen, and what will the output be?
(OR)
If no key is used in the Sort component, the output of the Dedup Sorted
depends on the keep parameter. If it's set to first, the output would have
only the first record,if its set to las...
(OR)
I think you need to specify the option for whether you need to keep the
first, last or unique records. As far as I know there won't be any change in the
records , it will exact the same a...
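The two answers above disagree. The Python sketch below models the first answer's reading, under the assumption that an empty key makes all records one single group and that the keep options are named first, last and unique-only. It is an illustration of that reading, not a statement of what the component does.

```python
def dedup_keyless(records, keep):
    """Dedup with an empty key: every record falls into the same single group."""
    if not records:
        return []
    if keep == "first":
        return [records[0]]
    if keep == "last":
        return [records[-1]]
    if keep == "unique-only":
        # A group survives only if it contains exactly one record.
        return list(records) if len(records) == 1 else []
    raise ValueError(f"unknown keep option: {keep}")


rows = ["r1", "r2", "r3", "r4", "r5"]    # 5 sample rows, as in the question
print(dedup_keyless(rows, "first"))
print(dedup_keyless(rows, "last"))
print(dedup_keyless(rows, "unique-only"))
```

Under this reading, which row counts as "first" depends only on the order the rows arrive in, since a sort on an empty key has nothing to reorder by.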
162. What are the different schedulers used for running Ab Initio graphs?
3 answers:-
You can do anything in Ab Initio; it depends on the logic you use. Say there are 10
jobs 1,2,3,4,5 .. 10 and all the graphs are deployed as scripts. Suppose graphs 1-4
should run at the same time and sequenti...
(OR)
In Ab Initio there are no specific scheduling options as in Informatica. What we
can do is use cron jobs, as in Unix.
163. Under what circumstances should a join be used instead of a lookup?
5 answers:-If you have to perform a full outer join, then you should go for a join, not a
lookup.
(OR)
A lookup should be used only to speed up graph execution, but there is a
catch: we should use a lookup only if the lookup file is small, because when
the graph start executing it will load whole ...
164. What are the main paths and structure of Ab Initio?
2 answers:-
The main paths in the Ab Initio environment are AB_HOME, which specifies the directory
where the Co>Operating System is installed. This need only be set for those hosts
for which the installation directory is diff...
(OR)

I think the main things in the above are AB_HOME, AB_AIR_ROOT, then sandboxes
and the standard environment. You need to be specific about what you want in this question.
165. What is the EME?
3 answers:-As explained by Abishek, the EME is the data repository (datastore's data
files) as well as the version control component of the Ab Initio software. It keeps track of
objects such as graph, dml, xfr, ksh and sq...
(OR)
EME stands for Enterprise Meta Environment. It is like a repository
in Informatica. It contains all the metadata, i.e. data related to graphs, dml, xfr, etc.
It performs the following func...
166.Memory requirement of graph?
2 answers:-How can I calculate the total memory requirement of a graph?
(OR)
When we did some benchmarking we found that it was closer to ~8MB +
MAX_CORE (if any) per component and add in the file sizes.Basically, I agree
with the existing answer.Just remember, don't use &...
(OR)
You can roughly calculate the memory requirement as:
1. Each partition of a component uses: ~7 MB + max-core (if any)
2. Add size of lookup files used in phase (if multiple compon...
167. Have you used the Rollup component? Describe how.
3 answers:-
Rollup summarises groups of data records based on a key. E.g. if you want the
income of a particular city from a set of records, you can use the Rollup
component. Example dml ...
(OR)
Rollup summarizes groups of data records. For example, if we want the maximum value
of cust_id, we will use Rollup.
168. Ab Initio password?
4 answers:-
Will production also use the same password that was given in the .dbc file?
(OR)
The best way is to use environment variables to set the values in the .dbc file. This
gives you the flexibility to change any variable's value by changing the
environment variable.
(OR)
A good way of doing this is to parameterize the .dbc file. The parameters will have different
values for different environments.
169. How can we test Ab Initio manually and with automation?
Asked By: satish kkumar | Asked On: Jun 24th, 2006
2 answers
Answered by: lakshmipavany.g on: Jun 25th, 2008
Yes, that's correct. There is no automated testing available for Ab Initio.
You need to test Ab Initio graphs manually using the Validate category of components.
(OR)
Hi Sathish, I believe Ab Initio testing has to be carried out manually only; no automation is
available as of now. we nee...
170. What is the difference between sandbox and EME? Can we perform
check-in and check-out through the sandbox? Can anybody explain check-in and
check-out? Thanks in advance, satish kumar
3 answers:-Hi, EME is like an all-in-one guide, whereas a sandbox is an individual
guide restricted to an individual user. Check-in and check-out are the procedures to
send/extract graphs from sandbox to EME....
(OR)
The project is under source control in the EME. You can't do any manipulations
on the project which is in the EME. So if you want to do any manipulations you have
to check out the project into your sandbo...
171. What are the most commonly used components in an Ab Initio graph? Can
anybody give me a practical example of a transformation of data, say
customer data in a credit card company into meaningful output based...?
3 answers
1. reformat
2. join
3. filter by expression
4. sort
5. rollup
6. partition components.
(OR)
Hi Gayathri, we cannot say outright that only certain components have to be
used; it depends on the business requirement and functionality. Most frequently we use
FBE,REFORMAT,ROLLUP,LOOKUP(LOOKUP_LO...
172. What is data skew? How can you eliminate data skew while I am using
partition by key?
3 answers:-
The skew of a data or flow partition is the amount by which its size deviates from
the average partition size, expressed as a percentage of the largest partition:

Skew of data =(partition size - avg.partition size)*100/(size of largest partition)
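As a quick worked example of the formula above, a small Python sketch (the function name partition_skew and the sample sizes are made up for illustration):

```python
def partition_skew(sizes):
    """Return the skew (%) of each partition: (size - average) * 100 / largest."""
    if not sizes or max(sizes) == 0:
        raise ValueError("need at least one non-empty partition")
    average = sum(sizes) / len(sizes)
    largest = max(sizes)
    return [(size - average) * 100 / largest for size in sizes]

even = partition_skew([100, 100, 100, 100])    # evenly spread data
skewed = partition_skew([400, 100, 100, 200])  # one heavy partition
print(even)
print(skewed)
```

A well-distributed multifile gives values close to 0 for every partition; a large positive value marks the partition that is receiving too much data.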


(OR)
Data skew means the data is not evenly partitioned. Partition by key uses a hash function. Choosing a key that has the
most distinct values might help distribute the data evenly.
173. What is the DBCC command used for?
2 answers:-DBCC stands for Database Consistency Checker.
(OR)
These statements check the physical and logical consistency of a database. Many
DBCC statements can fix detected problems.
174. Explain the real-time environment of Ab Initio?
2 answers:-I believe continuous flow components support this to a certain extent.
(OR)
I doubt whether Ab Initio is used in real-time processing; the main use of Ab Initio is
massively parallel processing of huge chunks of data. You should be more specific
about your question.
174. What is the difference between rollup and scan?
Asked By: prakash | Asked On: Aug 9th, 2006
5 answers:-By using rollup we can generate aggregate values like sum, count, etc. Here
the number of input records is not equal to the number of output records, whereas by using scan we can
generate cumulative summaries of...
(OR)
Guys, Rollup: 1) Generally rollup is used for aggregation purposes such as
sum, avg, count, min, max, etc. 2) It can process grouped or ungrouped records. While
processing ungrouped records it maximises th...
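The difference is easy to see in a small Python sketch (an illustration only, not Ab Initio code; the function names rollup_sum and scan_sum and the sample data are assumptions). Rollup gives one output record per key, scan gives one output record per input record with a running total.

```python
from itertools import accumulate, groupby

def rollup_sum(records):
    """One output per key: the total amount (records must be grouped by key)."""
    return [(k, sum(amount for _, amount in g))
            for k, g in groupby(records, key=lambda r: r[0])]

def scan_sum(records):
    """One output per input record: the running total within each key group."""
    out = []
    for k, g in groupby(records, key=lambda r: r[0]):
        out.extend((k, total) for total in accumulate(amount for _, amount in g))
    return out

records = [("x", 10), ("x", 5), ("y", 7)]
print(rollup_sum(records))  # 2 outputs for 3 inputs
print(scan_sum(records))    # 3 outputs for 3 inputs
```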
175. What is the difference between a DML expression and an XFR expression?
4 answers:-
DML files are record format files used by the metadata files in Ab Initio. DML can be
embedded in the graph or can be a separate file with a .dml extension.
XFR files are transform functions used by graphs. These are present in the sandbox.
An XFR file can be used to manipulate data and compute new data from it.
(OR)
DML describes the record format of the data. It is metadata.
XFR is the set of business rules called a transform function.
176. What is a ramp limit?
6 answers:-The limit parameter contains an integer that represents a number of reject
events.
The ramp parameter contains a real number that represents a rate of reject events in
the number of records processed.
(OR)
We use ramp and limit to set the reject threshold for the records being processed.
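As far as I know the two parameters are combined as threshold = limit + ramp * (records processed so far). Treat that formula as an assumption and check it against the component help. A minimal Python sketch under that assumption (the function name is made up):

```python
def exceeds_reject_threshold(rejects, processed, limit, ramp):
    """True when the reject count is above limit + ramp * processed.

    Assumption: this is how limit and ramp combine; verify against the
    component documentation before relying on it.
    """
    return rejects > limit + ramp * processed

# limit=5, ramp=0.5, 10 records processed -> tolerance is 5 + 0.5 * 10 = 10 rejects
print(exceeds_reject_threshold(10, 10, limit=5, ramp=0.5))
print(exceeds_reject_threshold(11, 10, limit=5, ramp=0.5))
```

With ramp = 0 only the absolute limit applies; with limit = 0 the tolerance grows purely with the number of records processed.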
177. What is a lock in Ab Initio?
2 answers:-The lock mechanism is used by the EME datastore to avoid confusion in the version
control of a graph. Basically, locking gives a particular user exclusive permission to modify and save it
and prevents others from making changes to the same file.
(OR)
In order to edit a graph (which is present in the EME) or its related objects like
dmls, xfrs, etc., we need to lock the graph so that the developer has absolute
permission to edit the graph or its...
178. Explain multi update in Ab Initio and the specific business reason for
its use.
1 answer:-Multi Update executes SQL statements against incoming records to modify one
or more tables in a database. It treats each input record as a completely separate piece
of work.
179. Give examples of pipeline, component and data parallelism?
1 answer:-
Pipeline parallelism occurs when several connected program components on
the same branch of a graph execute simultaneously. If you are using a sort
component, pipeline parallelism does not occur...
180. What is multistage component?
2 answers:-
Multistage components are transform components where the records
are transformed in five stages: input selection, temporary record
initialisation, processing, finalization and ...
(OR)
A multistage component is a transform component that has
packages. In Ab Initio we have three multistage components: Scan,
Normalize and Denormalize.
181. Explain deadlock in Ab Initio?
2 answers:-As explained in the answer above, a deadlock can occur when two or more processes are
requesting one single resource. In Ab Initio, deadlock is
avoided by using phasing and also by using the resource pool.
(OR)
When two or more processes request one single resource,
a deadlock situation occurs.
182. Standard environment?
2 answers:-What is the standard environment? What does it contain?
Answered by: sixto.dsilva on: Apr 17th, 2008
The standard environment is the Ab Initio environment that has more than one private
project and also other public projects, in addition to the required environment
project. Private project is one which no...
(OR)
The standard environment is a collection of public, private and environment projects.
183. Ab Initio partitions?
2 answers:-What is the difference between partition, re-partition and departition?
(OR)
Partitioning is converting a serial file into multiple flows (mfs). The components used are
Partition by Key or Partition by Round-robin.
Departitioning is the opposite of partitioning: sometimes after parallel processing you
want the output to be serialized; we use the Gather and Merge components for this.
(OR)
Dividing a single flow of records (data) into multiple flows is known as
partitioning.
Dividing x-way flows of records (data) into y-way flows is known as
re-partitioning, e.g. a 2-way flow into a 4-way flow.
Combining multiple flows into a single flow of records (data) is known as
departitioning.
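The three terms can be sketched in Python with plain lists standing in for flows. This is only an illustration: the function names are made up, round robin is used as the partitioning rule for simplicity, and the departition step behaves like a simple concatenation rather than any particular Ab Initio component.

```python
def partition(records, ways):
    """Split one flow into `ways` flows, dealing records out in turn."""
    flows = [[] for _ in range(ways)]
    for i, rec in enumerate(records):
        flows[i % ways].append(rec)
    return flows

def departition(flows):
    """Combine multiple flows back into a single flow."""
    return [rec for flow in flows for rec in flow]

def repartition(flows, ways):
    """Turn an x-way set of flows into a y-way set of flows."""
    return partition(departition(flows), ways)

two_way = partition(list(range(8)), 2)
four_way = repartition(two_way, 4)
print(two_way)
print(four_way)
print(sorted(departition(four_way)))
```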
184. Ab Initio bad-straight-flow (when do we get an error like 'bad-straight-flow')?
2 answers:-It is an error you get when there is a mismatch in a flow, that is, one component has a serial layout
and the other component has an mfs layout. Ab Initio will complain about the bad flow and the
graph will be aborted because of this. You should use a partition component in
order to avoid this.
(OR)
Layout problems in components, e.g. a straight flow connecting a component
which has a serial layout with a component which has a multifile layout.
185. How is dynamic DML created and in which context is it used?
1 answer:-Dynamic DML is used if the input metadata can change, i.e. suppose
different input files with different DML are received for processing at different times.
In that case we can use flag in the dml a...
186. Ab Initio air-project parameters?
1 answer:-
What are air-project parameters and air-sandbox? What do they contain?
(OR)
As we know, a sandbox is a file system which contains all the graphs and other files
that graphs depend on. When sandbox contents are put under source control in the
EME datastore it becomes a project. Now sandbox...
187. How can you run a graph infinitely?
3 answers:-To run a graph infinitely, the end script in the graph should call the .ksh file
of the graph. Thus if the name of the graph is abc.mp, then in the end script of the graph
there should be a call to abc.ksh. In this way the graph will run infinitely.
(OR)
You can use "Continuous flow" components for this.
Answered by: Sai on: Jun 29th, 2006
I wonder why you require this infinite loop concept, and if so, how to get out if a graph
runs recursively.
188. Describe the “grant/revoke” DDL facility and how it is implemented.
2 answers:-
The GRANT command is used to give users permission to do something or create a specific
object in the database.
On the other side, REVOKE is used to take back the permission.
Internally these 2 commands make entries into data dictionary tables.
(OR)
Basically, this is part of the DBA's responsibilities. GRANT means permissions, for
example GRANT CREATE TABLE, CREATE VIEW AND MANY MORE
. ...
189. Describe the effect of the "checkpoint" T-SQL statement.
2 answers:-From a database point of view, a checkpoint is used to flush the data from the DB
buffer (memory) to the physical disk.
(OR)
As far as I know, in Ab Initio checkpoints are the intermediate stopping points.
These save information about the recently processed data in the form of *rec
files. so whenever there occurs a ...
190. How to do Ab Initio certification? What are the prerequisites for doing
Ab Initio certification?
2 answers:-No, it is not true; they do offer certification, and the cost is $1200.
(OR)
Ab Initio Corporation is not offering any certification like others do, e.g.
Informatica. After verification with the Ab Initio Corporation people I found that
they are not offering this. It's disappointing but true.
191. Can anyone please explain the difference between a .dbc file and a .cfg file
with an example?
3 answers:-A .dbc file contains all the connectivity info which the user needs to connect
to the database. dbc means database configuration file. It will contain the server name,
userid, password, type of db, stuf...
(OR)
A .cfg file is for remote connection and a .dbc file is for database connection.
192. How is referential integrity enforced in a SQL Server database?
1 answer:-While creating the table you can implement that by adding
foreign key references p.tab_name(p.Col_name)
Here p refers to the primary table, and this statement is to be written beside the column you want
to make the referential-integrity column for the table.
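To see a foreign key reference being enforced, here is a small self-contained sketch. Note the swap: it uses SQLite through Python's sqlite3 module rather than SQL Server, only because that runs anywhere; the table and column names (dept, emp, dept_id) are made up, and SQLite needs foreign key checking switched on explicitly.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled

# Parent (primary) table and child table with a REFERENCES clause beside the column.
conn.execute("CREATE TABLE dept (dept_id INTEGER PRIMARY KEY, name TEXT)")
conn.execute(
    "CREATE TABLE emp ("
    " emp_id INTEGER PRIMARY KEY,"
    " dept_id INTEGER REFERENCES dept(dept_id))"
)

conn.execute("INSERT INTO dept VALUES (1, 'sales')")
conn.execute("INSERT INTO emp VALUES (100, 1)")  # parent row exists: accepted

try:
    conn.execute("INSERT INTO emp VALUES (101, 99)")  # no dept 99: rejected
    violation = False
except sqlite3.IntegrityError:
    violation = True

emp_count = conn.execute("SELECT COUNT(*) FROM emp").fetchone()[0]
print(violation, emp_count)
```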
193. What is a cursor? Within a cursor, how would you update fields on the
row just fetched?
3 answers:-Cursors are database objects used by the application to manipulate the data in
a set on a row-by-row basis.
(OR)
Ab Initio doesn't use cursors. It can fetch any number of records simultaneously.
194. What is the difference between API and utility modes in Input Table?
Asked By: Bobby | Asked On: Mar 26th, 2007
2 answers:-API and utility are two possible interfaces to databases from Ab Initio.
We might use API mode when we want to make use of a function that is not available
through a utility.
(OR)
These are the interfaces to the database. API is more advantageous than utility. API
uses SQL queries whereas utility uses the options provided by the vendor.
195. Hello all Ab Initio developers, I have recently graduated and I have just
joined the data warehousing profession. I got some basic training in Ab Initio
for a few months now and have a few questions, can...?
1 answer:- 1) In Ab Initio components, pipeline parallelism is supported by default, so the user
can't change it. Data parallelism is supported only when your graph has multifiles wit...
196. What is the difference between clustered and non-clustered indices?
...And why do you use a clustered index?
1 answer:-A clustered index creates a physical order on the table based on the definition
given. You can only have one clustered index.
197. How many parallelisms are in Ab Initio? Please give a definition of each.
4 answers:-Three kinds of parallelism: 1. Component-level parallelism --> different
components acting on different data at the same time. 2. Data parallelism --> A graph that
deals with data divided into segments a...
(OR)
The 3 kinds of parallelism are: Component parallelism: occurs when program
components execute simultaneously on different branches of the graph. Pipeline
parallelism: occurs when connected sequence of progra...
198. How can you force the optimizer to use a particular index?
3 answers:-An index hint explicitly chooses an index scan for the specified table. You can use the
INDEX hint for domain, B*-tree, and bitmap indexes. CREATE INDEX
ix_customers_gender ON customers(gender); set autotrace traceonl...
An example of hints:
199. Direct use of indexes?
select /*+ index(a index_name) full(b) */ * from table1 a, table2 b where b.col1 =
a.col1 and b.col2 = 'sid' and b.col3 = 1;
200. How would you do performance tuning for an already built graph? Can
you let me know some examples?
4 answers:-There are many ways to improve the performance of a graph. It also
depends on the particular graph and the components used in it. In general the following tips
can be used for improving perform...
Hi Sunny,
what you said is right.
201. Can shadow stored procedures be called from Ab Initio components? If
yes, then how can it be done and how is it different from normal stored
procedures?
2 answers:-It is possible to write a stored procedure in Ab Initio. Just write the stored
procedure and save it, but do not forget to use the stored procedure with the recompile
option because it will improve the performance.
(OR)
Yes, it is possible to call a stored procedure in Ab Initio. To call a stored procedure,
first of all you have to write the stored procedure and store it, then use an input
component to call the stored procedure.
202. How do you maintain historical data in a data warehouse?
2 answers:-Normally, to maintain short-term historical data, like 3-6 months etc., there
is no need to go to a DWH. For maintaining long-term historical data, i.e. 3-6 years etc.,
you need to go to a DWH. In DWH we w...
(OR)
We can have active and inactive records in the data warehouse to maintain history.
203. How to use a lookup file in the graph?
2 answers:-Lookups are always used in combination with Reformat components. In
business rules you write the lookup function and use the lookup. You can find the lookup
button in the description tab of the editor.
(OR)
If you have an already saved lookup file in some specific path, just call that file in
the graph. You can do that by using a 'lookup file' component in the graph, whose
URL will have the path of the loo...
204. How to schedule graphs in Ab Initio, like workflow scheduling in
Informatica? And where must we use UNIX shell scripting in Ab Initio?
3 answers:-Hi Praveen,
In Ab Initio there is no built-in scheduler in the GDE. Most people use crontab or
specific tools for scheduling:
1) AutoSys or
2) Maestro.
(OR)
You don't have a scheduler in Ab Initio, unlike Informatica. With this tool we need to
buy a third-party tool such as AutoSys for scheduling.
205. Is it possible to do CICS transactions through Ab Initio graphs?
1 answer:-No, it is not possible to do CICS transactions in an Ab Initio graph, because
the mainframe has limited data gateway connectivity via Ab Initio; it just uses the layout. You
have to get the data from the mainframe to Ab Initio and carry on your transaction.
206. What are the commonly used or important functions used in real time in
Ab Initio?
3 answers:-Answered by: vijayk07 on: Mar 13th, 2007
string_ltrim,
string_lrtrim,
string_substring,
reinterpret_as,
(OR)
Hi Hema, I am not able to understand what your requirement is. There are a lot
of functions available in Ab Initio, so we cannot decide which function will be
commonly used in Ab Initio. Regards, mahendra
207. What is a design document in Ab Initio?
2 answers:-There is a design document for Ab Initio projects just like there is for any
project following a software development lifecycle framework. While in some flavors,
there is a High level technical design...
(OR)
I am able to understand your question. There is no such kind of document in Ab Initio.
The design document is project related.
208. What is a driving port?
2 answers:-The largest input is the driving port.
(OR)
In a join, it is sometimes advantageous to have the Sorted-Input parameter set to
"Input need not be sorted". This helps when we are sure that one of the input
ports has far fewer records than the oth...
209. What are the benefits of Ab Initio over Informatica?
2 answers:-Hi Pradeepa, I never worked on Informatica, but as far as I know
the main difference between these two is that in Ab Initio you can do data
parallelism, which ultimately helps to save your time t...
(OR)
1. Ab Initio supports parallelism
2. multifile system
3. the amount of data migration is more in Ab Initio.
4. easy to run & build.
210. What is meant by a vector field? Explain with an example?
Asked By: prakash | Asked On: Aug 9th, 2006
2 answers:-Hi, a vector is like an array in C. For example Transactions[0]
Transactions[1] Transactions[2]Customer ID n dt Amount dt ...
(OR)
A vector is a sequence of elements of the same type. The element type may be
any type, including a vector or record type. It is a field which tells us how many
times a particular field is rep...
211. What is meant by fencing in Ab Initio?
Asked By: raham | Asked On: Dec 9th, 2005
6 answers:-In the software world, fencing means job control on a priority basis. In Ab Initio it actually
refers to customized phase breaking. A well-fenced graph means that no matter what the
source data volume is, the process will not get caught in deadlocks. It actually limits the number of
simultaneous processes.
(OR)
I think he is referring to "phasing" only.
Phasing: it is a way to manage resources and avoid "deadlocks" in the Ab Initio run.
It limits the number of simultaneous processes by breaking up the graph into
different phases, only one of which is running at any given time.
212. How can we check in to the EME? Please give a step-by-step process.
4 answers:-Hi, the EME contains only the latest version graphs of your project. It is an area
where single or multiple projects will be stored. To check the graph in to or out of the
EME you have to look on to the...
(OR)
sandbox --> is the personal work area, and you will find only the latest versions of the
graphs.
EME is like VSS: there you will find old and new versions of the graphs.
When you check in / out, the graph will come into your sandbox area.
Hope this will help you.
213. How can we achieve scrubbing or data validation through Ab Initio? I know
that by using the is_valid function in the validate record component we can check
the values of the fields to be consistent with the data type...?
2 answers:-Hi, you can achieve this by using a component called record validation. It
is used only if you want to validate the input record. If you want to validate field-level
values then there are some func...
(OR)
What exactly is your requirement? Can you specify that, as there are various other
functions available apart from the more common ones like is_valid (as you had
mentioned), is_null, is_blank and is_defined. Besides which, there are comparison
operators available like any other programming language.
214. What is an outer join?
3 answers:-Just to complete the remaining considerations.
The outer join, just like the explicit semi joins, will assign null to the
fields that should be assigned by the non-existing flows. Therefore look out for
non-nullable fields.
If you want to see all the records of one input file, independent of whether there is
a matching record in the other file or not, then it is an outer join.
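A rough Python sketch of the idea (not Ab Initio code; the function name left_outer_join and the sample employee/department records are assumptions). Every record of the first input is kept, and the missing side is filled with None, which is why non-nullable output fields need care.

```python
def left_outer_join(left, right, key):
    """Keep every left record; unmatched ones get None for the right side."""
    index = {}
    for rec in right:
        index.setdefault(rec[key], []).append(rec)
    out = []
    for l_rec in left:
        matches = index.get(l_rec[key])
        if matches:
            out.extend((l_rec, r_rec) for r_rec in matches)
        else:
            out.append((l_rec, None))  # no matching record on the other input
    return out

emps = [{"dept_id": 1, "name": "ann"}, {"dept_id": 9, "name": "bob"}]
depts = [{"dept_id": 1, "dept": "sales"}]
joined = left_outer_join(emps, depts, "dept_id")
for emp, dept in joined:
    print(emp["name"], dept["dept"] if dept is not None else None)
```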
215. Do you know what a local lookup is?
4 answers:-Hi, the lookup_local function retrieves the first matching record from a
partitioned multifile which is partitioned on a particular key. Consider a lookup file
"emp_detail" which is having the fields emp_id,...
(OR)
A lookup file consists of data records which can be held in main memory. This
lets the transform function retrieve the records much faster than retrieving them
from disk. It allows the transform component to process the data records of
multiple files quickly.
VPN Reddy
216. What is .abinitiorc and what does it contain?
2 answers:-The .abinitiorc file contains all configuration variables such as
AB_WORK_DIR, AB_DATA_DIR etc.
This file can be found in "$AB_HOME/Config".
cheers,
mukund
(OR)
.abinitiorc is the config file for Ab Initio. It is found in the user's home directory.
Generally it is used to contain the Ab Initio home path and different login information like
id encrypted password log...
217. For data parallelism, we can use partition components. For component
parallelism, we can use the Replicate component. Like this, which component(s)
can we use for pipeline parallelism?
2 answers:-Guys, this was a very good question. Before learning Ab Initio or building any
graph you need to know the concepts of parallelism. They are 1) data parallelism 2) pipeline
parallelism 3) component parallelism generally...
(OR)
When a connected sequence of components on the same branch of a graph executes
concurrently, it is called pipeline parallelism. Components like Reformat, where we
distribute the input flow to multiple output flows using...
218. One of the advantages of Ab Initio over other ETL tools is better error
handling. How can you say it provides better error handling?
3 answers:-Hi visitor, I have very little information about DataStage, but let me
differentiate Ab Initio from Informatica, which are two major ETL tools in the market
today. Both tools are fundamentally ...
(OR)
What are the advantages of Ab Initio over DataStage PX? You seem very
knowledgeable.
Thanks.
219. Can anyone please explain how to error trap your work?
2 answers:-Yes, error trapping is essential in some cases, depending on the
business requirement. For components like Reformat, Join etc. we can trap records which
do not satisfy the respective rules at r...
(OR)
Error trapping: each component has reject, error and log ports. Reject captures rejected
records, error captures the corresponding error, and log captures the execution
statistics of the component. You can co...
220. How to use max-core number of rows?
Asked By: Godhuly | Asked On: Oct 2nd, 2006
3 answers:-Hi, max-core is an essential parameter in components like Sort, Rollup, Scan
etc. max-core gives the amount of memory which a component uses to process data
before spilling to disk, i.e. we need to have an o...
(OR)
Most of the time you will not touch the max-core value. Even if someone
asks in an interview, say straight away that most of the time we use the default value.
221. How to use override key in join?
2 answers:-Hi folks, generally the override key concept comes up when we have two
non-matching field names and we want to join on them. Let me throw some insight into
this: suppose I have 2 files, say FILE1,F...
(OR)
When we need to join on 2 fields which have different field names, we can use
overridekey0 to specify the field name in in0 and overridekey1 for the field name from in1.
Example:in0 DMLrecordstring(...
223. What is meant by the Co>Operating System and why is it special for Ab Initio?
6 answers:-Yes, Ab Initio is a combination of the GDE (Graphical Development
Environment), the Co>Operating System, and the EME (Enterprise Meta Environment). Each
has its own importance and they are inter-related. generally...
(OR)
Ab Initio is like a bulb and the Co>Operating System is like the current (transformer): the bulb will glow only when
current is passed.
224. How do we handle DML that changes dynamically?
3 answers:-Hey folks,
If there is any instance of dynamically changing DMLs, then we have to
pass both the DMLs and the XFRs as graph parameters.
(OR)
I think we can use the MULTIREFORMAT component to handle dynamically
changing DMLs.
225. Explain how you have used input parameters?
2 answers:-Hi,
we can give parameters, or any validations which are to be done, in the start script
of the graph,
or
pass all the necessary parameters as graph parameters or sandbox
parameters. These parameters are referenced by a "$" sign,
e.g. ${file_id}=AAA .
yuppy,
Mukund
(OR)
creation of input parameters: Go to the properties of the graph-->select
the parameters tab--&g...
226. How do you convert a 4-way mfs to an 8-way mfs?
3 answers:-We can do it by using PARTITION BY KEY AND SORT COMPONENT.
Sree
(OR)
Hello, to convert a 4-way to an 8-way partition we need to change the layout in the
partitioning component. There will be separate parameters for each and every type
of partitioning, e.g. AI_MFS_HOME, AI_MFS_MEDI...
227. What are primary keys and foreign keys?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
2 answers:-A foreign key is a primary key in another table, which creates a parent-child
relationship between the two tables. The foreign key resides in the child table, which references the
parent table.
(OR)
In an RDBMS the relationship between two tables is represented as a primary key
and foreign key relationship, where the primary key table is the parent table and the
foreign key table is the child table. The criterion for both tables is that there should
be a matching column.
228. What is writing a wrapper? Can anyone explain elaborately?
Asked By: jai | Asked On: Sep 20th, 2006
1 answer:-Writing a wrapper script helps you to run the graphs in the sequence you
want. Example: when you need to run 3 graphs but the condition is that after the first graph ran
successfully you need to take the feed...
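A wrapper is normally a shell script around the deployed graph scripts; the same idea can be sketched in Python. This is only an illustration: the function name run_in_sequence is made up, and the commands below are stand-ins that just exit with a status, where a real wrapper would call the deployed graph scripts.

```python
import subprocess
import sys

def run_in_sequence(commands):
    """Run commands one after another; stop at the first failure.

    Returns the list of commands that completed successfully.
    """
    completed = []
    for cmd in commands:
        result = subprocess.run(cmd)
        if result.returncode != 0:
            break  # later graphs depend on this one, so do not start them
        completed.append(cmd)
    return completed

# Hypothetical stand-ins for graph scripts: each only exits with a status code.
ok = [sys.executable, "-c", "raise SystemExit(0)"]
fail = [sys.executable, "-c", "raise SystemExit(1)"]

print(len(run_in_sequence([ok, ok])))
print(len(run_in_sequence([ok, fail, ok])))
```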
229. What is conditional DML? Can anyone please explain with an example?
2 answers:- Conditional DML is a DML which is separated based on a
condition. E.g. suppose your DML has header, detail and trailer parts; you need to separate
these because we need to have the detail part. record if (r...
(OR)
The DML that is used as a condition is known as conditional DML. Suppose we
have data that includes the header, main data and trailer as given below:10 This
data contains employee info.20 emp_id,emp_na...
230. How to take the input data from an Excel sheet?
5 answers:-Hi guys, use the COMPONENTS > TRANSLATE > READ SEPARATED VALUES
component. You can use this for an Excel sheet. Thanks, Rafi,
Chennai
(OR)
Make it a CSV-formatted, delimited file and read it through the input table component.
231. I am going to get in-house training on Ab Initio. Can anyone state the
responsibilities of an ETL developer?
1 answer:-
1- unix + shell
2- sql
3- llrsim
4- little data base concepts (DBA)
232. Describe a typical backup policy that you would employ.?
1 answer:- WE can use backup the EME with air-repostory backup command from the
command line mode.
233. What is the difference between partitioning with key and round robin?
4 answers:-Partition by key internally generates a hash code; through that it
distributes the data records.
What they said about round robin was correct.
(OR)
Partition by Key - distributes according to the key value. Partition by Round Robin
- distributes a predefined number of records to one flow and then the same
number of records to the next flow and so o...
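The two distribution rules can be compared in a short Python sketch. It is an illustration only: the function names and the block_size parameter are assumptions, and zlib.crc32 is just a stand-in for whatever hash the real component uses.

```python
import zlib

def partition_by_key(records, key, ways):
    """Send each record to the flow chosen by a hash of its key value."""
    flows = [[] for _ in range(ways)]
    for rec in records:
        h = zlib.crc32(str(key(rec)).encode("utf-8"))  # stand-in hash function
        flows[h % ways].append(rec)
    return flows

def partition_by_round_robin(records, ways, block_size=1):
    """Send block_size records to one flow, then the next block to the next flow."""
    flows = [[] for _ in range(ways)]
    for i, rec in enumerate(records):
        flows[(i // block_size) % ways].append(rec)
    return flows

records = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("b", 5)]
keyed = partition_by_key(records, key=lambda r: r[0], ways=3)
dealt = partition_by_round_robin(list(range(6)), ways=3, block_size=2)
print(keyed)
print(dealt)
```

With partition by key, all records sharing a key land in the same flow, so flow sizes depend on the data; with round robin, the flows stay evenly sized but records with the same key can end up in different flows.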
234. How do you truncate a table? (each candidate would say only 1 of the
several ways to do this.)?
4 answers:- You can truncate the table using one of the following components: Run
SQL, Truncate Table, Update Table.
(OR)
There are many ways to do it.
1. Probably the easiest way is to use Truncate Table
2. Run Sql or update table can be used to do the same thing
3. Run Program
235. What are graph parameters?
4 answers:-Hello, the parameters that are private to a graph are called graph
parameters. They can't be used for all the graphs present in the sandbox, i.e. they are
limited to the graph for which they have bee...
(OR)
where we use local parameter and formal
236. Difference between conventional loading and direct loading? When is it
used in real time?
2 answers:-Misconception! Utility mode can be either direct or conventional. This
question is basically an Oracle question; for details please refer to the Oracle sqlldr
references. Conventional Load: same ...
(OR)
Conventional load: before loading the data, all the table constraints will be
checked against the data. Direct load (faster loading): all the constraints will be
disabled. Data wil...
237. What are the components new in 2.14 compared to 1.8? State the
usage of the components.
1 answer:-I don't think this question is appropriate for an interview... maybe just because
I don't know the answer :)
238.How to create surrogate key using ab initio?
2 answers:-For #1, if the layout is not serial, you might need to do something like:
(next_in_sequence()-1)*no_of_partition()+this_partition()
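To see why this expression gives keys that do not collide across partitions, here is a small Python sketch of the arithmetic. It assumes the per-partition sequence starts at 1 and that partitions are numbered from 0; the function name surrogate_keys is made up and this is not Ab Initio code.

```python
def surrogate_keys(record_count, partition, partitions):
    """Keys for one partition: (sequence - 1) * partitions + partition.

    Assumptions: the sequence counts 1, 2, 3, ... within each partition and
    partition numbers run from 0 to partitions - 1.
    """
    return [(seq - 1) * partitions + partition
            for seq in range(1, record_count + 1)]

partitions = 4
all_keys = []
for p in range(partitions):
    keys = surrogate_keys(3, p, partitions)
    print(p, keys)
    all_keys.extend(keys)
```

Each partition steps through its own residue class modulo the number of partitions, so the keys interleave without any coordination between partitions.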
(OR)
Hi Bava, there are many ways to create a surrogate key, but it depends on your
business logic. Here you can try these ways: 1. use the next_in_sequence() function in
your transform. 2. use Assign key values compone...
239. What are the advantages and disadvantages of using the Assign Keys component?
1 answer:-Pros: easy, and saves development time.
Cons: you need to understand how to feed the parameters, and you can't control it
easily :)
239. Could anybody provide me the major UNIX commands for the Ab Initio
multifile system?
2 answers:-m_mkfs - to create a multifile
m_ls - to list all the multifiles
m_rm - to remove a multifile
m_cp - to copy a multifile
(OR)
m_mkfs control_file multipart_file1 multipart_file2 .... multipart_filen
240. How can you schedule the scripted file in Ab Initio?
2 answers:-You could use Tivoli Maestro for the batch schedule. This will call a Unix
shell script that calls the Ab Initio graph.
(OR)
We can schedule scripts by using a cron job only.
241. How to create a multifile system in Windows?
Asked By: prakash25 | Asked On: Aug 9th, 2006
2 answers:-Hi, we can create a multifile in 2 ways. One is to go to the Run tab in the GDE, go to
execute command, and there give m_mkfs control_partition data_partition1
data_partition2.........data_partitionn eg:m_mkf...
(OR)
In the GDE go to Run --> "execute command". m_mkfs will help you create a
multifile system. Type the control partition and the rest of the locations where your
data can go or from where you want to acc...
242. How to pass the input file name as a parameter while running a graph from a
wrapper (back end)?
Asked By: Nilanjan | Asked On: Sep 3rd, 2006
1 answer:-Hi all, you can pass the input file name as a parameter using a graph-level
parameter, which is particularly for the first graph if only this graph uses that input file, and
another parameter file for another gra...
243. I have a control file which consists of 4 mfs files. I want to add
another 4 files to the same control file. How can I do this, and how do I convert it into a
directory?
1 answer:-Use the m_mkdir command to add more directories to the already existing
directory structure.
244. How to pass a filter-by-expression condition as a parameter while
running a graph from a wrapper (back end)?
Asked By: Nilanjan | Asked On: Sep 3rd, 2006
1 answer:-First of all define a "Formal" keyword parameter of type string, e.g. let us
say you defined a parameter named "FilterCondition", and you want your filter to be
"COUNT > 0". Now on your com...
245. What does layout mean in terms of Ab Initio?
3 answers:-Adding what I know:
the layout of a program component tells the Co>Operating System where the component
runs, and the layout of a dataset component tells it where the data is
available....
(OR)
A layout describes the following to the Co>Operating System:
1. the location of a serial file
2. the number and locations of the partitions of a multifile
3. the number of, and the locations at which...
246. What does dependency analysis mean in Ab Initio?
Asked By: Godhuly | Asked On: Aug 3rd, 2006
1 answer:-Dependency analysis answers questions about data lineage, that is,
where the data comes from, which applications produce and depend on this data,
etc.
247. How to create a repository in Ab Initio for a stand-alone system (local NT)?
1 answer:-Answered by: Prasad on: Aug 9th, 2006
If you are installing Ab Initio on a stand-alone machine, it is not
necessary to create the repository. The installer creates it automatically for
you under the abinitio folder ( ...
248. What do you mean by .profile in Ab Initio and what does it contain?
1 answer:-.profile is a file that is executed automatically when a particular user
logs in. You can change your .profile file to include any commands that you want to
execute whenever you log in. You c...
249. How to work with parameterized graphs?
4 answers:-To design your graph as a parameterized graph,
you should declare the parameter scope as "formal".
(OR)
One of the main purposes of parameterized graphs is that if we need to run the
same graph n times for different files, we set up graph
parameters like $INPUT_FILE, $OUTPUT_FILE et...
250. Have you worked with packages?
3 answers:-Packages are nothing but reusable blocks of objects like transforms,
user-defined functions, DMLs etc. These packages are to be included in the transform
where you use them. For example, cons...
This needs to be elaborated further with an example.
251. How does maxcore work?
6 answers:-Every component in Ab Initio has a memory of 7MB (anybody correct me if
I'm wrong) except a few in-memory components such as Join, Rollup,
Scan, Join with DB etc., which use an a...
It is the maximum temporary scratchpad memory allowed for the execution of an
in-memory component, per partition.
252. Can anyone give me an example of a real-time start script in a graph?
2 answers:-Here is a simple example of using a start script in a graph. In the start script,
write:
export DT=`date '+%m%d%y'`
Now the variable DT will hold today's date before the graph is run. Now somewhere in t...
(OR)
If you want to run the graph only if the output file exists, you can write a check in the
start script to do so.
Example script:
# stop with a non-zero status when out.dat is missing
if [ ! -f out.dat ]
then
exit 1
fi
253.What is data mapping and data modelling?
1 answer:-data mapping deals with the transformation of the extracted data at FIELD
level i.e. the transformation of the source field to target field is specified by the mapping
defined on the target ...
254. Can we load multiple files?
2 answers:-Loading multiple files, from my perspective, means writing into more than one
file at a time. If this is the same case with you, Ab Initio provides a component called
Write Multiple Files (in dataset Compo...
There are many ways to do this; I am giving one example due to time constraints.
Using an MFS system you can do
255. What is the difference between a .dbc and a .cfg file?
Asked By: rupal | Asked On: Apr 26th, 2006
3 answers:-The .cfg file is for the remote connection and the .dbc file is for connecting to the
database. The .cfg file contains:
1. The name of the remote machine
2. The username/pwd to be used while connecting to the db
3. The locatio...
(OR)
.dbc is the database configuration file and .cfg is the database table configuration file.
256. Can anybody please add some questions on Teradata utilities?
Answer Question

257. How to execute the graph from the start to the end stages?
And how to run a graph on a non-Ab Initio system?
1 answer:-There are many ways to do this; I am giving one example due to time
constraints.
You can run components according to the phases you have defined.
You can also run the graph by creating ksh or sh scripts.
258. How will you use eme for view/publish metadata reports using eme?
1 answer:-what is "metadata reports"?
259. Can anyone please explain environment variables with an example?
1 answer:-Environment variables serve as global variables in the Unix environment.
They are used for passing values from one shell/process to another. They are inherited
by Ab Initio as sandbox variables/ grap...
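The inheritance described in this answer can be seen with a plain shell, independent of Ab Initio. A minimal sketch, assuming the hypothetical variable names RUN_DATE and LOCAL_ONLY: an exported variable reaches a child process, while an unexported one does not.

```shell
#!/bin/sh
# RUN_DATE and LOCAL_ONLY are hypothetical names used only for illustration.
unset LOCAL_ONLY                 # make sure it is not inherited from the caller
RUN_DATE=20060326
export RUN_DATE                  # exported: visible to child processes
LOCAL_ONLY=scratch               # not exported: stays in this shell only

# The child shell reports what it can see; "unset" marks a missing variable.
child_sees=$(sh -c 'echo "${RUN_DATE:-unset}/${LOCAL_ONLY:-unset}"')
echo "$child_sees"
```

The child prints the value of RUN_DATE followed by "unset" for LOCAL_ONLY, which is why values that a graph script needs are normally exported by the wrapper or the login profile.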
259. When running a stored procedure definition script, how would you
guarantee the definition could be "rolled back" in the event of problems?
1 answer:-There are quite a few factors that determine the approach, such as what
type of version control is used, what the size of the change is, what the impact of
the change is, whether it is a new procedure or rep...
260.Describe how you would ensure that database object definitions (tables,
indices, constraints, triggers, users, logins, connection options, and server
options etc) are consistent and repeatable between...?
1 answer:-Take an entire database backup and restore it in different instance.
Take a statistics of all valid and invalid objects and match.
Periodically refresh
261. Can anyone tell me what happens when a graph runs? That is, the Co>Operating
System is on the host, and we run the graph from some other
place. How does the Co>Operating System interact with the native...?
Asked By: Saravanan Shanmuganathan | Asked On: May 12th, 2006
3 answers:-When you run a graph from the GDE, the Co>Operating System converts it into the native
language of Unix. The Co>Operating System acts like an interpreter to the operating system.
(OR)
Hi Sunitha, whenever you press the Run button in your GDE, the GDE generates a
script, and the generated script is transferred to the host specified in
your GDE run settings. Then the Co>o...
262. How will you test a dbc file from command prompt ??
1 answer:-try "m_db test myfile.dbc"
263. How to find the number of arguments defined in a graph?
3 answers:-$# - the number of positional parameters
$? - the exit status of the last executed command
Answered by: cndraa on: May 13th, 2006
then what is $# and $? ...
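To answer the follow-up, both are standard shell special parameters rather than anything specific to Ab Initio. A minimal sketch, using the hypothetical helper names count_args and status_of:

```shell
#!/bin/sh
# count_args is a hypothetical helper: inside a function, $# is the number
# of arguments passed to that function.
count_args() {
    echo "$#"
}

# status_of is a hypothetical helper: it runs a command and echoes the
# exit status that the shell stored in $?.
status_of() {
    status=0
    "$@" >/dev/null 2>&1 || status=$?
    echo "$status"
}

count_args in.dat out.dat 20060513   # prints 3
status_of true                       # prints 0
status_of false                      # prints 1
```

In a deployed graph script the same rule applies at script level: $# is the number of positional arguments the script was called with, and $? must be read immediately after the command of interest, because every later command overwrites it.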
263. I'm unable to connect to the server database (Oracle) from the GDE (db config file) on my
local system. I set all these tags: 1) database 2) database
version 3) db_home 4) db_node 5) db_host 6) user 7) default_database_layout, still
I'm...?
1 answer:-Hi Chalapathi, first check the properties in Internet Options, and then
check from the command prompt by running telnet against the Ab Initio host IP address. I think it will help you.
Cheers, Koganti
264. What are broadcast and replicate?
5 answers:-Broadcast - Takes data from multiple inputs, combines it and sends it to all
the output ports. E.g. - You have 2 incoming flows (this can be data parallelism or
component parallelism) on Bro...
(OR)
Replicate generates multiple straight flows as the output, whereas broadcast results in a
single fan-out flow.
Replicate improves component parallelism, whereas broadcast improves data
parallelism.
265. What are local and formal parameters?
2 answers:-A local parameter is like a local variable in the C language, whereas a formal
parameter is like a command-line argument that we need to pass at run time.
(OR)
Both are graph-level parameters, but for a local parameter you need to initialize the value at the
time of declaration, whereas a formal parameter need not be initialized; the graph will prompt
for its value at run time.
266. What is the driving port? When do you use it?
2 answers:-The driving port in a join supplies the data that drives the join. That means
every record from the driving port is compared against the data from the non-driving
port. We have to set the driving port...
(OR)
When you set the sorted-input parameter of the JOIN component to "In memory:
Input need not be sorted", you can find the driving port. Generally the driving port
is used to improve performance in a ...
267. 1) Soft links to MFS files on UNIX for Ab Initio: what is this? 2) $# (pound):
what is this? 3) $?: what is it used for? 4) Types of loading 5) Overwrite: when is it
used?
2 answers:-Thanks for your answers! I am still not clear on soft links to MFS files
on Unix for Ab Initio. What is this? Can you explain with a real example?
(OR)
ln is the Unix command for creating a link. When the original file name is deleted,
the data can still be reached through the other link. Example: ln file1 file2
$# - Total number of positional parameters.
$? - ex...
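The difference between a hard link (plain ln, as in the answer) and a soft link (ln -s, as in the question) shows up when the original name is removed. A minimal sketch using ordinary files in a temporary directory; the file names are hypothetical, and nothing here is specific to multifiles:

```shell
#!/bin/sh
workdir=$(mktemp -d)
echo "data" > "$workdir/original.dat"

ln "$workdir/original.dat" "$workdir/hard.dat"      # hard link: second name for the same data
ln -s "$workdir/original.dat" "$workdir/soft.dat"   # soft link: pointer to the path

rm "$workdir/original.dat"                          # remove the original name

# The hard link still reaches the data; the soft link now points at nothing.
hard_content=$(cat "$workdir/hard.dat")
if [ -e "$workdir/soft.dat" ]; then
    soft_state=valid
else
    soft_state=dangling
fi
echo "hard link content: $hard_content"
echo "soft link state: $soft_state"

rm -rf "$workdir"
```

A soft link stores only a path, so it can cross file systems and point at directories, but it breaks when the target is moved or deleted.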
268. What is the AB_LOCAL expression and where do you use it in Ab Initio?
3 answers:-In some cases Ab Initio will not recognize part of a SQL statement and is
unable to read the table. To avoid this we use AB_LOCAL() with the table name
whenever we use clauses like a WHERE condition.
(OR)
One example: if you want to use an Oracle hint in your select query at the input
table, you should use AB_LOCAL(table) so that the query is parsed by the database SQL engine,
not by the Ab Initio parser.
It is mainly used for parallel unloads.
269. What are the different versions and releases of Ab Initio (GDE and Co-Op
version)?
2 answers:-Any idea when these versions were released?
Answered by: Rupal on: Mar 23rd, 2006
For the GDE: 1.10, 1.11, 1.12, 1.13, and 1.15, which is the latest one. For the Co-Op the latest one is
2.14.
270. What is $mpjret? Where is it used in Ab Initio?
Asked By: rupal | Asked On: Mar 2nd, 2006
3 answers:-It is the same as $?, but it gives you the return status of the mp run command. Capture it and
use it in the end script to display and notify warning/success and failure messages.
(OR)
You can use $mpjret in the end script like this:
if [ "$mpjret" -eq 0 ]
then
echo "success"
else
mailx -s "[graphname] failed" mailid < /dev/null
fi
271. What are the differences between the different GDE
versions (1.10, 1.11, 1.12, 1.13 and 1.15)? What are the differences between the different
versions of the Co-Op?
1 answer:-1.10 is a non-key version and the rest are key versions.
Many components were added and revised in the later versions.
272. Have you ever encountered an error called "depth not equal"? (This
occurs when you extensively create graphs; it is a trick question.)
2 answers:-Thanks to Ashim for more
clarification... I have talked about a situation
where you have linked 2 components, each of them having a different layout. Think
about a situa...
(OR)
When two components are linked together and their layouts do not match, this
problem can occur during the compilation of the graph. A solution to this problem
would be to use a partitioning component in between if there was a change in
layout.
273. How many components are in your most complicated graph?
Asked By: Interview Candidate | Asked On: Sep 5th, 2005
2 answers:-This is a tricky question; the number of components in a graph has nothing to do
with the level of knowledge a person has. On the contrary, a proper standardized and
modular parametric approach will reduce...
(OR)
It depends on the type of components you use.
Usually, avoid using very complicated transform functions in a graph.
274. Which is faster to process, fixed-length DMLs or delimited DMLs,
and why?
1 answer:-Fixed-length DMLs are faster because the data of that
length is read directly without any comparisons, but in delimited ones every character has to be
compared, which causes delays.
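The difference in parsing work can be illustrated outside Ab Initio with the standard cut utility. This is only an analogy and not how the Co>Operating System reads records; the record layout (id of 4 characters, name of 10, amount of 4) is hypothetical.

```shell
#!/bin/sh
# Hypothetical record: id (4 chars), name (10 chars), amount (4 chars).
fixed_record=$(printf '%-4s%-10s%4s' 0001 John 0450)
delim_record='0001|John|0450'

# Fixed-length: each field is taken from a known character offset.
fixed_id=$(printf '%s\n' "$fixed_record" | cut -c1-4)
fixed_amount=$(printf '%s\n' "$fixed_record" | cut -c15-18)

# Delimited: the record must be scanned for '|' to find the field boundaries.
delim_id=$(printf '%s\n' "$delim_record" | cut -d'|' -f1)
delim_amount=$(printf '%s\n' "$delim_record" | cut -d'|' -f3)

echo "$fixed_id $fixed_amount"
echo "$delim_id $delim_amount"
```

Both lines print the same id and amount. The fixed-length form pays for that speed with padding, so the record is longer on disk than its delimited equivalent.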
275. What are the continuous components in Ab Initio?
1 answer:-Continuous components are used to create graphs that produce useful output
while running continuously.
Ex:- Continuous Rollup, Continuous Update, Batch Subscribe
276. How will you test a dbc file from the command prompt?
2 answers:-You can test a dbc file from the Unix command prompt using m_db test
<name-of-the dbc file>, which tests the database connection, database version, user
name, password, database ...
(OR)
You can test a dbc file from the command prompt (Unix) using the m_db test <name-of-
dbc file> command, which checks the database connection, the version of the
database, user
277. Explain the difference between the "truncate" and "delete" commands.
3 answers:-Answered by: karthik on: Jan 18th, 2006
When you use truncate, the records are deleted along with all the space allotted for them.
If you use delete, only the records are deleted and the space allocated
for the records remains unaltered.
(OR)
Truncate :- It is a DDL command, used to remove all rows from a table or cluster. Since it is a
DDL command it auto-commits, and a rollback can't be performed. It is
faster than delete. Delete:- It is DML comma...
278. How to retrieve data from a database into a source, and which
component is used for this?
1 answer:-To unload (retrieve) data from a DB2, Informix, or Oracle database we
have components like Input Table and Unload DB Table. Using these two components
we can unload data from the database.
279. Describe the elements you would review to ensure multiple scheduled
"batch" jobs do not "collide" with each other.
1 answer:-Review the dependencies between jobs, because every job depends on another job. For example, if your first job
finishes successfully then the next job will execute; otherwise it will not run.
280. When using multiple DML statements to perform a single unit of work, is
it preferable to use implicit or explicit transactions, and why?
1 answer:-Because implicit transactions are used for internal processing, whereas explicit transactions are
opened by the user as required.
281.How do you add default rules in transformer?
1 answer:-Double click on the transform parameter of parameter tab page of
component properties, it will open transform editor. In the transform editor click on the
Edit menu and then select Add Default Rules from the dropdown. It will show two
options - 1) Match Names 2) Wildcard.
282. What kinds of layouts does Ab Initio support?
1 answer:-Basically, serial and parallel layouts are supported by Ab Initio. A graph
can have both at the same time. The parallel one depends on the degree of data
parallelism. If the multi-file system is 4...
283. What is the difference between a lookup file and a lookup, with a relevant
example?
2 answers:-A lookup is a component of an Ab Initio graph where we can store data and
retrieve it by using a key parameter.
A lookup file is the physical file where the data for the lookup is stored.
(OR)
Generally a lookup file represents one or more serial files (flat files). The amount
of data is small enough to be held in memory. This allows transform functions
to retrieve records much more quickly than they could from disk.
I have no general idea about lookup.
284. Describe the process steps you would perform when defragmenting a
data table. This table contains mission-critical data.
1 answer:-There are several ways to do this: 1) We can move the table in the same or
another tablespace and rebuild all the indexes on the table: alter table <table_name> move
tablespace <tablespace_name>; this act...
285.How would you find out whether a SQL query is using the indices you
expect?
1 answer:-explain plan can be reviewed to check the execution plan of the query. This
would guide if the expected indexes are used or not.
286. Why might you create a stored procedure with the 'with recompile'
option?
1 answer:-Recompile is useful when the tables referenced by the stored proc
undergo a lot of modification/deletion/addition of data. Due to the heavy
modification activity the execution plan becomes outdated an...
287. How do you add default rules in the transformer?
3 answers:-
In the case of a Reformat, if the destination field names are the
same as or a subset of the source fields, there is no need to write anything in the reformat
xfr unless you dont want to use any ...
(OR)
1) If it is not already displayed, display the Transform Editor Grid.
2) Click the Business Rules tab if it is not already displayed.
3) Select Edit > Add Default Rules.
288.What are the inserted and deleted tables and what are they used for?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
289.Why might the optimizer use a table scan when an index is available?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
290. Describe where you would use each of the following index types in
relation to query type and/or data set: hashed, sorted, bitmapped (ranked),
clustered?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
291. In the scenario outlined in #18 and given that the output table(s) will be
accessed by concurrent readers describe how your SQL code would be
structured to minimize lock contention.?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
292.Describe which database structural and physical elements you would
review when optimizing query performance.?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
293.Describe typical activities or elements of investigation you would employ
when trouble shooting complaints of localized process slowness.?
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
294. What is the T-SQL command element you would use to place a logical
object (table or index) on a specific physical file (file group).
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
295. Describe how you would monitor and control database physical file size and
growth.
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
Answer Question
296. Describe what the SQL Server tempdb and tlog file(s) are used for.
Asked By: Interview Candidate | Asked On: Aug 26th, 2004
