0% found this document useful (0 votes)
119 views

Capgemini Interview Questions

The document provides example questions that may be asked during interviews for Capgemini, Cognizant, TCS, and Infosys. The questions cover topics like ETL concepts, SQL, data warehousing, transformations, and more.

Uploaded by

vinay.murahara
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
119 views

Capgemini Interview Questions

The document provides example questions that may be asked during interviews for Capgemini, Cognizant, TCS, and Infosys. The questions cover topics like ETL concepts, SQL, data warehousing, transformations, and more.

Uploaded by

vinay.murahara
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

Capgemini interview questions:

1)Self introduction2)About project

3)input 1,2,3,4,5,6,7,8,13 output 1,2,3,5,8,13 how do you write logic -wrongquestions

4) how do you load data to a table based on target load order -> we can define whihch
target will load frist by using target load order

5) send first three records to one target, next 5 records to another target,remaining to
another target.-> 1.TAke expression drag column to expression 2.take sequence
generator and start value with 1 and drag nexvalue toexpression 3.take router and take
both column to the router and make condtionnexvalue<=3 , nexvalue>3 and
nexvalue<=8 and nexvalue>8 4. connect to the
output**************************************************************************************
********************************************************Cognizant Interview questions

1. Tell me about your self

2. How to remove duplicate from table in oracle-> DELETE employeeWHERE rowid NOT
IN (SELECT MAX(rowid)FROM employee

GROUP BY EMPID);

-> WITH cte AS (SELECT EMPID ,ROW_NUMBER()OVER (PARTITION BY EMPID, ORDER


BY EMPID)as row_num FROM employee)DELETE FROM cteWHERE row_num > 1;

3. What are the column names of Fact table in your project-> STUDY:
id,name,country,date, study description SITE:id, country, SUBJECT:

4. what is the purpose of loading the Dimention Table in your project-> To make the
reports and dashboards ,data monitor

5. What the use of Agg. Transformation-> To perform aggregate function like sum ,avg,
min ,max

So be careful guys

28-5-22Cognizant Interview Questions1.explain about scd type 2-> Transformation


needed : expression, Lookup, sequence-generator, router,update-strategy

2.how to load unique records into one table duplicate records into another table-> 1.
take aggregate transformation find the count of the cloumn 2. take router and make
two condtion one is count =1 for unique and count!=1or default gruop can be used for
duplicate 3.indirect file method-> Used to load the source files with same structure by
using singal sourcequlifier to the target

4.how to pass one session parameter into another session

->we can not pass parameter

this is for variable-> Yes, By giving theIn mapping1 create a mapping variable say var1
and set its value to cloumn.Create a workflow variable in the workflow, say var_wkfIn
session1, in Post-session on success variable assignment, assign var_wkf = var1In
mapping2, create a mapping variable, say var2In session2, in Pre-session variable
assignment, assign var2=var_wkfYou can use the var2 variable in mapping2, it should
have the value set in mapping1

5.what is joiner transformation , explain about how to improve performance-> joiner is


used to join the two heteroguoues source in the mapping ,-> use sorted data whenever
posibale ,use source with fewer rows as master so thatiterations will be less with detail
,Perform joins in a database

6.what is transaction control transformation->Use the Transaction Control


transformation to commit or roll back transactionsfrom transactional targets we can use
this whenever we have large amount ot data and we can commit aftersome interval

7.what are the process for code deployment in your project->

8.do you have knowledge on MD5 function->You want to write changed data to a
database. Use MD5 to generate checksum valuesfor rows of data you read from a
source. When you run a mapping, compare thepreviously generated checksum values
against the new checksum values. Then, writethe rows with updated checksum values to
the target. You can conclude that anupdated checksum value indicates that the data has
changed.

9.explain about router transformation->Router transformatio is used to apply multiple


conditions on same input datarather than using multiple filter

10.explain about performance tuning->To optimize session performance by eliminating


performance bottlenecks.performance bottlenecks mean bez of one object or compoent
the entire session orworkflow is getting stopped or running for long time
11.explain about update strategy transformation->The Update Strategy transformation
is an active transformation. When you design adata warehouse, you need to decide
what type of information to store in targets. Aspart of the target table design, you need
to determine whether to maintain all thehistoric data or just the most recent changes.
there are four tranf we can useinsert, delete, update, or reject.

11.How to update the records without update strategy transformation->use "insert else
update" option at session, the informatica first look in totarget table to see if the data
exeist for coming primary key valye. If yes, it will update otherwise insert.

12.explain about incremental loading(cdc)->the process of loading the selective data


which is newly created or updated fromsource system to the target
system**************************************************************************************
********************************************************************Tcs Interview questions

1. HLD and LLD documents-> HLD : High level document is the overall system design -
covering the systemarchitecture and database design. It describes the relation between
various modules and functions of the system. data flow, flow charts and datastructures
are covered under HLD LLD: (LLD) is like detailing the HLD. It defines the actual logic for
eachand every component of the system. Class diagrams with all the methods and
relationbetween classes comes under LLD. Programs specs are covered under LLD

2. How do u get requirements-> Through JIRA and DMS

3 .explain complex thing you had done in your project->4. How will you give the
parameterized connections-> We will create one file in BWParam location in that we will
mention all targetand source and if needed parameter and will give location of that file
in seesionor in workflow

5.tell connected and unconnected lookup-> connected lkp: Is connected in pipeline or


mapping . we can retrun multipleports in connected lkp ,Use a dynamic or static cache
UN-connected : Is not connected in pipeline or mapping . we can retrun onlyone ports
in un-connected lkp, Use a static cache

6 . Scd 2-> Transformation needed : expression, Lookup, sequence-generator,


router,update-strategy

7. Star schema and snowflake schema-> Star schema: Fact table sourdded by dimension
table . Snowflake: Fact table sourdded by dimension table and that dim table is
againconnected to dimension
8. Partitioning->Partitioning Option to increases the performance of ETL through
parallel dataprocessing.

9. Hash Auto-Keys->The Integration Service uses all grouped or sorted ports to


generate compoundPartition keys

10.Hash User-Keys->The Integration Service uses a hash function to group rows of data
amongPartitions, We define the number of ports to generate the partition key

10.performance tuning->to optimize session performance by eliminating


performancebottlenecks.performance bottlenecks mean bez of one object or compoent
the entire session or workflow is getting stopped or running for long time

11. Joins-> To join the two tables ,inner,left ,right,full

12. Dense , rank , rownum->13. Delete duplicate records->WITH cte AS ( SELECT EMPID
, ROW_NUMBER() OVER (PARTITION BY EMPID, ORDER BY EMPID) as row_num FROM
employee)DELETE FROM cteWHERE row_num > 1;14.stored procedures->15. Dynamic
lookup cache-> dynamic LKP chnages the cache every time the service perocess the
lookup. usethis when we want insert or update the target

16. How do you load alternate records.-> take expression sequence-generator cycle 1-2
only , then router 1 will go to theone target and 2 will another target

17. Primary key and foreign key-> Primary key is used to uniquely identify a row
.Foreign key is used to connecttwo tables or maintain relationship between them

18. Indexes->Used to speed up the data retrieval form the database . liek select ,
ordelete,update , insert

19.views->virtual table of an SQL statement.The fields in a view are fields from one
ormore real tables in the database

20 Sorter active or passive why??-> Sorter is passive bez it does not changing no of
rows

21 Active and connected->An active transformation can change the number of rows
that pass through thetransformation A passive transformation does not change the
number of rows that pass throughthe transformation
22.How do you create single lookup transformation using multiple tables?->we have the
lOOKUP OVERRIDE Query in the Lookup transformation. Use the SQl Queryto join the
tables you lookup on

23.what is worklet and what use of worklet and in which situation we can use it?..-
>Worklet:- Group of tasks taken to accomplih a task is known as a Worklet.Use of
Worklet:- You can bind many of the tasks in one place so that they caneasily get
identified and also they can be of a specific purpose.

24.what is a junk dimension ?.->A "junk" dimension is a collection of random


transactional codes, flags and/ortext attributes that are unrelated to any particular
dimension 25.Conformed Dimensions->This is used in multiple locations. It helps in
creating consistency so that thesame can be maintained across the fact tables

26.Degenerate Dimensions->When the dimension attribute is stored in the fact table


and not in a separatedimension table

*********************************************************************************************
****************************Infosys Interview questions--------------------------------1)SQl
operations->2)difference between fact and dimension-> Fact: Quantitative data , we can
make calulation on them Dimension : Descriptive data, information regarding facts
fields3)push down optimization-> Convrt the Transformation logic into sql queries and
push them into eithersource ot target side

4)partitions in informatica-> parallel processing for workflow

5)persistent lookup cache-> Integration service will save the cache file and it will reuse
for the next time

6)dynamic lookup-> its type of lookup cache . whenever IS peocess the lookup every
time the cachewill chnage it is called the lookup cache

7)MD5 function in informatica->You want to write changed data to a database. Use


MD5 to generate checksum valuesfor rows of data you read from a source. When you
run a mapping, compare thepreviously generated checksum values against the new
checksum values. Then, writethe rows with updated checksum values to the target. You
can conclude that anupdated checksum value indicates that the data has changed.

8)why sorter is active transformation->when we check distict option then it will acts as
active
9)why union is active transformation-> it combines two or more data streams into one
an no of rows are not chnaging

10)constraint based load->Constraint-Based Loading is used to load the corresponding


transformed row firstto the primary key table, then to any foreign key tables. In the
Workflow Manager,you can specify constraint-based loading for a session. When you
select thisoption, the Integration Service orders the target load on a row-by-row basis

11)target load order->we can define whihch target will load frist by using target load
order

13)difference between oltp and olap and real time examples-


>https://www.guru99.com/oltp-vs-olap.html

14)sql index ing->It is used to retrive the data fastly from database 15)sql performance
tuning->Reduce Table Size,Simplify Joins,Use SELECT Fields FROM Instead of SELECT
*FROM,Use WHERE Instead of HAVING

16)Analytic functions in sql->rank ,row_num,dense_rank17)delete vs truncate->Delete


Truncate The DELETE command is used to delete specified rows(one or more). While this
command is used to delete all the rows from atable. It is a DML(Data Manipulation
Language) command. While it is a DDL(Data Definition Language) command. WHERE
clause in the DELETE While there may not be WHERE clause in the

18)which is the better one of delete and truncate and why?->TRUNCATE is faster than
DELETE,TRUNCATE is used to delete all the rows from atable

19)do you have knowledge about IICS->NO20)difference between union and union all-
>union: will eliminate the duplicate union all: will not eliminiate duplicate

21)data driven types and update startegy and explain them->To insert, update, or delete
row values by an identifying key, choose the DataDriven Load type.

22)temporary due for pushdown->Allows the PowerCenter Integration Service to create


temporary view objects inthe database when it pushes the session to the
database********************************************************************************

1- How to find duplicates in a table-> Select empId , Count(*) from employees group by
empId having count(*)>12- How to delete duplicates from a table->delete from
EMPLOYEESwhere employee_id in (Select e.employee_id , Count(*) from EMPLOYEES e
group by e.employee_id having count(*)>1 )
3- Difference between union and union all-> uninon will remove the duplicate , but
union all will not remove the duplicate

If you have ever given an SQL interview some of the questions

would be definitely from below list: 4- Difference between rank,row_number and


dense_rank

5- Find records in a table which are not present in anothertable->SELECT


e.employee_idFROM EMPLOYEES eWHERE NOT EXISTS ( SELECT * FROM job_history jh
WHERE e.employee_id =jh.employee_id )

&****************

SELECT *FROM EMPLOYEESWHERE employee_id NOT IN (SELECT employee_id FROM


job_history)

5. second highest salary employees

select e.department_id, e.salary, dense_rank () over ( order by e.salary desc ) rnk from
EMPLOYEES e

6- Find second highest salary employees in each department-> with high_salary as (


select e.department_id, e.salary, dense_rank() over (partition by e.department_id order
by e.salary desc) rnk

from EMPLOYEES e)

select * from high_salary where rnk=2;

7- Find employees with salary more than their manager'ssalary->SELECT *FROM


employees e, employees mWHERE e.manager_id = m.employee_id AND e.salary >
m.salary;

8- Difference between inner and left join->9- update a table and swap gender
values.>update employeesset gender = case gender when 'Male' then 'Female' when
'Female' then 'Male' else 'Other' end

10- Number of records in output with different kinds of join.

Active Transformation – An active transformation changes the number of rows that pass
through the mapping.
Source Qualifier TransformationSorter TransformationsAggregator TransformationsFilter
TransformationUnion TransformationJoiner TransformationNormalizer
TransformationRank TransformationRouter TransformationUpdate Strategy
TransformationAdvanced External Procedure Transformation

Passive Transformation – Passive transformations do not change the number of


rowsthat pass through the mapping.

Expression TransformationSequence Generator TransformationLookup


TransformationStored Procedure TransformationXML Source Qualifier
TransformationExternal Procedure TransformationInput Transformation(Mapplet)Output
Transformation(Mapplet)

You might also like