Ab Initio Interview Questions Summer Internship

3) What is the role of the Co-operating System in Abinitio?

The Abinitio Co-operating System provides features such as:

  • Manage and run Abinitio graphs and control the ETL processes
  • Provide Ab initio extensions to the operating system
  • ETL process monitoring and debugging
  • Metadata management and interaction with the EME

7) List the file extensions used in Abinitio.

The file extensions used in Abinitio are:

  • .mp: Ab initio graph or graph component
  • .mpc: Custom component or program
  • .mdc: Dataset or custom data-set component
  • .dml: Data manipulation language file or record type definition
  • .xfr: Transform function file
  • .dat: Data file (multifile or serial file)
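
As a quick way to remember the list above, the extensions can be written as a lookup table. This is only an illustrative sketch: the `ABINITIO_EXTENSIONS` mapping and the `describe` helper are hypothetical names, not part of any Ab Initio tool.

```python
from pathlib import Path

# Descriptions are taken from the list above; the mapping is illustrative only.
ABINITIO_EXTENSIONS = {
    ".mp": "graph or graph component",
    ".mpc": "custom component or program",
    ".mdc": "dataset or custom dataset component",
    ".dml": "data manipulation language file or record type definition",
    ".xfr": "transform function file",
    ".dat": "data file (multifile or serial file)",
}


def describe(filename: str) -> str:
    """Return what a file is likely to contain, judged by its extension."""
    suffix = Path(filename).suffix.lower()
    return ABINITIO_EXTENSIONS.get(suffix, "unknown file type")
```

For example, `describe("abc.mp")` looks up `.mp` and returns the graph description, while an unrecognized extension falls back to `"unknown file type"`.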

4) Explain what dependency analysis means in Abinitio.

In Ab initio, dependency analysis is the process through which the EME examines an entire project and traces how data is transferred and transformed, from component to component and field by field, within and between graphs.
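
The idea of tracing data field by field can be pictured as walking a small lineage graph. The sketch below is a toy model under stated assumptions: the field names, the `EDGES` list, and `upstream_fields` are all hypothetical, and this is not how the EME stores or exposes its metadata.

```python
from collections import defaultdict

# Hypothetical field-level edges: (source field, field derived from it).
EDGES = [
    ("customers.dat:first_name", "reformat:full_name"),
    ("customers.dat:last_name", "reformat:full_name"),
    ("reformat:full_name", "report.dat:customer_name"),
    ("orders.dat:amount", "rollup:total_amount"),
    ("rollup:total_amount", "report.dat:total_spend"),
]


def upstream_fields(target, edges):
    """Return every field that feeds, directly or indirectly, into target."""
    parents = defaultdict(set)
    for source, derived in edges:
        parents[derived].add(source)

    seen = set()
    stack = [target]
    while stack:
        field = stack.pop()
        for parent in parents.get(field, ()):
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen
```

Calling `upstream_fields("report.dat:customer_name", EDGES)` walks back through the reformat step to the two customer name fields, which is the kind of question dependency analysis is meant to answer.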

  • Data Integration Portion
  • User Interface (access to the metadata information)

    1) Suppose you find the term Validation mentioned with a set of data. What does that represent?

    It means that the data is clean and correct and can therefore be used reliably. Data validation is widely regarded as one of the key stages in a data processing system.

    1) What are the key elements of a data processing system?

    These are the Converter, Aggregator, Validator, Analyzer, Summarizer, and Sorter.

    9) Explain how you can run a graph infinitely in Ab initio.

    To run a graph infinitely, the graph's end script should call the graph's .ksh file. For example, if the graph is named abc.mp, its end script should call abc.ksh. Each run then starts the next one when it finishes, so the graph runs indefinitely.

    Abinitio graphs can use three kinds of parallelism:

  • Component parallelism: A graph with multiple processes executing simultaneously on separate data uses component parallelism.
  • Data parallelism: A graph that works with data divided into segments and operates on each segment separately uses data parallelism.
  • Pipeline parallelism: A graph with multiple components executing simultaneously on the same data uses pipeline parallelism. Each component in the pipeline reads continuously from the upstream component, processes the data, and writes to the downstream component, so the components can operate in parallel.
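
Data and pipeline parallelism can be pictured outside Abinitio with a small Python sketch. Everything here is an assumption made for illustration: the function names are hypothetical, round-robin is just one possible way to partition, and the generator chain only mimics the record-at-a-time streaming of a pipeline rather than running its stages concurrently.

```python
from concurrent.futures import ThreadPoolExecutor


def partition_round_robin(records, ways):
    """Split records into `ways` partitions, dealing them out in turn."""
    partitions = [[] for _ in range(ways)]
    for index, record in enumerate(records):
        partitions[index % ways].append(record)
    return partitions


def double_all(partition):
    """The same logic, applied to one partition independently of the others."""
    return [value * 2 for value in partition]


def run_data_parallel(records, ways=4):
    """Data parallelism: every partition is processed by its own worker."""
    partitions = partition_round_robin(records, ways)
    with ThreadPoolExecutor(max_workers=ways) as pool:
        return list(pool.map(double_all, partitions))


# Pipeline-style streaming: each stage consumes records from the stage
# upstream and hands them downstream one at a time.
def read_records(values):
    for value in values:
        yield value


def keep_even(stream):
    for value in stream:
        if value % 2 == 0:
            yield value


def add_one(stream):
    for value in stream:
        yield value + 1
```

`run_data_parallel(list(range(8)), ways=4)` processes four partitions of two records each, and `list(add_one(keep_even(read_records(range(6)))))` pushes each record through all three stages without any stage waiting for the full input.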

    What is Abinitio?

    Whether you are a fresher or an experienced professional, this is one of the first questions asked in every Abinitio interview, so explain the concept thoroughly. Abinitio (meaning "from the beginning") is a tool used for data processing. It is also used for data manipulation, data analysis, and graphical user interface based parallel processing.

    Explain that to run a graph infinitely in Abinitio, the graph's end script should call the graph's .ksh file. For example, if the graph is named xyz.mp, its end script should call xyz.ksh.

    As an experienced professional, you need to know the fundamental concepts of Abinitio. The functions of these two components are:

    ● The Dedup component removes duplicate records.
    ● The Replicate component combines the data records from its inputs into one flow and writes a copy of that flow to each of its output ports.
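
The behaviour of these two components can be sketched in plain Python. This is a rough analogy only, with hypothetical function names: it assumes the input is already grouped by key, that the first record of each group is the one kept, and that Replicate has a single input flow.

```python
def dedup_sorted(records, key):
    """Keep the first record of each run of consecutive records sharing a key."""
    output = []
    previous = object()  # sentinel that never equals a real key value
    for record in records:
        current = key(record)
        if current != previous:
            output.append(record)
            previous = current
    return output


def replicate(records, outputs):
    """Return `outputs` independent copies of the input flow."""
    flow = list(records)
    return [list(flow) for _ in range(outputs)]
```

With rows sorted on `id`, `dedup_sorted(rows, key=lambda r: r["id"])` drops every repeat of an `id`, and `replicate(rows, 2)` yields two lists holding the same records, one per output port.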

    By asking this question, the interviewer wants to check your working knowledge of Abinitio. Talk about the layouts that Abinitio supports:

    ● Serial and parallel layouts.
    ● A multifile system.
    ● A graph layout that supports both serial and parallel layouts at the same time.
    ● A component in a graph that can run 4-way parallel.
