PALANTIR DATA ENGINEERING CERTIFICATION EXAM LATEST, Exams of Computer Science

Typology: Exams

2025/2026

Available from 05/09/2026

Tutornurse



  1. What actions are performed when the ModelOutput.publish() method is called in Foundry's Code Repositories? Select two.
     Options:
     - It serializes the model using the ModelAdapter.save() method.
     - It initializes the model adapter with the fresh model.
     - It runs the model inference.
     - It creates a new model version.
     Answer:
     - It serializes the model using the ModelAdapter.save() method.
     - It creates a new model version.
  2. Which of the following statements correctly describes the behavior of the FileSystem.open() method in Foundry Transforms?
     Options:
     - It allows random access to any part of that file.
     - It automatically infers the file schema upon opening.
     - It returns a writable stream by default.
     - It provides a read-only stream without support for seek or tell methods.
     Answer:
     - It provides a read-only stream without support for seek or tell methods.
  3. Which of the following are recommended practices for chaining expressions in PySpark to enhance code readability? Select two.
     Options:
     - Isolate each logical group of transformations into separate code blocks.
     - Chain as many expressions as possible for conciseness.
     - Use backslashes (\) for line breaks in chains.
     - Limit chains to a maximum of 5 statements.
     - Extract complex logic into separate functions.
     - Nest multiple chains within a single expression block.
     Answer:
     - Limit chains to a maximum of 5 statements.
     - Extract complex logic into separate functions.
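For the FileSystem.open() question above, the practical consequence of a read-only stream without seek or tell is that a file has to be consumed front to back, or copied into memory first when random access is needed. The sketch below imitates such a stream using only the standard library. `ForwardOnlyStream` and `load_for_random_access` are hypothetical names, and no Foundry API is called.

```python
import io


class ForwardOnlyStream(io.RawIOBase):
    """Hypothetical stand-in for a read-only stream that cannot seek or tell."""

    def __init__(self, payload: bytes):
        super().__init__()
        self._inner = io.BytesIO(payload)

    def readable(self) -> bool:
        return True

    def seekable(self) -> bool:
        # seek() and tell() inherited from the base class raise UnsupportedOperation.
        return False

    def readinto(self, buffer) -> int:
        # Only sequential reads are offered.
        return self._inner.readinto(buffer)


def load_for_random_access(stream) -> io.BytesIO:
    """Copy a forward-only stream into memory so it can be seeked and re-read."""
    return io.BytesIO(stream.read())
```

Buffering the whole file is only reasonable for files that fit in memory; larger files are better processed line by line in a single pass.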

  1. You need to inject a TransformContext into your Transform's compute function to access the current Spark session. How should you define the parameters of your compute function?
     Options:
     - def compute(context, input, output):
     - def compute(input, output):
     - def compute(input, output, ctx):
     - def compute(ctx, input, output):
     Answer:
     - def compute(ctx, input, output):
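The answer turns on the parameter name `ctx`. As a rough illustration of how injection by parameter name can work in general, here is a toy dispatcher in plain Python. It is not Foundry's implementation: `call_compute` and `FakeContext` are hypothetical, and only `inspect.signature` from the standard library is used.

```python
import inspect


class FakeContext:
    """Hypothetical stand-in for a context object that exposes a session handle."""

    spark_session = "session-placeholder"


def call_compute(compute, inputs: dict, outputs: dict):
    """Call `compute`, binding each parameter by name.

    A parameter named `ctx` receives the context; any other name must match
    a declared input or output, otherwise the call is rejected.
    """
    kwargs = {}
    for name in inspect.signature(compute).parameters:
        if name == "ctx":
            kwargs[name] = FakeContext()
        elif name in inputs:
            kwargs[name] = inputs[name]
        elif name in outputs:
            kwargs[name] = outputs[name]
        else:
            raise TypeError(f"no binding for parameter {name!r}")
    return compute(**kwargs)


def compute(ctx, source, target):
    # The context arrives because the parameter is literally named `ctx`.
    return ctx.spark_session, source, target
```

In this toy model a parameter spelled `context` has no binding and is rejected, which mirrors why the option using `context` is not the expected answer.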


Options:
- Running PySpark commands in the console
- Previewing intermediate dataframes at breakpoints
- Automatically fixing variable values
- Changing the programming language of the transform on the fly
- Editing the source code directly from the debugger
- Navigating frames to examine variables
Answer:
- Running PySpark commands in the console
- Previewing intermediate dataframes at breakpoints
- Navigating frames to examine variables

  1. Which type of pipeline in Foundry typically has the lowest compute cost?
     Options:
     - All have similar compute costs
     - Streaming
     - Batch
     - Incremental
     Answer:
     - Incremental
  2. In Foundry, which schema field type requires specifying both precision and scale parameters?
     Options:
     - ARRAY
     - DECIMAL
     - DATE
     - STRING
     Answer:
     - DECIMAL
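Precision is the total number of digits and scale is the number of digits after the decimal point, so DECIMAL(10, 2) holds values up to 99999999.99. A minimal sketch with Python's `decimal` module follows. `fits_decimal` is a hypothetical helper for illustration, not a Foundry or Spark function.

```python
from decimal import Decimal


def fits_decimal(value: str, precision: int, scale: int) -> bool:
    """Return True if `value`, as written, fits DECIMAL(precision, scale).

    Trailing zeros count as digits, so "1.20" needs a scale of at least 2.
    """
    sign, digits, exponent = Decimal(value).as_tuple()
    if not isinstance(exponent, int):  # NaN or Infinity
        return False
    fractional = max(-exponent, 0)             # digits after the decimal point
    integral = max(len(digits) + exponent, 0)  # digits before the decimal point
    return fractional <= scale and integral <= precision - scale
```

For example, `fits_decimal("12345678.90", 10, 2)` is true, while `fits_decimal("123456789.00", 10, 2)` is false because nine integral digits exceed the eight that precision 10 and scale 2 leave available.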

  1. Which of the following are stages included in the condaPackRun task used in CI checks for a Python repository in Foundry? Select three.
     Options:
     - Run unit tests using PyTest
     - Upload artifacts to a remote server
     - Download and extract all packages in the solved environment
     - Compile the Python source code
     - Link packages into the environment
     - Verify package contents
     Answer:
     - Download and extract all packages in the solved environment
     - Link packages into the environment
     - Verify package contents


Keep logic expressions inside the same code block to 3 expressions at most.

  1. You are developing a Transform in Foundry that processes input dataframes using PySpark and needs to output multiple datasets based on different filters. Which decorator should you use to define this Transform?
     Options:
     - @transform_df
     - @transform_pandas
     - @transform_file
     - @transform
     Answer:
     - @transform
  2. You are performing a left join between two DataFrames in PySpark, but realize that the right DataFrame may have multiple matches for some keys, leading to duplicate rows in the output. According to the style guide, what should you do to prevent this 'join explosion'?
     Options:
     - Use .dropDuplicates() after the join
     - Switch to an inner join to avoid duplicates
     - Ensure the join key in the right DataFrame is unique
     - Use a right join instead of a left join
     Answer:
     - Ensure the join key in the right DataFrame is unique
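To make the 'join explosion' concrete, the sketch below uses plain Python lists of dicts as a stand-in for DataFrames. `duplicate_keys` and `left_join` are hypothetical helpers, and the sample rows are invented for illustration. The point is that checking the right side's key for duplicates before joining catches the problem at its source.

```python
from collections import Counter


def duplicate_keys(rows, key):
    """Return the key values that occur more than once in `rows`."""
    counts = Counter(row[key] for row in rows)
    return sorted(value for value, n in counts.items() if n > 1)


def left_join(left, right, key):
    """Naive left join: one output row per matching right row, or the bare left row."""
    output = []
    for l_row in left:
        matches = [r for r in right if r[key] == l_row[key]]
        if not matches:
            output.append(dict(l_row))
        for r_row in matches:
            output.append({**l_row, **r_row})
    return output


# Invented sample data: customer 1 appears twice on the right side.
orders = [{"customer_id": 1, "order": "A"}, {"customer_id": 2, "order": "B"}]
customers = [
    {"customer_id": 1, "name": "Ada"},
    {"customer_id": 1, "name": "Ada (duplicate)"},
    {"customer_id": 2, "name": "Bo"},
]
```

Joining `orders` to `customers` here yields three rows from two orders. Once the duplicate customer row is removed, the join returns exactly one row per order.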

  1. Which of the following are considered bad practices when performing joins in PySpark?
     Options:
     - Using dataframe aliases to disambiguate column names.
     - Dropping unnecessary columns after the join.
     - Ensuring the key you join on is unique when performing left joins.
     - Using right joins.
     - Explicitly specifying the join type.
     - Allowing expressions that duplicate columns in the output.
     Answer:
     - Using right joins.
     - Allowing expressions that duplicate columns in the output.
  2. Which of the following is the correct sequence of steps to configure a direct connection in Foundry's managed SaaS platform?


Direct Connection

  1. What is the minimum recommended amount of RAM for a Foundry agent host?
     Options:
     - 12 GB
     - 8 GB
     - 32 GB
     - 16 GB
     Answer:
     - 16 GB
  2. Which of the following are part of securing a Foundry agent host? Select two.
     Options:
     - Allow all inbound traffic to facilitate connectivity.
     - Allow network traffic only from specific IPs.
     - Open all ports for flexibility.
     - Install antivirus software on the host.
     - Ensure the agent host can talk to Palantir.
     - Configure the firewall to block all traffic except to desired destinations.
     Answer:
     - Ensure the agent host can talk to Palantir.
     - Configure the firewall to block all traffic except to desired destinations.


  1. A data engineer needs to integrate data from various legacy systems into Palantir AIP without modifying the existing data formats. Which feature of Palantir AIP facilitates this seamless integration?
     Options:
     - Metadata Services
     - Virtual Tables
     - REST Interfaces
     - Palantir HyperAuto Pipelines
     Answer:
     - Virtual Tables
  2. Which of the following actions can be performed after successfully syncing a table range from a Fusion sheet to a dataset in Foundry? Select three.
     Options:
     - Change the branch of the dataset.
     - Modify the export column type to match desired data types.
     - Delete the original Fusion sheet without affecting the dataset.
     - Use both sheet sync and table sync on the same Fusion sheet.
     - Automatically merge changes from multiple Fusion sheets.
     - Rename the synced dataset.


Define object properties Capture data from operators Orchestrate decision-making processes

  1. You are responsible for syncing a specific range of data from a Fusion spreadsheet to a dataset in Foundry to be used by Contour. After selecting the desired table range and initiating the sync, what must you ensure to avoid synchronization issues?
     Options:
     - Ensure that the dataset has Viewer permissions.
     - Export the synced data as a CSV file immediately after syncing.
     - Only use table sync without any sheet sync in the Fusion sheet.
     - Use both sheet sync and table sync within the same Fusion sheet.
     Answer:
     - Only use table sync without any sheet sync in the Fusion sheet.
  2. When syncing a table range from a Fusion sheet to a dataset in Foundry, which of the following conditions must be met to ensure that future changes in the spreadsheet are reflected in the dataset?
     Options:
     - The user must have at least Editor permissions on the dataset.
     - The dataset must be exported as a CSV file after each sync.
     - Both sheet sync and table sync must be enabled concurrently.
     - The user must have Viewer permissions on the dataset.
     Answer:
     - The user must have at least Editor permissions on the dataset.

  1. When developing a transform for unstructured datasets in Foundry, which of the following approaches is most effective for parsing semi-structured data like JSON or XML files?
     Options:
     - Converting the unstructured data into plain text before processing.
     - Storing the unstructured data as binary blobs without parsing.
     - Leveraging custom Python or Java code within the transform to handle parsing.
     - Using built-in SQL functions to parse the data directly.
     Answer:
     - Leveraging custom Python or Java code within the transform to handle parsing.
  2. Which role is required to configure network egress policies in Foundry's managed SaaS platform?
     Options:
     - Information Security Officer
     - User
     - Project Admin
     - Data Pipeline Developer
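For the semi-structured parsing question above, "custom Python code" usually comes down to small parsing functions applied to each raw file's bytes. The sketch below shows two such functions using only the standard library. `parse_json_lines` and `parse_xml_records` are hypothetical names, the file layouts (newline-delimited JSON, flat XML records) are assumptions, and the Foundry wiring that would supply the bytes is left out.

```python
import json
import xml.etree.ElementTree as ET


def parse_json_lines(raw: bytes):
    """Parse newline-delimited JSON into a list of dicts, skipping blank lines."""
    text = raw.decode("utf-8")
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def parse_xml_records(raw: bytes, record_tag: str):
    """Flatten each <record_tag> element into a dict of child tag -> text."""
    root = ET.fromstring(raw)
    return [
        {child.tag: child.text for child in record}
        for record in root.iter(record_tag)
    ]
```

Each function returns plain dicts, which can then be turned into rows with whatever schema the output dataset expects.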


Avoiding documentation to keep the pipeline simple. Manually verifying each pipeline run for consistency. Ensuring that data transformations preserve the integrity of semantic relationships. Implementing error handling to manage discrepancies between data sources and ontology requirements. Aligning pipeline logic with the ontology's entity and relationship definitions. Ensuring that data transformations preserve the integrity of semantic relationships.

  1. You are assigned to maintain a critical data pipeline in Foundry that has been experiencing intermittent failures. To ensure timely resolution and support, which of the following support structures should you establish?
     Options:
     - Implement a ticketing system for tracking support requests and resolutions.
     - Create detailed documentation outlining common issues and troubleshooting steps.
     - Set up automated alerting for pipeline failures and performance issues.
     - Restrict access to the pipeline only to senior data engineers.
     Answer:
     - Implement a ticketing system for tracking support requests and resolutions.
     - Create detailed documentation outlining common issues and troubleshooting steps.
     - Set up automated alerting for pipeline failures and performance issues.

  1. A data scientist wants to leverage their existing Jupyter notebooks within Palantir AIP for data analysis without switching to a different interface. Which feature of Palantir AIP should they utilize to achieve this?
     Options:
     - REST Interfaces
     - Virtual Tables
     - Palantir HyperAuto Pipelines
     - Code Workspaces
     Answer:
     - Code Workspaces
  2. What are the kinetic elements in the Palantir Ontology?


The runtime environment configuration All functions within the Transform All modules the Transform depends on The module where the Transform is defined Any project dependencies

  1. When would you choose to use the 'Merge with fast-forward' mode in Foundry's Code Repositories?
     Options:
     - When you need to create a new commit that combines all changes from the pull request.
     - When the target branch has diverged significantly from the source branch.
     - When you want to maintain a detailed commit history with merge commits.
     - When there are no additional changes on the target branch and you want a linear commit history.
     Answer:
     - When there are no additional changes on the target branch and you want a linear commit history.
  2. You want to leverage distributed processing in Foundry Transforms to handle files of varying sizes efficiently. Which Spark configuration properties should you adjust to control the partitioning of the FileStatus DataFrame? Select two.
     Options:
     - spark.executor.cores
     - spark.executor.memory
     - spark.sql.files.openCostInBytes
     - spark.driver.memory
     - spark.sql.files.maxPartitionBytes
     Answer:
     - spark.sql.files.openCostInBytes
     - spark.sql.files.maxPartitionBytes
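To build intuition for why these two properties shape the partitioning, here is a simplified model in plain Python. It assumes that each file is weighted as its size plus a fixed open cost and that files are grouped until a partition would exceed the byte cap. `pack_files` is a hypothetical helper. It is neither Spark's nor Foundry's actual algorithm, and the numbers in the usage note are invented.

```python
def pack_files(file_sizes, max_partition_bytes, open_cost_in_bytes):
    """Greedily group files so each partition's weighted size stays within the cap.

    A file's weight is its size plus a fixed open cost. A file heavier than
    the cap still gets a partition, which it has to itself.
    """
    partitions, current, current_weight = [], [], 0
    for size in file_sizes:
        weight = size + open_cost_in_bytes
        # Close the current partition if adding this file would exceed the cap.
        if current and current_weight + weight > max_partition_bytes:
            partitions.append(current)
            current, current_weight = [], 0
        current.append(size)
        current_weight += weight
    if current:
        partitions.append(current)
    return partitions
```

With a cap of 50 and an open cost of 5, the sizes `[10, 10, 10, 100]` pack into two partitions. Raising the open cost to 20 makes each small file count for more, so every file ends up in its own partition.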

  1. In a Foundry Pipeline, you need to generate multiple output datasets from a single input dataset by filtering based on different criteria. Which feature of the Transforms API allows you to accomplish this efficiently?
     Options:
     - Multiple-output Transforms
     - Transform logic level versioning (TLLV)
     - Transform generation using for-loops
     - TransformContext injection
     Answer:
     - Multiple-output Transforms