


















































































Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
A detailed practice exam for SAS programmers working in clinical research. It covers regulatory standards (CDISC SDTM/ADaM), statistical reporting requirements, data cleaning, macro programming, validation procedures, and generation of clinical data deliverables. Candidates encounter realistic clinical trial programming scenarios requiring strong understanding of SAS Base, advanced techniques, and pharmaceutical compliance documentation.
Typology: Exams
1 / 90
This page cannot be seen from the preview
Don't miss anything!



















































































Question 1. Which phase of a clinical trial primarily assesses safety and tolerability in healthy volunteers? A) Phase I B) Phase II C) Phase III D) Phase IV Answer: A Explanation: Phase I trials are the first-in‑human studies, typically enrolling healthy subjects to evaluate safety, tolerability, pharmacokinetics, and pharmacodynamics. Question 2. In a Statistical Analysis Plan (SAP), the term “primary efficacy endpoint” refers to: A) The safety outcome of greatest interest B) The main hypothesis test that determines trial success C) Any secondary analysis performed after the primary analysis D) The demographic characteristic used for stratification Answer: B Explanation: The primary efficacy endpoint is the pre‑specified main outcome that the trial is powered to detect; its analysis determines whether the investigational product meets its intended effect. Question 3. Which CDISC model is used to represent analysis‑ready data sets for statistical analysis? A) SDTM B) ADaM C) ODM D) SEND Answer: B
Explanation: The Analysis Data Model (ADaM) provides standardized structures (e.g., ADSL, BDS) that are derived from SDTM data and ready for statistical analysis. Question 4. In SDTM, the domain that captures subject‑level baseline characteristics such as age, sex, and race is: A) AE B) LB C) DM D) EX Answer: C Explanation: The Demographics (DM) domain stores subject‑level baseline information, which is required for most analyses. Question 5. Which SAS procedure is most appropriate for generating a frequency table of categorical adverse event terms? A) PROC MEANS B) PROC FREQ C) PROC UNIVARIATE D) PROC CORR Answer: B Explanation: PROC FREQ produces frequency counts and percentages for categorical variables, ideal for adverse event listings. Question 6. The 21 CFR Part 11 regulation primarily addresses: A) Informed consent documentation B) Electronic records and electronic signatures
Answer: C Explanation: SYMBOLGEN writes the resolution of macro variables to the SAS log, helping to trace macro execution. Question 10. The ODS statement “ODS PDF FILE='report.pdf' STYLE=STATISTICAL;” performs which action? A) Opens a PDF destination with the STATISTICAL style B) Closes the PDF output file C) Converts a SAS data set to PDF format D) Generates a statistical summary of the data set Answer: A Explanation: ODS PDF opens a PDF destination, directing subsequent output to the specified file and applying the chosen style. Question 11. Which SDTM class includes records for concomitant medications taken by subjects? A) Findings B) Interventions C) Events D) Special Purpose Answer: B Explanation: The Interventions class contains domains like CM (Concomitant Medications) that record treatments administered outside the study drug.
Question 12. The purpose of “Last Observation Carried Forward (LOCF)” imputation is to: A) Replace missing baseline values with the mean of the group B) Carry the most recent non‑missing value forward to subsequent missing visits C) Exclude subjects with any missing data from the analysis D) Randomly assign missing values based on observed distribution Answer: B Explanation: LOCF fills missing post‑baseline measurements with the last observed value, preserving a complete data set for analysis. Question 13. Which PROC can be used to compare two SAS data sets for validation purposes? A) PROC COMPARE B) PROC TRANSPOSE C) PROC CONTENTS D) PROC SORT Answer: A Explanation: PROC COMPARE compares observations and variables between two data sets, reporting differences useful for validation. Question 14. In SAS, the function “INTCK('day', date1, date2)” returns: A) The number of complete months between two dates B) The number of days between two dates (including fractional days) C) The integer number of day intervals between two dates D) The ISO week number for the first date Answer: C
Answer: A Explanation: &DSNAME is an automatic macro variable that contains the name of the data set currently being read or written. Question 18. In the context of GCP, the term “source data verification (SDV)” refers to: A) Checking that the statistical analysis plan matches the protocol B) Comparing data entered into the CRF with original source documents C) Validating the SAS macro code for syntax errors D) Auditing the randomization schedule for balance Answer: B Explanation: SDV involves confirming that data captured in case report forms accurately reflect the original source records (e.g., medical charts). Question 19. Which PROC is most suitable for generating descriptive statistics (mean, std dev, min, max) for continuous laboratory values? A) PROC FREQ B) PROC MEANS C) PROC UNIVARIATE D) PROC TABULATE Answer: B Explanation: PROC MEANS provides basic descriptive statistics for numeric variables, commonly used for lab data summaries. Question 20. The ADaM “Basic Data Structure (BDS)” is characterized by:
A) One record per subject only B) One record per analysis‑time point per subject C) One record per adverse event term D) One record per visit regardless of subject Answer: B Explanation: BDS datasets contain one observation per analysis‑time point (e.g., visit, day) per subject, facilitating repeated‑measure analyses. Question 21. Which SAS system option controls the level of messages (NOTE, WARNING, ERROR) displayed in the log? A) MSGLEVEL=I B) MSGLEVEL=W C) MSGLEVEL=N D) MSGLEVEL=Q Answer: B Explanation: MSGLEVEL=W displays warnings and notes; setting to I would show only informational messages, while Q suppresses notes. Question 22. When creating a variable label in a DATA step, which statement is used? A) ATTRIB var= “label”; B) LABEL var= “label”; C) FORMAT var= “label”; D) LENGTH var= “label”; Answer: B Explanation: The LABEL statement assigns a descriptive label to a variable, which appears in output listings.
Answer: A Explanation: PUTN allows dynamic selection of a numeric format at run time, returning a character string. Question 26. In PROC REPORT, the “DEFINE” statement with the option “COMPUTED” indicates that: A) The variable is computed in a COMPUTE block B) The variable is read‑only from the data set C) The variable will be used as a grouping variable D) The variable is a summary statistic automatically generated Answer: A Explanation: COMUTED tells PROC REPORT that the column’s values will be created in a COMPUTE block rather than read directly from the data set. Question 27. Which SDTM domain would you use to store vital signs measurements? A) VS B) LB C) EX D) DS Answer: A Explanation: The VS (Vital Signs) domain captures measurements such as blood pressure, pulse, and temperature. Question 28. The macro variable “SYSDATE” contains: A) The current system date in DATE9. format B) The name of the operating system
C) The current SAS session ID D) The date and time stamp of the last PROC PRINT Answer: A Explanation: SYSDATE is an automatic macro variable that stores the current date as a character string (e.g., 01JAN2024). Question 29. Which PROC statement creates a permanent SAS data set in a library called “clinical”? A) DATA clinical.mydata; B) PROC SAVE DATA=clinical.mydata; C) LIBNAME clinical ‘/path’; D) PROC EXPORT DATA=clinical.mydata; Answer: A Explanation: The DATA step with a two‑level name (library.dataset) writes a permanent data set to the specified library. Question 30. When using PROC GLM for an analysis of covariance (ANCOVA), which statement correctly specifies a covariate? A) CLASS covar; B) MODEL response = treatment covar; C) CONTRAST covar; D) LSMEANS covar; Answer: B Explanation: In PROC GLM, covariates are included as continuous predictors directly in the MODEL statement. Question 31. In a CDISC‑compliant ADaM data set, the variable “ANL01FL” is typically used to:
Question 34. Which of the following best describes the purpose of the “RETAIN” statement? A) To keep a variable’s value from the previous observation in the current one B) To delete duplicate observations C) To enforce a variable’s length across the data set D) To sort the data set in descending order Answer: A Explanation: RETAIN holds the value of a variable across iterations of the DATA step, preventing automatic resetting to missing. Question 35. In the context of ICH E6(R2) GCP, the term “source document” refers to: A) The final statistical analysis report B) Original records (e.g., medical charts) that contain trial data C) The SAS program used for data cleaning D) The informed consent form only Answer: B Explanation: Source documents are the original records where data are first captured; they are the basis for CRF entry and subsequent data verification. Question 36. Which SAS option enables the generation of an HTML ODS destination? A) ODS HTML; B) PROC HTML; C) LIBNAME HTML; D) DATA HTML; Answer: A
Explanation: The ODS HTML statement opens an HTML destination for SAS output. Question 37. When using PROC LOGISTIC for a binary outcome, which option requests odds ratios for each predictor? A) OR; B) ODDSRATIO; C) LSMEANS; D) ESTIMATE; Answer: B Explanation: The ODDSRATIO option adds odds ratio estimates for the model parameters to the output. Question 38. In SAS, the “INPUT” statement is used within a DATA step to: A) Write data to a permanent data set B) Read raw data records from an external file C) Define variable attributes D) Sort observations Answer: B Explanation: INPUT reads raw data from external files (e.g., text, CSV) according to specified informats. Question 39. Which macro function returns the number of macro parameters passed to a macro? A) %NRMACS B) %PARMCNT C) %SYSFUNC(countw) D) %NUMARGS
Answer: D Explanation: %END terminates a %DO loop in macro language. Question 43. Which SAS function can be used to convert a SAS date value to a character string in the format YYYY‑MM‑DD? A) DATEPART() B) PUTN(date, yymmdd10.) C) INPUT(date, yymmdd10.) D) TODAY() Answer: B Explanation: PUTN with the yymmdd10. format converts a numeric SAS date to the desired character representation. Question 44. In the context of a SAP, “interim analysis” refers to: A) The final analysis after database lock B) An analysis performed before study completion to assess early efficacy or safety C) A descriptive summary of baseline characteristics only D) The process of data cleaning prior to any analysis Answer: B Explanation: Interim analyses are pre‑planned evaluations conducted before the trial ends, often for early stopping decisions. Question 45. Which of the following statements about the SDTM “EX” domain is true?
A) It stores all adverse events reported during the study B) It contains information about study drug administration (dose, route) C) It records laboratory test results D) It documents subject disposition (e.g., withdrawal) Answer: B Explanation: The EX (Exposure) domain captures details of investigational product administration, including dose, timing, and route. Question 46. When using PROC TRANSPOSE, the “PREFIX=” option is used to: A) Define the name of the output data set B) Set a prefix for the generated variable names C) Specify the variable that provides column headers D) Indicate the BY variables for transposition Answer: B Explanation: PREFIX= adds a character string before the generated variable names (e.g., COL1, COL2). Question 47. The SAS system option “NOXWAIT” is used to: A) Prevent SAS from waiting for user input when launching external programs B) Disable the generation of the SAS log file C) Turn off macro variable resolution D) Suppress warning messages in the log Answer: A Explanation: NOXWAIT tells SAS not to wait for the user to close an external program (e.g., Excel) before returning control.
Explanation: PROC MIXED fits linear mixed‑effects models, handling correlated repeated measures and random effects. Question 51. The “%INCLUDE” statement in SAS is used to: A) Import external data files directly into a data set B) Insert the contents of an external SAS program into the current program C) Create a new library reference D) Generate a PDF report from ODS output Answer: B Explanation: %INCLUDE reads and executes code from an external file as if it were typed in place. Question 52. In ADaM, the variable “ANL01FL” is set to “Y” for subjects that: A) Have missing efficacy data B) Meet the criteria for the primary analysis population C) Were randomized but not treated D) Have a safety adverse event Answer: B Explanation: ANL01FL flags subjects belonging to a defined analysis set, such as the Intent‑to‑Treat (ITT) population. Question 53. Which SAS function can be used to extract the year component from a SAS date value? A) YEAR() B) YEARPART() C) INTCK('year',.,.)
D) SUBSTR(date,1,4) Answer: A Explanation: YEAR() returns the four‑digit year from a SAS date value. Question 54. When generating a table of counts for each treatment group, which PROC option automatically adds a total row? A) NOCUM B) TOTALLABEL= C) MISSING D) LISTALL Answer: B Explanation: In PROC REPORT, the TOTALLABEL= option specifies the label for the total row added by a BREAK or RBREAK statement. Question 55. Which CDISC model is primarily used for non‑clinical toxicology studies? A) SEND B) ADaM C) SDTM D) ODM Answer: A Explanation: SEND (Standard for Exchange of Nonclinical Data) is the CDISC model for pre‑clinical toxicology data. Question 56. The SAS system option “OBS=10” instructs SAS to: A) Process only the first 10 observations of each data set