A comprehensive set of questions and answers related to the Relativity Processing certification. It covers various aspects of the software, including processing profiles, extraction settings, deduplication methods, and OCR options. Valuable for individuals preparing for the Relativity Processing certification exam.
Typology: Exams
Processing Profile - Answer An object that stores the numbering, deNIST, extraction, and deduplication settings that the processing engine refers to when publishing the documents in each data source that you attach to your processing set. You can create a profile specifically for one set or you can reuse the same profile for multiple sets.

Extract children - Answer Arranges for the removal of child items during discovery, including attachments, embedded objects and images, and other non-parent files. The options are:
Yes - extracts all children files during discovery so that both children and parents are included in the processing job.
No - does not extract children, so that only parents are included in the processing job.

Email Output - Answer Determines the file format in which emails will be published to the workspace. The options are:
MSG - publishes all emails as MSG to the workspace.
MHT - converts all emails in your data source from MSG to MHT and publishes them to the workspace.

Numbering Type - Answer Determines how the documents in each data source are numbered when published to the workspace. This field gives you the option of defining your document numbering schema. This is useful when you're importing from alternative sources and you need to keep your document numbering consistent. The choices for this field are:
Auto Numbering - determines that the next published document will be identified by the next available number of that prefix.
Define Start Number - lets you define the starting number of the documents you intend to publish to the workspace.

Default Start Number - Answer The starting number for documents that are published from the processing set(s) that use this profile. This field is only visible if you selected the Define Start Number choice for the Numbering Type field above. If you use a different start number for the Start Number field on the data source that you attach to the processing set, that number takes precedence over the value you enter here. The maximum value you can enter here is 2,147,483,647. If you enter a higher value, you'll receive an Invalid Integer warning next to the field value and you won't be able to save the profile.

Number of Digits (padded with zeros) - Answer Determines how many digits the document's control number contains. The range of available values is 1 to 10. By default, this field is set to 10 characters.

Parent/Child Numbering - Answer Determines how parent and child documents are numbered relative to each other when published to the workspace. The choices for this field are:
Suffix Always - arranges for child documents to be appended to their parent with a delimiter.
Continuous Always - arranges for child documents to receive a sequential control number after their parent.
Continuous, Suffix on Retry - arranges for child documents to receive a sequential control number after their parent, except for child documents that weren't published to the workspace. When these unpublished child documents are retried and published, they will receive the parent's number with a suffix. If you resolve the error post-publish, the control number doesn't change.

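The numbering settings above can be illustrated with a short Python sketch. It is only a reading of the card text, not Relativity's implementation or API: the helper names, the `REL` prefix, the `.` delimiter, and the plain counter used as the suffix are all assumptions made for illustration, and the real suffix format may differ.

```python
# Largest start number the profile accepts, per the Default Start Number card.
MAX_START_NUMBER = 2_147_483_647


def control_number(prefix: str, number: int, digits: int = 10) -> str:
    """Format prefix + zero-padded number (hypothetical helper, not a Relativity API)."""
    if not 1 <= digits <= 10:
        raise ValueError("Number of Digits must be between 1 and 10")
    if not 0 <= number <= MAX_START_NUMBER:
        raise ValueError("Invalid Integer: number out of range")
    return f"{prefix}{number:0{digits}d}"


def number_family(prefix, start, child_count, mode, digits=10, delimiter="."):
    """Return control numbers for one parent followed by its children.

    mode is "suffix" (Suffix Always) or "continuous" (Continuous Always).
    The delimiter and suffix counter are assumptions for illustration.
    """
    parent = control_number(prefix, start, digits)
    if mode == "suffix":
        # Children reuse the parent's number plus a delimiter and a counter.
        children = [f"{parent}{delimiter}{i}" for i in range(1, child_count + 1)]
    elif mode == "continuous":
        # Children take the next sequential numbers after the parent.
        children = [
            control_number(prefix, start + i, digits)
            for i in range(1, child_count + 1)
        ]
    else:
        raise ValueError(f"unknown mode: {mode}")
    return [parent] + children


if __name__ == "__main__":
    print(number_family("REL", 1, 2, "suffix", digits=4))
    print(number_family("REL", 1, 2, "continuous", digits=4))
```

With Suffix Always, the next parent would continue from the number after the current parent; with Continuous Always, it would continue from the number after the last child. That difference is the practical reason to pick one scheme before publishing.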
Native (failover to dtSearch) - tells Relativity to use PowerPoint to extract text from PowerPoint files, with dtSearch as a backup text extraction method if Native text extraction is unsuccessful.
dtSearch (failover to Native) - tells Relativity to use dtSearch to extract text from PowerPoint files, with Native as a backup text extraction method if dtSearch text extraction is unsuccessful. This typically results in faster extraction speeds; however, we recommend considering some differences between dtSearch and Native extraction. For example, dtSearch doesn't support extracting watermarks from pre-2007 PowerPoint files, and certain metadata fields aren't populated when using dtSearch.

Excel Text Extraction Method - Answer Determines whether the processing engine uses Excel or dtSearch to extract text from Excel files during publish.
Native - tells Relativity to use Excel to extract text from Excel files.
Native (failover to dtSearch) - tells Relativity to use Excel to extract text from Excel files, with dtSearch as a backup text extraction method if Native text extraction is unsuccessful.
dtSearch (failover to Native) - tells Relativity to use dtSearch to extract text from Excel files, with Native as a backup text extraction method if dtSearch text extraction is unsuccessful. This typically results in faster extraction speeds; however, we recommend considering some differences between dtSearch and Native extraction. For example, dtSearch doesn't support extracting the Track Changes text from Excel files.

Excel Header/Footer Extraction - Answer Extracts header and footer information from Excel files when you publish them. This is useful for instances in which the header and footer information in your Excel files is relevant to the case.
This field isn't available if you selected dtSearch for the Excel Text Extraction Method field above, because dtSearch automatically extracts header and footer information and places it at the end of the text; if you selected a value for this field and then select dtSearch above, your selection here is nullified. The options are:
Do not extract - doesn't extract any of the header or footer information from the Excel files and publishes the files with the header and footer in their normal positions. This option is selected by default; however, if you change the value for the Excel Text Extraction Method field above from dtSearch back to Native, this option will be de-selected and you'll have to select one of these options in order to save the profile.
Extract and place at end - extracts the header and footer information and stacks the header on top of the footer at the end of the text of each sheet of the Excel file. The native file will still have its header and footer.
Extract and place inline (slows text extraction) - extracts the header and footer information and puts it inline into the file. The header appears inline directly above the text in each sheet of the file, while the footer appears directly below the text. This could impact text extraction performance if your data set includes many Excel files with headers and footers. The native file will still have its header and footer.

OCR - Answer Select Enable to run OCR during processing. If you select Disable, Relativity won't provide any OCR text in the Extracted Text view.

OCR Accuracy - Answer Determines the desired accuracy of your OCR results and the speed with which you want the job completed. This drop-down menu contains three options:
High (Slowest Speed) - runs the OCR job with the highest accuracy and the slowest speed.
Medium (Average Speed) - runs the OCR job with medium accuracy and average speed.
Low (Fastest Speed) - runs the OCR job with the lowest accuracy and fastest speed.

OCR Text Separator - Answer Select Enable to display a separator between extracted text at the top of a page and text derived from OCR at the bottom of the page in the Extracted Text view. The separator reads as "--- OCR From Images ---". With the separator disabled, the OCR text will still be on the page beneath the
Global or Custodial for the deduplication method above. You have the following options:
Select Yes to have the metadata fields you mapped populated for parent and children documents out of the following: All Custodians, Deduped Custodians, All Paths/Locations, Deduped Paths, and Dedupe Count.
Select No to have the following metadata fields populated for parent documents only: All Custodians, Deduped Custodians, All Paths/Locations, and Deduped Paths.
If you republish a processing set that originally contained a password-protected error without first resolving that error, then the deduplication data won't be propagated correctly to the children of the document that received the error.
In certain cases, the Propagate deduplication data setting can override the Extract children setting on your profile. For example, you have two processing sets, Processing Set 1 and Processing Set 2, that both contain an email message with a Word document attached. You publish Processing Set 1 with the Extract children field set to Yes, which means that the Word attachment is published. You then publish Processing Set 2 with the Extract children field set to No, but with the Deduplication method field set to Global and the Propagate deduplication data field set to Yes. When you do this, given that the emails are duplicates, the deduplication data is propagated to the Word attachment published in Processing Set 1, even though you didn't extract it in Processing Set 2.

Auto-publish set - Answer Arranges for the processing engine to automatically kick off publish after the completion of discovery, with or without errors. By default, this is set to No. Leaving this at No means that you must manually start publish.

Default destination folder - Answer The folder in Relativity into which documents are placed once they're published to the workspace. This value determines the default value of the destination folder field on the processing data source. You have the option of overriding this value when you add or edit a data source on the processing set. Publish jobs read the destination folder field on the data source, not on the profile. You can select an existing folder or create a new one by right-clicking the base folder and selecting Create. If the source path you selected is an individual file or a container, such as a zip, then the folder tree does not include the name of the folder that contains the individual file or container. If the source path you selected is a folder, then the folder tree includes the name of the folder you selected.

Do you want to use source folder structure - Answer Maintains the folder structure of the source of the files you process when you bring these files into Relativity.

Worker Monitoring Tab - Answer View all active jobs. Displays: Workspace, Processing set, Data Source, Status. You can cancel imaging jobs here.

Priority - Answer The order in which jobs in the queue are processed. Lower priority numbers result in higher priority. This is determined by the value of the Order field on the data source. You can change the priority of a data source with the Change Priority button at the bottom of the view. If you change the priority on a job and there is currently another job in progress, the new job will be picked up and worked on as worker threads become available. Changing the priority only changes the priority for that immediate job. Processing sets are processed in the queue on a first-come, first-served basis.

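One way to picture the Priority rule is a queue ordered first by priority number (lower runs first) and then by arrival order. The sketch below is a minimal illustration of that reading using Python's standard `heapq` module; the `JobQueue` class and the job names are hypothetical, and nothing here reflects how Relativity's queue is actually implemented.

```python
import heapq
from itertools import count


class JobQueue:
    """Toy queue: lower priority number first, ties broken by arrival order."""

    def __init__(self):
        self._heap = []
        self._arrival = count()  # increasing counter gives first-come, first-served ties

    def submit(self, name: str, priority: int) -> None:
        # Tuples compare element by element: priority first, then arrival index.
        heapq.heappush(self._heap, (priority, next(self._arrival), name))

    def next_job(self) -> str:
        _priority, _arrival, name = heapq.heappop(self._heap)
        return name

    def __len__(self) -> int:
        return len(self._heap)


if __name__ == "__main__":
    queue = JobQueue()
    queue.submit("Set A", priority=100)
    queue.submit("Set B", priority=10)   # lower number, so it jumps ahead
    queue.submit("Set C", priority=100)  # same priority as Set A, arrived later
    while queue:
        print(queue.next_job())
```

Under these assumptions, Set B is picked up first, followed by Set A and then Set C.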
Error action console - Answer A tab on a processing set used to review, retry, or ignore errors. It also shows error information.

Processing Administration tab - Answer Centralized location for you to access data on active processing and imaging jobs throughout your Relativity environment, as well as the status of all workers assigned to perform those jobs. You can find this information in the Worker Monitoring sub-tab.