STEP 1: Register Your Text Files
1
Set Up AWS
Before you can do anything with the Encord platform and cloud storage, you need to configure your cloud storage to work with Encord. Once the integration between Encord and your cloud storage is complete, you can then use your data in Encord.In order to integrate with AWS S3, you need to:
- Create a permission policy for your resources that allows appropriate access to Encord.
- Create a role for Encord and attach the policy so that Encord can access those resources.
- Activate Cross-origin resource sharing which allows Encord to access those resources from a web browser.
- Test the integration to make sure it works.
2
Create AWS Integration in Encord
Create an S3 bucket to store your files if you haven’t already. Your S3 bucket permissions should be set to be blocking all public access.


It is essential you do not close this tab or window until you have finished the whole integration process. If you use the AWS UI for integration, we advise opening the AWS console in a separate tab.
3
Create JSON file for Registration
Create a JSON file based on the templates provided below.The
title field is optional. If omitted, the file path and file name are used as the default title. For example, if the file is located at https://encord-solutions-bucket.s3.eu-west-2.amazonaws.com/path/to/my/bucket/website.html, the title defaults to /path/to/my/bucket/website.html.| Key or Flag | Required? | Default value |
|---|---|---|
| ”objectUrl” | Yes | |
| ”title” | No | The file’s path + title |
| ”clientMetadata” | No |
Keys / Flags that are not required can be omitted from the JSON file entirely.
4
Create a Folder to Store Your Audio Files
All files in Encord must be stored within folders. Therefore, you need to create a folder before registering any data with Encord. To create a folder:
- Navigate to Files under the Index heading in the Encord platform.
- Click the + New folder button to create a new folder. Select the type of folder you want to create.

- Give the folder a meaningful name and description.
- Click Create to create the folder. The folder is listed in Files.
STEP 2: Set Up Your Project
1
Create a Dataset
- Click the New dataset button in the Datasets section in Annotate.

- Give your Dataset a meaningful title and description. A clear title and description keeps your data organized.

- Click Create dataset to create the Dataset.
Attach files
- Navigate to the Datasets section under the Annotate heading.
- Click the Dataset you want to attach data to.
- Click +Attach existing files.

- Select the folders containing the files you want to attach to the Dataset. To select individual files, double-click a folder to see its contents, and select the files you want to add to the Dataset.
- Click Attach data to attach the selected files to the Dataset.
2
Create an Ontology
We recommend learning about Ontology structure before creating Ontologies.
- Click the New ontology button in the Ontologies section to create a new Ontology.

- Give your Ontology a meaningful title and description. A clear title and description keeps your Ontologies organized.
- Click Next to continue.

- Define your Ontology structure. Text files are labeled using the Text Region object type and Classifications.

- Click + Add object to create a new object.
- Give the object a name, for example “Region of Interest”.
- Select the Text Region object type.
- Optionally, configure the object to add any attributes.
- Optionally, enable the Required toggle to mark the object as Required.
- Repeat these steps for as many Text Region objects as necessary.

- Click the arrow button next to the object to start configuring.
- Click + Add attribute to add an attribute.
- Select the attribute type: Text, radio button, or checklist.
- Give the attribute a name, for example “Why is this region of interest?” could be a text attribute.
- If your attribute is a radio button or a checklist click Add option and add options.

- Optionally mark the object as Required and / or Dynamic.
- Click the Back to parent button to return the Ontology creation view.

- Click + Add classification to create a new classification.
- Give the classification a name. For example, “HTML Type”.
- Optionally, configure the classification to change the classification type. The default classification type is a text field.
- Optionally, enable the Required toggle to mark the classification as Required.
- Repeat these steps for as many classifications as necessary.
- Click the arrow icon next to an object to configure the classification.

- Click the classification type to change the classification type. The default classification type is a text field.
- Click Add option to add an option if you have selected a radio button or check list classification.
- Enter a name for the classification option. For example, the classification “HTML Type” can have the options “Raw”, and “Single Extension”.
- Click the Back to parent button to return the Ontology creation view.
3
Create a Project
- In the Encord platform, select Projects under Annotate.
- Click the + New annotation project button to create a new Project.

- Give the Project a meaningful title and description.

If you are part of an Organization, an optional Project tags drop-down is visible. Project tags are useful for categorizing and finding your Projects. Select as many tags as are relevant for your Project.
- Click the Attach ontology button.
- Select the Ontology you created previously from the list using the Select button.

- Click OK to attach the Ontology to the Project.
- Click the Attach datasets button.
- Select the Dataset you created previously from the list using the Attach button.

- Click OK to attach the Dataset(s) to the Project.

- Click the Load from template button to use a Workflow template.

- Select the template you want to use and click Load template.
- Click Create project to finish creating the Project.

STEP 3: Label Your Text Files
Text
Labeling Text Files
Labeling Text Files
Text
- Highlight the text to label.
- Click a Text Region label type at the top of the Editor (or use the hotkey) to apply the label.

Labeling JSON Files
Labeling JSON Files
JSON
- Highlight the text to label.
- Click a Text Region label type at the top of the Editor (or use the hotkey) to apply the label.

Labeling XML Files
Labeling XML Files
XML
- Highlight the text to label.
- Click a Text Region label type at the top of the Editor (or use the hotkey) to apply the label.

HTML
Encord supports both raw HTML files and single-extension HTML files. The key difference is that single-extension HTML files include all the necessary elements to render the webpage, such as CSS and JavaScript.Labeling HTML - Single Extension
Labeling HTML - Single Extension
HTML - Single Extension
- Click a Text Region label type (or use the hotkey)
- Highlight the text you want to apply the label to.

Labeling HTML - Raw
Labeling HTML - Raw
HTML - Raw
- Click a Text Region label type (or use the hotkey)
- Highlight the text you want to apply the label to.

Using Relation Text Attributes - Text and HTML
Relation text attributes link an object to another object. In images or videos this could be linking a chicken’s body to its wing. In PDFs or text documents (text or HTML files) this could be linking words with measurements, or linking words with other words.Your Ontology must MUST contain text attributes with the Relation feature enabled if you want to use relation text attributes.





