Experience RDEToolKit
Purpose
This tutorial will help you execute your first structured processing using RDEToolKit and experience the basic workflow. The estimated time is approximately 15 minutes.
Upon completion, you will be able to:
- Understand the basic structure of RDE projects
- Create custom structured processing functions
- Execute structured processing and verify results
1. Create a Project
Purpose
Create a project directory for RDE structured processing and prepare the necessary file structure.
Code to Execute
terminal / command_prompt
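As a rough Python equivalent of this step, the sketch below creates the directories this tutorial refers to (data/inputdata/, tasksupport/, modules/). The helper name create_project and the project name my-rde-project used in the note afterwards are assumptions for illustration, not names defined by RDEToolKit.

```python
from __future__ import annotations

from pathlib import Path

# Subdirectories named in this tutorial; a sketch, not an official template.
SUBDIRS = ("data/inputdata", "tasksupport", "modules")


def create_project(root: Path) -> list[Path]:
    """Create the project skeleton under `root` and return the directories."""
    created = []
    for sub in SUBDIRS:
        path = root / sub
        # parents=True builds intermediate folders; exist_ok=True makes reruns harmless.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```

Calling create_project(Path("my-rde-project")) from the parent folder would build the skeleton, and running it a second time leaves existing directories untouched.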
Expected Result
The project directory is created with the subdirectories used throughout this tutorial: data/inputdata/, tasksupport/, and modules/.
2. Define Dependencies
Purpose
Define the Python packages to be used in the project.
Code to Execute
requirements.txt
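A one-line requirements.txt is enough for this step. The sketch below writes it from Python. It assumes the package is installed under the name rdetoolkit and leaves the version unpinned, since no version is stated here. write_requirements is a hypothetical helper.

```python
from pathlib import Path


def write_requirements(project_root: Path, packages=("rdetoolkit",)) -> Path:
    """Write one package name per line to requirements.txt and return its path."""
    target = project_root / "requirements.txt"
    # Version pins are omitted on purpose; add them once the target version is known.
    target.write_text("\n".join(packages) + "\n", encoding="utf-8")
    return target
```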
Expected Result
The requirements.txt file is created with the RDEToolKit dependency defined.
3. Create Custom Structured Processing
Purpose
Create a custom function containing data processing logic.
Code to Execute
modules/process.py
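To give a sense of what such a custom function might look like, here is a minimal sketch that summarizes each input file and writes the result to tasksupport/sample_metadata.json, the file named in the verification step. It runs without RDEToolKit: InputDirs is a stand-in defined here that mimics only the two attributes the sketch needs, and it is not the real RdeInputDirPaths class. The function name dataset and the metadata fields are likewise assumptions.

```python
from __future__ import annotations

import json
from dataclasses import dataclass
from pathlib import Path


# Stand-in for illustration only; NOT the real RdeInputDirPaths class.
@dataclass
class InputDirs:
    inputdata: Path
    tasksupport: Path


def dataset(srcpaths: InputDirs, resource_paths: object) -> None:
    """Summarize every input file and store the summary as JSON metadata.

    `resource_paths` is accepted so the function takes an input-paths object
    and an output-paths object, but this sketch does not use it.
    """
    summary = {}
    for path in sorted(srcpaths.inputdata.glob("*")):
        if path.is_file():
            text = path.read_text(encoding="utf-8")
            summary[path.name] = {
                "line_count": len(text.splitlines()),
                "size_bytes": path.stat().st_size,
            }

    # Output location follows the file listed under "Verify Results".
    srcpaths.tasksupport.mkdir(parents=True, exist_ok=True)
    out_file = srcpaths.tasksupport / "sample_metadata.json"
    out_file.write_text(json.dumps(summary, indent=2), encoding="utf-8")
```

In a real project the parameters would presumably be annotated with RdeInputDirPaths and RdeOutputResourcePath instead of the stand-in.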
Expected Result
The modules/process.py file is created with the structured processing logic defined.
4. Create Main Script
Purpose
Create an entry point to launch the RDEToolKit workflow.
Code to Execute
main.py
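An entry point along the following lines could launch the workflow. rdetoolkit.workflows.run() is the call named in this tutorial. Passing the custom function through a custom_dataset_function keyword is an assumption about its signature, and the run_workflow parameter is a hypothetical hook that lets the script be exercised without RDEToolKit installed. The placeholder dataset stands in for the function from modules/process.py.

```python
from typing import Any, Callable, Optional


def dataset(srcpaths: Any, resource_paths: Any) -> None:
    """Placeholder; in the project this would be imported from modules/process.py."""


def main(run_workflow: Optional[Callable[..., Any]] = None) -> Any:
    """Hand the custom function to the workflow runner and return its result."""
    if run_workflow is None:
        # Imported lazily so the file can be inspected without the package installed.
        from rdetoolkit import workflows

        run_workflow = workflows.run
    # Assumed keyword name; check the API reference before relying on it.
    return run_workflow(custom_dataset_function=dataset)
```

In an actual main.py, main() would be called under the usual if __name__ == "__main__": guard so that importing the module does not start processing.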
Expected Result
The main.py file is created and ready to execute structured processing.
5. Prepare Sample Data
Purpose
Create sample data to test the structured processing.
Code to Execute
data/inputdata/sample_data.txt
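Any small text file is enough to exercise the workflow. The sketch below writes one to data/inputdata/sample_data.txt. The line contents are placeholders chosen for illustration and make no claim about the format of real sample data. write_sample is a hypothetical helper.

```python
from pathlib import Path

# Placeholder content only; replace with data in the format your processing expects.
PLACEHOLDER_LINES = ("sample line 1", "sample line 2", "sample line 3")


def write_sample(project_root: Path, lines=PLACEHOLDER_LINES) -> Path:
    """Write the given lines to data/inputdata/sample_data.txt and return its path."""
    target = project_root / "data" / "inputdata" / "sample_data.txt"
    target.parent.mkdir(parents=True, exist_ok=True)
    target.write_text("\n".join(lines) + "\n", encoding="utf-8")
    return target
```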
Expected Result
The data/inputdata/sample_data.txt file is created with sample data ready for processing.
6. Execute Structured Processing
Purpose
Execute RDE structured processing with the created project and verify its operation.
Code to Execute
terminal
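If you prefer to launch the run from Python rather than typing commands, a sketch like the following starts main.py with the current interpreter and captures its output. It assumes the dependencies from requirements.txt are already installed. run_main is a hypothetical helper, not part of RDEToolKit.

```python
import subprocess
import sys
from pathlib import Path


def run_main(project_root: Path) -> subprocess.CompletedProcess:
    """Run main.py from the project root using the current Python interpreter."""
    return subprocess.run(
        [sys.executable, "main.py"],
        cwd=project_root,      # relative paths in main.py resolve against the project root
        capture_output=True,   # keep stdout/stderr for inspection
        text=True,
        check=False,           # let the caller decide how to handle a non-zero exit code
    )
```

The returned object exposes returncode, stdout, and stderr, so a failed run can be inspected rather than raising immediately.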
Expected Result
Output from the structured processing run is displayed in the terminal.
7. Verify Results
Purpose
Verify the execution results and file generation of structured processing.
Code to Execute
terminal
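One way to inspect what was generated is to list every file under the project directory. The sketch below does this with pathlib. list_generated_files is a hypothetical helper, and it does not assume which output folders your configuration produces.

```python
from __future__ import annotations

from pathlib import Path


def list_generated_files(project_root: Path) -> list[str]:
    """Return every file under project_root as a sorted, POSIX-style relative path."""
    return sorted(
        p.relative_to(project_root).as_posix()
        for p in project_root.rglob("*")
        if p.is_file()  # skip directories, including empty ones
    )
```

Printing the result of list_generated_files(Path(".")) from the project root gives a quick overview to compare against the expected files.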
Expected Result
You can verify that files like the following have been generated:
- tasksupport/sample_metadata.json: the created metadata file
- raw/ or nonshared_raw/: a copy of the input files (depending on configuration)
Congratulations!
You have completed your first structured processing using RDEToolKit.
What You Accomplished
✅ Created basic RDE project structure
✅ Implemented custom structured processing function
✅ Executed structured processing workflow
✅ Learned how to verify processing results
Important Concepts Learned
- Project Structure: Roles of data/inputdata/, tasksupport/, and modules/
- Custom Functions: How to use RdeInputDirPaths and RdeOutputResourcePath
- Workflow Execution: Basic usage of rdetoolkit.workflows.run()
Next Steps
To learn more:
- Structured Processing Concepts - Detailed understanding of processing flow
- Configuration File - How to customize behavior
- API Reference - Check all available features
Next Practice
Try more complex structured processing using actual research data. It's important to select the appropriate processing mode based on the type of data.