Reinforcement learning to improve delta robot throws for sorting scrap metal

[en] This study proposes a novel approach based on reinforcement learning (RL) to enhance the sorting efficiency of scrap metal using delta robots and a Pick-and-Place (PaP) process, widely used in the industry. We use three classical model-free RL algorithms (TD3, SAC and PPO) to reduce the time to sort metal scraps. We learn the release position and speed needed to throw an object in a bin instead of moving to the exact bin location, as with the classical PaP technique. Our contribution is threefold. First, we provide a new simulation environment for learning RL-based Pick-and-Throw (PaT) strategies for parallel grippers. Second, we use RL algorithms for learning this task in this environment resulting in 89.32\% accuracy while speeding up the throughput by 51\% in simulation. Third, we evaluate the performances of RL algorithms and compare them to a PaP and a state-of-the-art PaT method both in simulation and reality, learning only from simulation with domain randomisation and without fine tuning in reality to transfer our policies. This work shows the benefits of RL-based PaT compared to PaP or classical optimization PaT techniques used in the industry.

Research Center/Unit :

Montefiore Institute - Montefiore Institute of Electrical Engineering and Computer Science - ULiège

Disciplines :

Computer science

Author, co-author :

Louette, Arthur ; Université de Liège - ULiège > Montefiore Institute of Electrical Engineering and Computer Science

Language :

English

Title :

Reinforcement learning to improve delta robot throws for sorting scrap metal

Publication date :

21 June 2024

Source :

Arxiv

Tags :

CÉCI : Consortium des Équipements de Calcul Intensif

Development Goals :

12. Responsible consumption and production

Available on ORBi :

since 21 June 2024

Statistics

Number of views

131 (41 by ULiège)

Number of downloads

86 (13 by ULiège)

More statistics

Bibliography

Similar publications

Name

Provider / Domaine

Expiration

Description

JSESSIONID

Oracle Corporation

www.uliege.be

Session

General purpose platform session cookie, used by sites written in JSP. Usually used to maintain an anonymous user session by the server.

CookieScriptConsent

CookieScript

.uliege.be

1 year

This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.

Name

Provider / Domaine

Expiration

Description

_pk_id

InnoCraft Ltd

.uliege.be

1 year

Used to store a few details about the user such as the unique visitor ID

_pk_ses

InnoCraft Ltd

.uliege.be

30 minutes

Short lived cookies used to temporarily store data for the visit

_pk_ref

InnoCraft Ltd

.uliege.be

6 months

Used to store the attribution information, the referrer initially used to visit the website

Name	Provider / Domaine	Expiration	Description
JSESSIONID	Oracle Corporation www.uliege.be	Session	General purpose platform session cookie, used by sites written in JSP. Usually used to maintain an anonymous user session by the server.
CookieScriptConsent	CookieScript .uliege.be	1 year	This cookie is used by Cookie-Script.com service to remember visitor cookie consent preferences. It is necessary for Cookie-Script.com cookie banner to work properly.

Name	Provider / Domaine	Expiration	Description
_pk_id	InnoCraft Ltd .uliege.be	1 year	Used to store a few details about the user such as the unique visitor ID
_pk_ses	InnoCraft Ltd .uliege.be	30 minutes	Short lived cookies used to temporarily store data for the visit
_pk_ref	InnoCraft Ltd .uliege.be	6 months	Used to store the attribution information, the referrer initially used to visit the website