한국과학기술원 도서관

서지주요정보
Adaptability improvement of learning from demonstration in motion planning by particle swarm optimization = 운동계획에서 시범에 의한 학습 방법의 입자군집최적화를 이용한 적응성 향상
서명 / 저자	Adaptability improvement of learning from demonstration in motion planning by particle swarm optimization = 운동계획에서 시범에 의한 학습 방법의 입자군집최적화를 이용한 적응성 향상 / Jeong-Jung Kim.
발행사항	[대전 : 한국과학기술원, 2015].
Online Access	원문보기 원문인쇄

소장정보

등록번호

8028502

소장위치/청구기호

학술문화관(문화관) 보존서고

DEE 15045

휴대폰 전송

도서상태

이용가능(대출불가)

사유안내

반납예정일

리뷰정보

초록정보

One of the approaches for a motion planning in a high-dimensional space is a Learning from Demonstration (LfD) strategy. Trajectories that a robot should follow are extracted from human demonstrations and those are used as data for encoding motions and they are used for a motion planning. The advantage of the LfD method is that it can find a motion planning solution that is suitable to a task in a short time. Although the method successfully generates a motion when a query point is similar to learned trajectories, it has a limitation of adaptability when additional constraints such as an obstacles constraint and a short distance constraint are added. To overcome the limitation and improve the adaptability of LfD, a trajectory modification method for LfD by Particle Swarm Optimization (PSO) is suggested. The suggested method is applied to a trajectory generated by LfD. PSO is a population-based stochastic global optimization method and has advantages of simplicity and fast convergence which are good properties for the motion planning. A parameterization of the modification to a trajectory and a cost function that addresses constraints are developed for PSO. The amounts of the changes for each step is encoded into a particle and a trajectory after summing between an original trajectory and a modification amount is evaluated on a cost function. The optimization is considered as a constrained optimization which minimizes cost while satisfying an equality condition. An obstacle avoidance constraint is set as an equality condition and large amount of penalty value is set for the constraint when the constraint is not met for escaping the region. The cost values for remain constraints for a short distance, a minimum acceleration and a minimum modification are normalized between 0 and 1 to eliminate the scale property between the constraints. A common method for initializing the particles in PSO is to use a uniform distribution in a search space. However, for the trajectory optimization problem in this study, the method produces poor results because the search space is huge. A normalized step cost (NSC) concept is suggsted and used to initialize the particles for the optimization and boosted PSO (BPSO) is suggested. The suggested method boosts the convergence speed of the PSO for the trajectory optimization. A method for reusing previously optimized parameters is also investigated. The optimized parameters are stored with NSC feature vectors, and when a constraint violation occurs, the parameter stored with an NSC vector that is similar to the query NSC vector is selected. The selected parameter is used for initializing the particles. The approach reduces the necessity to re-optimize a trajectory which was previously optimized already and reduces the motion planning time. The suggested method is aim for an applcations of a target reaching task with arm-type manipulators. The task is essential in motion planning for daily life and various tasks such as pick-and-place task can be extended based on the task. Simulations for modifying trajectory generated from LfD are conducted with the suggested method and other optimization methods and results for convergence speed, and computation time are compared. The suggested method is also applied for optimizing a trajectory with real arm-type manipulators. The proposed algorithm optimizes a trajectory while satisfying the constraints and is less likely to converge to a local minimum.

고차원에서 운동계획을 하기 위한 한가지 접근 방법은 시범에 의한 학습 (Learning from Demonstration, LfD) 이다. 이 방법에서는 로봇이 따라야 할 궤적을 사람의 시범을 통해서 추출하고 추출된 궤적을 학습하여 추후에 비슷한 운동계획 시에 이 궤적을 이용한다. 이 LfD이 가지는 장점은 특정 작업에 특화된 궤적들을 빠른 시간 안에 생성할수 있는 것이다. 운동계획시 이 방법의 경우 학습에 이용된 궤적과 비슷한 궤적만을 생성해낼 수 있고 학습에 사용되지 않은 추가적인 제한조건을 만족하는 운동계획을 새로 해야 할 경우에는 그에 적합한 궤적을 생성하지 못하는 제한 점이 존재한다. 본 논문에서는 이와 같은 제한 점을 해결하고 LfD의 적응성을 향상시키기 위해 입자군집최적화(Particle Swarm Optimization, PSO)를 통해 LfD로 생성된 궤적을 변경하는 방법을 제시한다. 이 PSO는 집단의 정보를 이용하는 확률적 전역 최적화 방법으로 간단한 구조와 빠른 수렴속도를 보이고 이 특징은 운동계획에 적용하기에 적합하다. 이 PSO를 이용하기 위해 본 논문에서는 궤적 변경 정도의 변수화 및 제한조건들을 다루는 비용 함수 설계에 대한 내용을 포함한다. 궤적 각각의 단계에서 변경되는 정도는 입자에 설정되고 초기 궤적에 변경된 정도를 더해진 후 비용함수에서 평가된다. 이 최적화는 동일조건을 만족하면서 비용을 최소화하는 제한 최적화 문제로 다룬다. 장애물 회피에 대한 제한 조건을 동일 조건으로 설정하고 이 조건을 만족하지 않을 때는 큰 패널티 값을 할당하여 적합하지 않은 영역에서 벗어나도록 한다. 최단거리, 최소가속, 최소변경에 대한 제한조건의 비용은 0과 1로 정규화하여 제한조건들 간의 크기 특성을 제거하였다. PSO에서 입자들의 일반적인 초기화 방법은 탐색공간에서 균일 무작위 함수를 사용하는 것이다. 하지만 이 방법은 탐색 범위가 넓은 궤적 최적화 문제에서는 나쁜 결과를 낸다. 본 논문에서는 정규화된 비용(Normalized Step Cost, NSC)을 입자 초기화에 이용하는 boosted PSO (BPSO)를 제안한다. 제안한 방법은 궤적 최적화에 PSO를 적용할 때 수렴속도를 가속시킨다. 기존에 최적화된 파라미터를 재사용하는 방법 또한 제안된다. 최적화된 파라미터들은 NSC 특징 벡터와 함께 저장되고 질의 NSC가 입력으로 주어졌을 때 저장된 NSC 중 가장 유사한 NSC를 찾고 이와 함께 저장되었던 파라미터가 선택된다. 선택된 파라미터는 입자들을 초기화하는데 이용된다. 이 접근 방법은 기존에 최적화된 파라미터를 다시 최적화 하는 필요를 줄이고 운동계획의 시간을 줄인다. 제안된 방법은 로봇 팔의 목적지 도달 작업에 적용하여 검증하였다. 이 작업은 일상생활에서 이루어지는 동작 계획에 있어서 핵심적인 작업이며 물건을 집고 놓는 작업과 같은 다양한 작업에 확장될 수 있다. LfD에서 생성된 궤적을 변경하는 시뮬레이션에서 여러 최적화 방법과 제안된 방법을 비교 하였으며 수렴 속도, 계산 속도, 최종 비용 값에 있어 제안한 방법이 가장 좋은 성능을 내었다. 제안된 방법은 또한 실제 로봇 팔에 적용되었고 제한조건들을 만족하는 궤적을 생성하는 것을 확인 하였다.

서지기타정보

서지기타정보
청구기호	{DEE 15045
형태사항	iii, 64 : 삽화 ; 30 cm
언어	영어
일반주기	저자명의 한글표기 : 김정중 지도교수의 영문표기 : Byung Kook Kim 지도교수의 한글표기 : 김병국 공동지도교수의 영문표기 : Ju-Jang Lee 공동지도교수의 한글표기 : 이주장 Including Appendix
학위논문	학위논문(박사) - 한국과학기술원 : 전기및전자공학부,
서지주기	References : p.

QR CODE

책소개

전체보기

나의 도서관정보

메뉴

소장정보

리뷰정보

초록정보

서지기타정보

책소개

목차

이 주제의 인기대출도서