Pusan National University Library
Academic Articles
Search results in 'Academic Articles': 289 items | Showing 1-10
Offline Regularised Reinforcement Learning for Large Language Models Alignment
Report
Richemond, Pierre Harvey; Tang, Yunhao; Guo, Daniel; Calandriello, Daniele; Azar, Mohammad Gheshlaghi; Rafailov, Rafael; Pires, Bernardo Avila; Tarassov, Eugene; Spangher, Lucas; Ellsworth, Will; Severyn, Aliaksei; Mallinson, Jonathan; Shani, Lior; Shamir, Gil; Joshi, Rishabh; Liu, Tianqi; Munos, Remi; Piot, Bilal
Open Access (Arxiv)
Multi-turn Reinforcement Learning from Preference Human Feedback
Report
Shani, Lior; Rosenberg, Aviv; Cassel, Asaf; Lang, Oran; Calandriello, Daniele; Zipori, Avital; Noga, Hila; Keller, Orgad; Piot, Bilal; Szpektor, Idan; Hassidim, Avinatan; Matias, Yossi; Munos, Rémi
Open Access (Arxiv)
Understanding the performance gap between online and offline alignment algorithms
Report
Tang, Yunhao; Guo, Daniel Zhaohan; Zheng, Zeyu; Calandriello, Daniele; Cao, Yuan; Tarassov, Eugene; Munos, Rémi; Pires, Bernardo Ávila; Valko, Michal; Cheng, Yong; Dabney, Will
Open Access (Arxiv)
Super-Exponential Regret for UCT, AlphaGo and Variants
Report
Orseau, Laurent; Munos, Remi
Open Access (Arxiv)
Human Alignment of Large Language Models through Online Preference Optimisation
Report
Calandriello, Daniele; Guo, Daniel; Munos, Remi; Rowland, Mark; Tang, Yunhao; Pires, Bernardo Avila; Richemond, Pierre Harvey; Lan, Charline Le; Valko, Michal; Liu, Tianqi; Joshi, Rishabh; Zheng, Zeyu; Piot, Bilal
Open Access (Arxiv)
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model
Report
Rowland, Mark; Wenliang, Li Kevin; Munos, Rémi; Lyle, Clare; Tang, Yunhao; Dabney, Will
Open Access (Arxiv)
Off-policy Distributional Q($\lambda$): Distributional RL without Importance Sampling
Report
Tang, Yunhao; Rowland, Mark; Munos, Rémi; Pires, Bernardo Ávila; Dabney, Will
Open Access (Arxiv)
Generalized Preference Optimization: A Unified Approach to Offline Alignment
Report
Tang, Yunhao; Guo, Zhaohan Daniel; Zheng, Zeyu; Calandriello, Daniele; Munos, Rémi; Rowland, Mark; Richemond, Pierre Harvey; Valko, Michal; Pires, Bernardo Ávila; Piot, Bilal
Open Access (Arxiv)
Nash Learning from Human Feedback
Report
Munos, Rémi; Valko, Michal; Calandriello, Daniele; Azar, Mohammad Gheshlaghi; Rowland, Mark; Guo, Zhaohan Daniel; Tang, Yunhao; Geist, Matthieu; Mesnard, Thomas; Michi, Andrea; Selvi, Marco; Girgin, Sertan; Momchev, Nikola; Bachem, Olivier; Mankowitz, Daniel J.; Precup, Doina; Piot, Bilal
Open Access (Arxiv)
Model-free Posterior Sampling via Learning Rate Randomization
Report
Tiapkin, Daniil; Belomestny, Denis; Calandriello, Daniele; Moulines, Eric; Munos, Remi; Naumov, Alexey; Perrault, Pierre; Valko, Michal; Menard, Pierre
Open Access (Arxiv)
Refine search results
Applied limiter: [AR] Munos, Rémi
Academic DB (Database Provider)
arXiv (116)
MathSciNet via EBSCOhost (34)
Complementary Index (32)
Science Citation Index Expanded (28)
Academic Search Complete (20)
Business Source Complete (11)
IEEE Xplore Digital Library (8)
ScienceDirect (8)
Springer Nature Journals (8)
Gale Academic OneFile (6)
ACM Full-Text Collection (4)
Social Sciences Citation Index (4)
Journals@OVID (3)
Supplemental Index (3)
Scopus® (2)
JSTOR Journals (1)
eScholarship (1)
Gale General OneFile (1)
Journal Title (Publication)
journal of machine learning research (25)
machine learning (22)
theoretical computer science (19)
journal of machine learning research (jmlr) (10)
algorithmic learning theory (6)
siam journal on control & optimization (6)
methodology and computing in applied probability (5)
scientific reports (5)
nature (4)
automatica (3)
engineering applications of artificial intelligence (3)
nature communications (3)
siam journal on control and optimization (3)
2013 ieee symposium on adaptive dynamic programming & reinforcement learning (adprl) (2)
2013 ieee symposium on adaptive dynamic programming and reinforcement learning (adprl), adaptive dynamic programming and reinforcement learning (adprl), 2013 ieee symposium on (2)
acm international conference proceeding series (2)
comptes rendus - mathematique (2)
journal of artificial intelligence research (2)
journal of computer and system sciences (2)
journal of neural engineering (2)
methodology & computing in applied probability (2)
the annals of statistics (2)
2007 ieee international symposium on approximate dynamic programming and reinforcement learning, approximate dynamic programming and reinforcement learning, 2007. adprl 2007. ieee international symposium on (1)
2011 ieee symposium on adaptive dynamic programming and reinforcement learning (adprl), adaptive dynamic programming and reinforcement learning (adprl), 2011 ieee symposium on (1)
2014 ieee congress on evolutionary computation (cec) (1)
2014 ieee congress on evolutionary computation (cec), evolutionary computation (cec), 2014 ieee congress on (1)
2014 ieee symposium on adaptive dynamic programming & reinforcement learning (adprl) (1)
2014 ieee symposium on adaptive dynamic programming and reinforcement learning (adprl), adaptive dynamic programming and reinforcement learning (adprl), 2014 ieee symposium on (1)
2016 american control conference (acc) (1)
2016 american control conference (acc), american control conference (acc), 2016 (1)
53rd ieee conference on decision & control (1)
53rd ieee conference on decision and control, decision and control (cdc), 2014 ieee 53rd annual conference on (1)
algorithmic learning theory (9783319463780) (1)
algorithmic learning theory (9783540752240) (1)
algorithmic learning theory (9783642409349) (1)
foundations & trends in machine learning (1)
learning theory (9783540352945) (1)
machine learning & knowledge discovery in databases (9783642158827) (1)
machine learning & knowledge discovery in databases: european conference, ecml pkdd 2014, nancy, france, september 15-19, 2014. proceedings, part ii (1)
proceedings of the 18th international conference on autonomous agents and multiagent systems (1)
proceedings of the 22nd international conference on machine learning (1)
proceedings of the 22nd international conference: machine learning (1)
proceedings of the 26th annual international conference on machine learning (1)
proceedings of the 7th acm international conference on web search and data mining (1)
proceedings of the 7th acm international conference web search & data mining (1)
proceedings of the annual meeting of the cognitive science society (1)
proceedings of the ieee conference on decision and control (1)
proceedings of the international joint conference on neural networks (1)
recent advances in reinforcement learning (9783540897217) (1)
recent advances in reinforcement learning (9783642299452) (1)
Publisher
springer nature (19)
microtome publishing (17)
ieee (16)
elsevier b.v. (13)
springer (9)
microtome publ (8)
society for industrial & applied mathematics (6)
association for computing machinery (4)
elsevier science bv (4)
springer, heidelberg (4)
kluwer academic publishers (3)
nature publishing group (3)
springer us (3)
ai access foundation (2)
elsevier ltd (2)
nature portfolio (2)
nature publishing group uk (2)
pergamon-elsevier science ltd (2)
springer, berlin (2)
academic press inc elsevier science (1)
academic press inc. (1)
academie des science, centre mersenne (1)
amer assoc advancement science (1)
american automatic control council (aacc) (1)
elsevier sas (1)
escholarship, university of california (1)
hermes sci. publ./lavoisier, paris (1)
inst mathematical statistics (1)
iop publishing (1)
iop publishing ltd (1)
iste, london (1)
nature research (1)
now publishers (1)
siam publications (1)
springer, [cham] (1)
Source Type
Academic Journals (126)
Reports (116)
Reviews (34)
Conference Materials (33)
Books (10)
Magazines (3)
Subject Terms
computer science - machine learning (65)
statistics - machine learning (65)
computer science - artificial intelligence (47)
computer science - learning (32)
reinforcement learning (25)
optimal control (21)
markov decision processes (13)
markov processes (13)
mathematical optimization (11)
algorithms (10)
monte carlo method (10)
computer science - multiagent systems (9)
dynamic programming (9)
planning (9)
computer science - computer science and game theory (7)
computing and processing (7)
machine learning (6)
mathematics - statistics theory (6)
optimization (6)
problem solving (6)
robotics and control systems (6)
sensitivity analysis (6)
stochastic analysis (6)
active learning (5)
adaptive control systems (5)
adaptive sampling (5)
bang-bang control (5)
computational complexity (5)
computer science (5)
monte carlo simulations (5)
regression (5)
signal processing and analysis (5)
stochastic gradient (5)
supervised learning (5)
communication, networking and broadcast technologies (4)
error analysis (4)
function approximation (4)
iterative methods (mathematics) (4)
mathematics - optimization and control (4)
multi-armed bandits (4)
probability theory (4)
random variables (4)
regret (4)
robbers (4)
standard deviations (4)
statistical learning (4)
statistical sampling (4)
stochastic approximation (4)
upper bound (4)
91b02 (3)
Language
english (160)
french (1)