Facility Agriculture: Intelligent Sensors [Department of Electrical Engineering / Tsai, Hsiao-Ping, Assistant Professor]

Paper title: An Expected Win Rate-based Real Time Bidding Strategy for Branding Campaign by the Model-free Reinforcement Learning Model
Journal: IEEE Access
Year, volume, pages: 2020, 8: 151952-151967
Authors: Shih, Wen-Yueh; Lu, Yi-Shu; Tsai, Hsiao-Ping (蔡曉萍)*; Huang, Jiun-Long
DOI 10.1109/ACCESS.2020.3016824
 
Abstract: The bidding strategy plays the most important role in helping Demand Side Platforms (DSPs) make bidding decisions on a large number of bid requests in Real Time Bidding (RTB) so as to satisfy the different objectives of campaigns under lifetime and budget constraints. In this paper, we focus on branding campaigns, whose objective is to obtain as many impressions as possible under the lifetime and budget constraints. To achieve this objective, we propose a novel expected win rate-based bidding strategy for branding campaigns that utilizes a model-free reinforcement learning model. Specifically, to avoid missing good opportunities as a result of submitting extremely low bid prices, the concept of the base winning price is introduced to determine the lower bound of the expected winning price. In addition, to obtain more impressions, the concept of the DSP-specified budget spending plan is proposed to determine the proper winning prices. The base expected win rate is then calculated from the base winning price and the winning price determined by the DSP-specified budget spending plan. Since RTB is a dynamic environment, the proposed strategy, named EWDQN, utilizes a Deep Q Network (DQN) to dynamically determine the expected win rate according to the base expected win rate and the current status of the RTB market, and then determines the bid price according to the expected win rate. To the best of our knowledge, this is the first research to apply the reinforcement learning technique to bidding strategies for branding campaigns. To measure the performance of EWDQN, several experiments are conducted on two real datasets. Experimental results show that EWDQN outperforms the state-of-the-art bidding strategies for branding campaigns in terms of the number of obtained impressions and CPM (cost per thousand impressions).
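
The abstract relates three quantities: a winning price, an expected win rate, and a bid price derived from that win rate. It also evaluates results by CPM. The sketch below illustrates those relationships only. It is not the EWDQN method, and the abstract does not say how the paper maps between prices and win rates. It assumes, purely for illustration, that the win rate at a given bid is the fraction of logged winning prices at or below that bid. The function names `win_rate`, `bid_for_win_rate` and `cpm` and the sample prices are hypothetical.

```python
from bisect import bisect_right
import math


def win_rate(sorted_prices, bid):
    """Empirical win rate: share of logged winning prices <= bid.

    Assumption (not from the paper): a bid wins an auction whenever it is
    at least the logged winning price. sorted_prices must be ascending.
    """
    return bisect_right(sorted_prices, bid) / len(sorted_prices)


def bid_for_win_rate(sorted_prices, target):
    """Smallest logged price whose empirical win rate reaches `target`.

    `target` is a win rate in [0, 1]; a non-positive target needs no bid.
    """
    if target <= 0:
        return 0
    # Number of logged auctions that must be won to reach the target rate.
    k = min(math.ceil(target * len(sorted_prices)), len(sorted_prices))
    return sorted_prices[k - 1]


def cpm(total_cost, impressions):
    """Cost per thousand impressions."""
    return 1000.0 * total_cost / impressions


# Hypothetical log of winning prices, sorted ascending.
prices = [10, 20, 30, 40]
half_rate_bid = bid_for_win_rate(prices, 0.5)
print(half_rate_bid, win_rate(prices, half_rate_bid))
print(cpm(50.0, 2000))
```

In this toy setting, raising the target win rate raises the bid, which is the trade-off the budget spending plan has to manage: a higher win rate yields more impressions per request but spends the budget faster.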