Leverage temporal convolutional network for the representation learning of URLs

Yunji Liang, Jian Kang, Zhiwen Yu, Bin Guo, Xiaolong Zheng, Saike He

科研成果: 书/报告/会议事项章节会议稿件同行评审

7 引用 (Scopus)

摘要

Cyber crimes including computer virus/malwares, spam, illegal sales, and phishing websites are proliferated aggressively via the disguised Uniform Resource Locators (URL). Although numerous studies were conducted for the URL classification task, the traditional URL classification solutions retreated due to the hand-crafted feature engineering and the boom of newly generated URLs. In this paper, we study the representation learning of URLs, and explore the URL classification using deep learning. Specifically, we propose URL2vec to extract both the structural and lexical features of URLs, and apply temporal convolutional network (TCN) for the URL classification task. The experimental results show that URL2vec outperforms both word2vec and character-level embedding for URL representation, and TCN achieves the best performance than baselines with the precision up to 95.97%.

源语言英语
主期刊名2019 IEEE International Conference on Intelligence and Security Informatics, ISI 2019
编辑Xiaolong Zheng, Ahmed Abbasi, Michael Chau, Alan Wang, Lina Zhou
出版商Institute of Electrical and Electronics Engineers Inc.
74-79
页数6
ISBN(电子版)9781728125046
DOI
出版状态已出版 - 7月 2019
活动17th IEEE International Conference on Intelligence and Security Informatics, ISI 2019 - Shenzhen, 中国
期限: 1 7月 20193 7月 2019

出版系列

姓名2019 IEEE International Conference on Intelligence and Security Informatics, ISI 2019

会议

会议17th IEEE International Conference on Intelligence and Security Informatics, ISI 2019
国家/地区中国
Shenzhen
时期1/07/193/07/19

指纹

探究 'Leverage temporal convolutional network for the representation learning of URLs' 的科研主题。它们共同构成独一无二的指纹。

引用此