TY - GEN
T1 - Extracting URLs from JavaScript via program analysis
AU - Wang, Qi
AU - Zhou, Jingyu
AU - Chen, Yuting
AU - Zhang, Yizhou
AU - Zhao, Jianjun
PY - 2013
Y1 - 2013
N2 - With the extensive use of client-side JavaScript in web applications, web content is becoming more dynamic than ever before. This poses significant challenges for search engines: more URLs are now embedded or hidden inside JavaScript code, and most web crawlers are script-agnostic, which significantly reduces search-engine coverage. We present a hybrid approach that combines static analysis with dynamic execution, overcoming the weaknesses of purely static approaches, which lack accuracy, and purely dynamic approaches, which incur high execution cost. We also propose integrating program analysis techniques such as statement coverage and program slicing to improve the performance of URL mining.
UR - http://www.scopus.com/inward/record.url?scp=84883733499&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84883733499&partnerID=8YFLogxK
U2 - 10.1145/2491411.2494583
DO - 10.1145/2491411.2494583
M3 - Conference contribution
AN - SCOPUS:84883733499
SN - 9781450322379
T3 - 2013 9th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering, ESEC/FSE 2013 - Proceedings
SP - 627
EP - 630
BT - 2013 9th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering, ESEC/FSE 2013 - Proceedings
T2 - 2013 9th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering, ESEC/FSE 2013
Y2 - 18 August 2013 through 26 August 2013
ER -