Python: using split('"') and split("=") to process HTML link text

Published: 2023-01-01
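import requests

# Assumed setup: the real host and port are redacted in this post,
# and the path is taken from the title of the directory listing below.
web_test_url = "http://x.x.x.x:xx/agora_ad/cloudclass/web/test/release_2.7.1/"
web_test_list = []  # holds the most recently seen package URL

web_test_response = requests.get(web_test_url)
print("web_test_response11", web_test_response.text)
print("web_test_response22", web_test_response.text.split('"'))

if web_test_response.status_code != 404:
    # The last HREF value is the second-to-last element after splitting on '"'.
    web_url_test = "http://x.x.x.x:xx" + web_test_response.text.split('"')[-2]
    print("web_url_test", web_url_test)
    web_test_response1 = requests.get(web_url_test)
    print("web_test_response1", web_test_response1)
    # The package URL is whatever follows the last "=" in the .txt file.
    web_test_packurl = web_test_response1.text.split("=")[-1]
    print("web_test_response2\n", web_test_response1.text)
    print("web_test_response3", web_test_response1.text.split("="))
    print("web_test_response4", web_test_response1.text.split("=")[-1])
    # Keep only the newest package URL in the list.
    if web_test_packurl not in web_test_list:
        web_test_list.clear()
        web_test_list.append(web_test_packurl)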


web_test_response11: the raw response text, which is then split on '"'

<html><head><title>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</title></head><body><H1>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</H1><hr>

<pre><A HREF="/agora_ad/cloudclass/web/test/">[转到父目录]</A><br><br> 2022/8/19    12:11          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4329.txt">20220819_4329.txt</A><br> 2022/8/19    13:34          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4331.txt">20220819_4331.txt</A><br> 2022/8/19    17:52          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4335.txt">20220819_4335.txt</A><br> 2022/8/22    10:34          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220822_4340.txt">20220822_4340.txt</A><br> 2022/8/23    17:17          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220823_4351.txt">20220823_4351.txt</A><br> 2022/8/24    11:10          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220824_4365.txt">20220824_4365.txt</A><br> 2022/8/30    10:36          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4378.txt">20220830_4378.txt</A><br> 2022/8/30    17:26          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4384.txt">20220830_4384.txt</A><br> 2022/8/30    17:54          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4387.txt">20220830_4387.txt</A><br> 2022/8/30    17:54          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4389.txt">20220830_4389.txt</A><br> 2022/8/31    10:26          259 <A HREF="/agora_ad/cloudclass/web/test/release_2.7.1/20220831_4393.txt">20220831_4393.txt</A><br></pre><hr></body></html>

web_test_response22: the list produced by the split

['<html><head><title>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</title></head><body><H1>X.X.X.X - /agora_ad/cloudclass/web/test/release_2.7.1/</H1><hr>\r\n\r\n<pre><A HREF=', '/agora_ad/cloudclass/web/test/', '>[转到父目录]</A><br><br> 2022/8/19    12:11          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4329.txt', '>20220819_4329.txt</A><br> 2022/8/19    13:34          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4331.txt', '>20220819_4331.txt</A><br> 2022/8/19    17:52          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220819_4335.txt', '>20220819_4335.txt</A><br> 2022/8/22    10:34          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220822_4340.txt', '>20220822_4340.txt</A><br> 2022/8/23    17:17          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220823_4351.txt', '>20220823_4351.txt</A><br> 2022/8/24    11:10          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220824_4365.txt', '>20220824_4365.txt</A><br> 2022/8/30    10:36          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4378.txt', '>20220830_4378.txt</A><br> 2022/8/30    17:26          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4384.txt', '>20220830_4384.txt</A><br> 2022/8/30    17:54          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4387.txt', '>20220830_4387.txt</A><br> 2022/8/30    17:54          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220830_4389.txt', '>20220830_4389.txt</A><br> 2022/8/31    10:26          259 <A HREF=', '/agora_ad/cloudclass/web/test/release_2.7.1/20220831_4393.txt', '>20220831_4393.txt</A><br></pre><hr></body></html>']
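To see why index -2 picks out the last link, it may help to run the same split on a much smaller string. The sketch below uses a made-up listing (the `sample` string and its file names are illustrative assumptions, not the real response): text between a pair of double quotes always lands at an odd index, and the last quoted value is followed by exactly one trailing chunk of markup.

```python
# A shortened, made-up listing with the same shape as the real response.
sample = (
    '<pre><A HREF="/web/test/">[parent]</A><br>'
    '<A HREF="/web/test/a.txt">a.txt</A><br>'
    '<A HREF="/web/test/b.txt">b.txt</A><br></pre>'
)

parts = sample.split('"')

# Text between a pair of double quotes always lands at an odd index.
hrefs = parts[1::2]
print(hrefs)

# The last quoted value is followed by one trailing chunk of markup,
# so it sits at index -2.
latest = parts[-2]
print(latest)
```

This only holds while every double quote in the page belongs to an attribute value; a stray quote in a file name or in the page text would shift the indices.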

web_url_test: the second-to-last element of the web_test_response22 list, prefixed with the host

http://x.x.x.x:xx/agora_ad/cloudclass/web/test/release_2.7.1/20220831_4393.txt
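As a possible alternative to counting quote characters, the same link can be pulled out with the standard library's `html.parser` and joined to the host with `urllib.parse.urljoin`. This is only a sketch: `HrefCollector`, `latest_txt_url`, the `example.invalid` host, and the shortened listing are all hypothetical names and data introduced here for illustration, not part of the original script.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class HrefCollector(HTMLParser):
    """Collects the href attribute of every <a> tag, in document order."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names, so <A HREF=...> matches.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.hrefs.append(value)


def latest_txt_url(base_url, listing_html):
    """Return the absolute URL of the last .txt link, or None if there is none."""
    collector = HrefCollector()
    collector.feed(listing_html)
    txt_links = [h for h in collector.hrefs if h.endswith(".txt")]
    if not txt_links:
        return None
    return urljoin(base_url, txt_links[-1])


# Hypothetical host and shortened listing, for illustration only.
listing = (
    '<pre><A HREF="/web/test/">[parent]</A><br>'
    '<A HREF="/web/test/release/20220830_4389.txt">20220830_4389.txt</A><br>'
    '<A HREF="/web/test/release/20220831_4393.txt">20220831_4393.txt</A><br></pre>'
)
print(latest_txt_url("http://example.invalid:8080", listing))
```

Filtering on the `.txt` suffix also skips the parent-directory link, so an empty directory yields `None` instead of a wrong URL.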

Processing web_test_response2 (the body of the .txt file)

web_test_response2:

url: https://agora-adc-artifacts.s3.cn-north-1.amazonaws.com.cn/apaas/app/test/release_2.7.1/20220831_4393/index.html
            global_url: url=https://solutions-apaas.agora.io/apaas/app/test/release_2.7.1/20220831_4393/index.html

web_test_response3: the list produced by splitting on "="

 ['\n            url: https://x.x.x.x/apaas/app/test/release_2.7.1/20220831_4393/index.html\n            global_url: url', 'https://solutions-apaas.agora.io/apaas/app/test/release_2.7.1/20220831_4393/index.html\n            \n']

web_test_response4: the last element of the web_test_response3 list

https://solutions-apaas.agora.io/apaas/app/test/release_2.7.1/20220831_4393/index.html
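The split list shown above ends with a newline and some spaces, so the extracted value carries that whitespace with it even though the printed line looks clean. A minimal sketch of a slightly more defensive version follows; `extract_package_url` is a hypothetical helper, and `body` is rebuilt from the list shown above rather than fetched from the server.

```python
def extract_package_url(text):
    """Return the text after the last '=' with whitespace stripped, or None."""
    _, sep, tail = text.rpartition("=")
    if not sep:
        return None  # no "=" in the text at all
    return tail.strip()


# Same shape as the file body shown above, including the trailing whitespace.
body = (
    "\n            url: https://x.x.x.x/apaas/app/test/release_2.7.1/20220831_4393/index.html"
    "\n            global_url: url"
    "=https://solutions-apaas.agora.io/apaas/app/test/release_2.7.1/20220831_4393/index.html"
    "\n            \n"
)

print(repr(body.split("=")[-1]))   # still carries the trailing newline and spaces
print(extract_package_url(body))   # whitespace removed
```

Like `split("=")[-1]`, this takes everything after the last "=", so a URL that itself contains "=" (for example in a query string) would be cut short; `text.partition("=")` would be the variant to try in that case.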


 
