Python模块pyquery应用技巧
        【摘要】 
                     
 公众号 yk 坤帝 
 后台回复 pyquery 获取全部源代码 
 
1. 初始化部分 
1.1 字符串初始化 
1.2 URL初始化 
1.3 文件初始化 
2. 基本CSS选择器 
3. 查找元素 
3.1 查找子元素 
3.2 查找父元素 
3.3 查找兄弟元素 
4. 遍历单个元素 
5. 获取属性,文本,HT...
    
    
    
    公众号 yk 坤帝
后台回复 pyquery 获取全部源代码
1. 初始化部分
1.1 字符串初始化
1.2 URL初始化
1.3 文件初始化
2. 基本CSS选择器
3. 查找元素
3.1 查找子元素
3.2 查找父元素
3.3 查找兄弟元素
4. 遍历单个元素
5. 获取属性,文本,HTML
5.1 获取属性
5.2 获取文本
5.3 获取HTML
6. DOM操作
6.1 addClass、removeClass
6.2 attr、css
6.3 remove
7. 伪类选择器
1. 初始化部分
1.1 字符串初始化
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div>
     
    
 
   - 
    
     
    
    
         <ul>
     
    
 
   - 
    
     
    
    
              <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
              <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
          </ul>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      print(doc('li'))
     
    
 
  
 
1.2 URL初始化
  
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(url='http://www.baidu.com')
     
    
 
   - 
    
     
    
    
     
      print(doc('head'))
     
    
 
  
 
1.3 文件初始化
  
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(filename='demo.html')
     
    
 
   - 
    
     
    
    
     
      print(doc('li'))
     
    
 
  
 
2. 基本CSS选择器
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div id="container">
     
    
 
   - 
    
     
    
    
         <ul class="list">
     
    
 
   - 
    
     
    
    
              <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
              <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
          </ul>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      print(doc('#container .list li'))
     
    
 
  
 
3. 查找元素
3.1 查找子元素
  
   - 
    
     
    
    
     
      公众号 yk 坤帝
     
    
 
   - 
    
     
    
    
      
     
    
 
   - 
    
     
    
    
     
      后台回复 pyquery 获取全部源代码
     
    
 
   - 
    
     
    
    
      
     
    
 
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div id="container">
     
    
 
   - 
    
     
    
    
     
       <ul class="list">
     
    
 
   - 
    
     
    
    
     
       <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
     
       <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
     
       <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
     
       <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
     
       <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
     
       </ul>
     
    
 
   - 
    
     
    
    
     
       </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      items = doc('.list')
     
    
 
   - 
    
     
    
    
     
      print(type(items))
     
    
 
   - 
    
     
    
    
     
      print(items)
     
    
 
   - 
    
     
    
    
     
      lis = items.find('li')
     
    
 
   - 
    
     
    
    
     
      print(type(lis))
     
    
 
   - 
    
     
    
    
     
      print(lis)
     
    
 
  
 
  
   - 
    
     
    
    
     
      lis = items.children()
     
    
 
   - 
    
     
    
    
     
      print(type(lis))
     
    
 
   - 
    
     
    
    
     
      print(lis)
     
    
 
  
 
3.2 查找父元素
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div id="container">
     
    
 
   - 
    
     
    
    
         <ul class="list">
     
    
 
   - 
    
     
    
    
              <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
              <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
              <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
          </ul>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      items = doc('.list')
     
    
 
   - 
    
     
    
    
     
      container = items.parent()
     
    
 
   - 
    
     
    
    
     
      print(type(container))
     
    
 
   - 
    
     
    
    
     
      print(container)
     
    
 
  
 
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      items = doc('.list')
     
    
 
   - 
    
     
    
    
     
      parents = items.parents()
     
    
 
   - 
    
     
    
    
     
      print(type(parents))
     
    
 
   - 
    
     
    
    
     
      print(parents)
     
    
 
  
 
  
   - 
    
     
    
    
     
      parent = items.parents('.wrap')
     
    
 
   - 
    
     
    
    
     
      print(parent)
     
    
 
  
 
3.3 查找兄弟元素
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      li = doc('.list .item-0.active')
     
    
 
   - 
    
     
    
    
     
      print(li.siblings())
     
    
 
  
 
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      li = doc('.list .item-0.active')
     
    
 
   - 
    
     
    
    
     
      print(li.siblings('.active'))
     
    
 
  
 
4. 遍历单个元素
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      li = doc('.item-0.active')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
  
 
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      lis = doc('li').items()
     
    
 
   - 
    
     
    
    
     
      print(type(lis))
     
    
 
   - 
    
     
    
    
     
      for li in lis:
     
    
 
   - 
    
     
    
    
     
          print(li)
     
    
 
  
 
5. 获取属性,文本,HTML
5.1 获取属性
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      a = doc('.item-0.active a')
     
    
 
   - 
    
     
    
    
     
      print(a)
     
    
 
   - 
    
     
    
    
     
      print(a.attr('href'))
     
    
 
   - 
    
     
    
    
     
      print(a.attr.href)
     
    
 
  
 
5.2 获取文本
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      a = doc('.item-0.active a')
     
    
 
   - 
    
     
    
    
     
      print(a)
     
    
 
   - 
    
     
    
    
     
      print(a.text())
     
    
 
  
 
5.3 获取HTML
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      li = doc('.item-0.active')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      print(li.html())
     
    
 
  
 
6. DOM操作
6.1 addClass、removeClass
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      li = doc('.item-0.active')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li.removeClass('active')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li.addClass('active')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
  
 
6.2 attr、css
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      li = doc('.item-0.active')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li.attr('name', 'link')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li.css('font-size', '14px')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
  
 
6.3 remove
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
     
       Hello, World
     
    
 
   - 
    
     
    
    
     
       <p>This is a paragraph.</p>
     
    
 
   - 
    
     
    
    
     
       </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      wrap = doc('.wrap')
     
    
 
   - 
    
     
    
    
     
      print(wrap.text())
     
    
 
   - 
    
     
    
    
     
      wrap.find('p').remove()
     
    
 
   - 
    
     
    
    
     
      print(wrap.text())
     
    
 
  
 
7. 伪类选择器
  
   - 
    
     
    
    
     
      html = '''
     
    
 
   - 
    
     
    
    
     
      <div class="wrap">
     
    
 
   - 
    
     
    
    
         <div id="container">
     
    
 
   - 
    
     
    
    
             <ul class="list">
     
    
 
   - 
    
     
    
    
                  <li class="item-0">first item</li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1"><a href="link2.html">second item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0 active"><a href="link3.html"><span class="bold">third item</span></a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-1 active"><a href="link4.html">fourth item</a></li>
     
    
 
   - 
    
     
    
    
                  <li class="item-0"><a href="link5.html">fifth item</a></li>
     
    
 
   - 
    
     
    
    
              </ul>
     
    
 
   - 
    
     
    
    
          </div>
     
    
 
   - 
    
     
    
    
      </div>
     
    
 
   - 
    
     
    
    
     
      '''
     
    
 
   - 
    
     
    
    
     
      from pyquery import PyQuery as pq
     
    
 
   - 
    
     
    
    
     
      doc = pq(html)
     
    
 
   - 
    
     
    
    
     
      li = doc('li:first-child')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li = doc('li:last-child')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li = doc('li:nth-child(2)')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li = doc('li:gt(2)')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li = doc('li:nth-child(2n)')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
   - 
    
     
    
    
     
      li = doc('li:contains(second)')
     
    
 
   - 
    
     
    
    
     
      print(li)
     
    
 
  
 
公众号 yk 坤帝
后台回复 pyquery 获取全部源代码
文章来源: blog.csdn.net,作者:yk 坤帝,版权归原作者所有,如需转载,请联系作者。
原文链接:blog.csdn.net/qq_45803923/article/details/123672317
        【版权声明】本文为华为云社区用户转载文章,如果您发现本社区中有涉嫌抄袭的内容,欢迎发送邮件进行举报,并提供相关证据,一经查实,本社区将立刻删除涉嫌侵权内容,举报邮箱:
            cloudbbs@huaweicloud.com
        
        
        
        
        
        
        - 点赞
 - 收藏
 - 关注作者
 
            
           
评论(0)