      Using Python to extract English words from a document, look up their translations, and save them as xlsx

      Code:

      # -*- coding: utf-8 -*-
      """
      Created on Fri Aug  5 17:11:50 2022
      
      @author: koneko
      """
      import re
      
      import docx
      import openpyxl
      import requests
      
         
      
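      def translate(keyword):
          """Return the first suggested translation of keyword, or [] if there is none."""
          url = 'https://fanyi.baidu.com/sug'
          data = {'kw': keyword}
          response = requests.post(url, data=data)
          result = response.json()
          if result['data'] == []:
              return []
          return result['data'][0]['v']
      
      
      def lang_detect(keyword):
          """Return the language code that Baidu detects for keyword, e.g. 'en'."""
          url = 'https://fanyi.baidu.com/langdetect'
          data = {'query': keyword}
          response = requests.post(url, data=data)
          result = response.json()
          return result['lan']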
      
      
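      def load_docx_and_get_words(fileName):
          """Read a .docx file and return its unique lowercase English-letter words."""
          doc = docx.Document(fileName)
          
          # Join paragraphs with a newline so that the last word of one paragraph
          # does not run into the first word of the next.
          text = '\n'.join(paragraph.text for paragraph in doc.paragraphs)
          
          text = text.lower()
          words = re.findall(r'[a-z]+', text)
          words = list(set(words))
          print('Parsed ' + str(len(words)) + ' unique words in total')
          return words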
          
      
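      def words_filter(words):
          """Keep only words longer than two letters that are detected as English."""
          kept = []
          for i, word in enumerate(words):
              print(i, word)
              # Build a new list: removing items from `words` while iterating
              # over it would skip the element after each removal.
              if len(word) <= 2:
                  print('remove ' + word + ' for length <= 2')
                  continue
              if lang_detect(word) != 'en':
                  print('remove ' + word + ' for not english')
                  continue
              kept.append(word)
          print(str(len(kept)) + ' words remain after filtering')
          return kept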
      
      
      
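      def words_to_dictionary(words):
          """Map each word to its translation, sorted alphabetically by word."""
          dictionary = dict()
          for word in words:
              trans = translate(word)
              # Retry up to three more times if the lookup came back empty.
              remains = 3
              while remains and trans == []:
                  trans = translate(word)
                  remains -= 1
              if trans == []:
                  print(word, 'no translation found')
                  continue
              print(word)
              print(trans)
              dictionary[word] = trans
          
          # Sort by key (alphabetical order); sorted() returns a list of
          # (word, translation) pairs, so convert it back to a dict.
          return dict(sorted(dictionary.items(), key=lambda x: x[0]))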
      
      
      def save_to_xlsx(fileName, dictionary):
          wb = openpyxl.Workbook()
          ws = wb.active
          ws.title = 'vocabulary'
          for row, item in enumerate(dictionary.items()):
              ws.cell(row+1, 1).value = item[0]
              ws.cell(row+1, 2).value = item[1]
          
          wb.save(fileName+'.xlsx')
      
      
      if __name__ == '__main__':
          words = load_docx_and_get_words('cet4-1.docx')
          words = words_filter(words)
          dictionary = words_to_dictionary(words)
          save_to_xlsx('myVocabulary', dictionary)
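
The word-extraction and length-filter steps need neither the network nor a Word file, so they can be tried out on a plain string first. The following is only a minimal sketch under that assumption: the function name `extract_words` and the `min_length` parameter are hypothetical and not part of the script above, and the Baidu language check is deliberately left out.

```python
import re


def extract_words(text, min_length=3):
    """Return the unique lowercase ASCII words in text, sorted alphabetically.

    Hypothetical helper for offline testing; words shorter than
    min_length letters are dropped, mirroring the length check above.
    """
    words = re.findall(r'[a-z]+', text.lower())
    return sorted({word for word in words if len(word) >= min_length})


if __name__ == '__main__':
    sample = 'The cat sat on the mat.\nA cat is an animal.'
    print(extract_words(sample))  # ['animal', 'cat', 'mat', 'sat', 'the']
```

Checking this part on a short string first may make it easier to see how many lookups a real document would trigger before sending any requests.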
      
      
      
      
          
      
      posted @ 2022-08-05 17:31  裏表異體