在推文中查找完整的表情符号,而不是单独的字符 [英] Find emojis in a tweet as whole clusters and not as individual chars

查看:21
本文介绍了在推文中查找完整的表情符号,而不是单独的字符的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

所以我在这里问的第一个问题是关于表情符号的。对不起。

我在Tweepyregex(也尝试了python-pcre)的帮助下用Python制作了一个推特机器人,它将分析给定用户的推文,并记录一个单词或表情符号被使用的次数。我可以很好地完成大部分工作。我的问题从表情符号开始。

我的印象是,当使用X时,(在regex和python-pcre中)会找到扩展的字素簇。不只是个人。我在另一篇帖子What does the expression X match when inside a RegEx?中读到,X遵循一套指导原则来确定下一次充电是否应该集群,但始终至少返回1。

我在本文中尝试了第一种和第二种解决方案:How to extract all the emojis from text?

第一个与预期一致。获取单个代码点并将其添加到列表中。非常适合单代码点emoji,但我需要捕获具有多个代码点的emoji和单代码点emoji。

第二个解决方案一是我遇到问题的地方。根据POST,这个函数应该以字符串的形式打印表情符号,以空格分隔成簇。

def split_count(self, text):
        emoji_list = []
        data = regex.findall(r'X', text)
        for word in data:
            if any(char in emoji.UNICODE_EMOJI for char in word):
                emoji_list.append(word)
        return emoji_list

当调用Like时:

counter = self.split_count(tweet)
print(' '.join(emoji for emoji in counter))

应导致:

😁 👨‍👩‍👦‍👦

但是,当我运行它时,我得到:

😁 👨 👩 👦 👦

100%未群集化。

为什么会发生这种情况?这件事已经困扰我好几天了。

推荐答案

请注意,X匹配一个代码点,但emoji可能包含多个代码点,您可以自己在问题中提及。

您应该使用已在使用的emoji解析库从文本中提取emoji。

使用re,不需要regex

import re
from emoji import UNICODE_EMOJI

e_list = UNICODE_EMOJI.keys()
extract_emojis = re.compile("|".join(map(re.escape, sorted(e_list,key=len,reverse=True))))

#...

def split_count(self, text):
        return extract_emojis.findall(text)
        

请注意,使用此代码对包含Emoji List, v14.0中所有唯一表情符号的文件运行它时,由于超时,我无法获得任何结果。

因此,要匹配这些最新的emoji表情符号,您需要一个自定义正则表达式:

emoji_rx = r"[#*0-9]uFE0F?u20E3|©uFE0F?|[®u203Cu2049u2122u2139u2194-u2199u21A9u21AA]uFE0F?|[u231Au231B]|[u2328u23CF]uFE0F?|[u23E9-u23EC]|[u23ED-u23EF]uFE0F?|u23F0|[u23F1u23F2]uFE0F?|u23F3|[u23F8-u23FAu24C2u25AAu25ABu25B6u25C0u25FBu25FC]uFE0F?|[u25FDu25FE]|[u2600-u2604u260Eu2611]uFE0F?|[u2614u2615]|u2618uFE0F?|u261D[uFE0FU0001F3FB-U0001F3FF]?|[u2620u2622u2623u2626u262Au262Eu262Fu2638-u263Au2640u2642]uFE0F?|[u2648-u2653]|[u265Fu2660u2663u2665u2666u2668u267Bu267E]uFE0F?|u267F|u2692uFE0F?|u2693|[u2694-u2697u2699u269Bu269Cu26A0]uFE0F?|u26A1|u26A7uFE0F?|[u26AAu26AB]|[u26B0u26B1]uFE0F?|[u26BDu26BEu26C4u26C5]|u26C8uFE0F?|u26CE|[u26CFu26D1u26D3]uFE0F?|u26D4|u26E9uFE0F?|u26EA|[u26F0u26F1]uFE0F?|[u26F2u26F3]|u26F4uFE0F?|u26F5|[u26F7u26F8]uFE0F?|u26F9(?:u200D[u2640u2642]uFE0F?|[uFE0FU0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[u26FAu26FD]|u2702uFE0F?|u2705|[u2708u2709]uFE0F?|[u270Au270B][U0001F3FB-U0001F3FF]?|[u270Cu270D][uFE0FU0001F3FB-U0001F3FF]?|u270FuFE0F?|[u2712u2714u2716u271Du2721]uFE0F?|u2728|[u2733u2734u2744u2747]uFE0F?|[u274Cu274Eu2753-u2755u2757]|u2763uFE0F?|u2764(?:u200D[U0001F525U0001FA79]|uFE0F(?:u200D[U0001F525U0001FA79])?)?|[u2795-u2797]|u27A1uFE0F?|[u27B0u27BF]|[u2934u2935u2B05-u2B07]uFE0F?|[u2B1Bu2B1Cu2B50u2B55]|[u3030u303Du3297u3299]uFE0F?|[U0001F004U0001F0CF]|[U0001F170U0001F171U0001F17EU0001F17F]uFE0F?|[U0001F18EU0001F191-U0001F19A]|U0001F1E6[U0001F1E8-U0001F1ECU0001F1EEU0001F1F1U0001F1F2U0001F1F4U0001F1F6-U0001F1FAU0001F1FCU0001F1FDU0001F1FF]|U0001F1E7[U0001F1E6U0001F1E7U0001F1E9-U0001F1EFU0001F1F1-U0001F1F4U0001F1F6-U0001F1F9U0001F1FBU0001F1FCU0001F1FEU0001F1FF]|U0001F1E8[U0001F1E6U0001F1E8U0001F1E9U0001F1EB-U0001F1EEU0001F1F0-U0001F1F5U0001F1F7U0001F1FA-U0001F1FF]|U0001F1E9[U0001F1EAU0001F1ECU0001F1EFU0001F1F0U0001F1F2U0001F1F4U0001F1FF]|U0001F1EA[U0001F1E6U0001F1E8U0001F1EAU0001F1ECU0001F1EDU0001F1F7-U0001F1FA]|U0001F1EB[U0001F1EE-U0001F1F0U0001F1F2U0001F1F4U0001F1F7]|U0001F1EC[U0001F1E6U0001F1E7U0001F1E9-U0001F1EEU0001F1F1-U0001F1F3U0001F1F5-U0001F1FAU0001F1FCU0001F1FE]|U0001F1ED[U0001F1F0U0001F1F2U0001F1F3U0001F1F7U0001F1F9U0001F1FA]|U0001F1EE[U0001F1E8-U0001F1EAU0001F1F1-U0001F1F4U0001F1F6-U0001F1F9]|U0001F1EF[U0001F1EAU0001F1F2U0001F1F4U0001F1F5]|U0001F1F0[U0001F1EAU0001F1EC-U0001F1EEU0001F1F2U0001F1F3U0001F1F5U0001F1F7U0001F1FCU0001F1FEU0001F1FF]|U0001F1F1[U0001F1E6-U0001F1E8U0001F1EEU0001F1F0U0001F1F7-U0001F1FBU0001F1FE]|U0001F1F2[U0001F1E6U0001F1E8-U0001F1EDU0001F1F0-U0001F1FF]|U0001F1F3[U0001F1E6U0001F1E8U0001F1EA-U0001F1ECU0001F1EEU0001F1F1U0001F1F4U0001F1F5U0001F1F7U0001F1FAU0001F1FF]|U0001F1F4U0001F1F2|U0001F1F5[U0001F1E6U0001F1EA-U0001F1EDU0001F1F0-U0001F1F3U0001F1F7-U0001F1F9U0001F1FCU0001F1FE]|U0001F1F6U0001F1E6|U0001F1F7[U0001F1EAU0001F1F4U0001F1F8U0001F1FAU0001F1FC]|U0001F1F8[U0001F1E6-U0001F1EAU0001F1EC-U0001F1F4U0001F1F7-U0001F1F9U0001F1FBU0001F1FD-U0001F1FF]|U0001F1F9[U0001F1E6U0001F1E8U0001F1E9U0001F1EB-U0001F1EDU0001F1EF-U0001F1F4U0001F1F7U0001F1F9U0001F1FBU0001F1FCU0001F1FF]|U0001F1FA[U0001F1E6U0001F1ECU0001F1F2U0001F1F3U0001F1F8U0001F1FEU0001F1FF]|U0001F1FB[U0001F1E6U0001F1E8U0001F1EAU0001F1ECU0001F1EEU0001F1F3U0001F1FA]|U0001F1FC[U0001F1EBU0001F1F8]|U0001F1FDU0001F1F0|U0001F1FE[U0001F1EAU0001F1F9]|U0001F1FF[U0001F1E6U0001F1F2U0001F1FC]|U0001F201|U0001F202uFE0F?|[U0001F21AU0001F22FU0001F232-U0001F236]|U0001F237uFE0F?|[U0001F238-U0001F23AU0001F250U0001F251U0001F300-U0001F320]|[U0001F321U0001F324-U0001F32C]uFE0F?|[U0001F32D-U0001F335]|U0001F336uFE0F?|[U0001F337-U0001F37C]|U0001F37DuFE0F?|[U0001F37E-U0001F384]|U0001F385[U0001F3FB-U0001F3FF]?|[U0001F386-U0001F393]|[U0001F396U0001F397U0001F399-U0001F39BU0001F39EU0001F39F]uFE0F?|[U0001F3A0-U0001F3C1]|U0001F3C2[U0001F3FB-U0001F3FF]?|[U0001F3C3U0001F3C4](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F3C5U0001F3C6]|U0001F3C7[U0001F3FB-U0001F3FF]?|[U0001F3C8U0001F3C9]|U0001F3CA(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F3CBU0001F3CC](?:u200D[u2640u2642]uFE0F?|[uFE0FU0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F3CDU0001F3CE]uFE0F?|[U0001F3CF-U0001F3D3]|[U0001F3D4-U0001F3DF]uFE0F?|[U0001F3E0-U0001F3F0]|U0001F3F3(?:u200D(?:u26A7uFE0F?|U0001F308)|uFE0F(?:u200D(?:u26A7uFE0F?|U0001F308))?)?|U0001F3F4(?:u200Du2620uFE0F?|U000E0067U000E0062(?:U000E0065U000E006EU000E0067|U000E0073U000E0063U000E0074|U000E0077U000E006CU000E0073)U000E007F)?|[U0001F3F5U0001F3F7]uFE0F?|[U0001F3F8-U0001F407]|U0001F408(?:u200Du2B1B)?|[U0001F409-U0001F414]|U0001F415(?:u200DU0001F9BA)?|[U0001F416-U0001F43A]|U0001F43B(?:u200Du2744uFE0F?)?|[U0001F43C-U0001F43E]|U0001F43FuFE0F?|U0001F440|U0001F441(?:u200DU0001F5E8uFE0F?|uFE0F(?:u200DU0001F5E8uFE0F?)?)?|[U0001F442U0001F443][U0001F3FB-U0001F3FF]?|[U0001F444U0001F445]|[U0001F446-U0001F450][U0001F3FB-U0001F3FF]?|[U0001F451-U0001F465]|[U0001F466U0001F467][U0001F3FB-U0001F3FF]?|U0001F468(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D)?U0001F468|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3ED]|U0001F466(?:u200DU0001F466)?|U0001F467(?:u200D[U0001F466U0001F467])?|[U0001F468U0001F469]u200D(?:U0001F466(?:u200DU0001F466)?|U0001F467(?:u200D[U0001F466U0001F467])?)|[U0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692U0001F9AF-U0001F9B3U0001F9BCU0001F9BD])|U0001F3FB(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D)?U0001F468[U0001F3FB-U0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F468[U0001F3FC-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FC(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D)?U0001F468[U0001F3FB-U0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F468[U0001F3FBU0001F3FD-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FD(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D)?U0001F468[U0001F3FB-U0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F468[U0001F3FBU0001F3FCU0001F3FEU0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FE(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D)?U0001F468[U0001F3FB-U0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F468[U0001F3FB-U0001F3FDU0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FF(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D)?U0001F468[U0001F3FB-U0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F468[U0001F3FB-U0001F3FE]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?)?|U0001F469(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D)?[U0001F468U0001F469]|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3ED]|U0001F466(?:u200DU0001F466)?|U0001F467(?:u200D[U0001F466U0001F467])?|U0001F469u200D(?:U0001F466(?:u200DU0001F466)?|U0001F467(?:u200D[U0001F466U0001F467])?)|[U0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692U0001F9AF-U0001F9B3U0001F9BCU0001F9BD])|U0001F3FB(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:[U0001F468U0001F469][U0001F3FB-U0001F3FF]|U0001F48Bu200D[U0001F468U0001F469][U0001F3FB-U0001F3FF])|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200D[U0001F468U0001F469][U0001F3FC-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FC(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:[U0001F468U0001F469][U0001F3FB-U0001F3FF]|U0001F48Bu200D[U0001F468U0001F469][U0001F3FB-U0001F3FF])|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200D[U0001F468U0001F469][U0001F3FBU0001F3FD-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FD(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:[U0001F468U0001F469][U0001F3FB-U0001F3FF]|U0001F48Bu200D[U0001F468U0001F469][U0001F3FB-U0001F3FF])|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200D[U0001F468U0001F469][U0001F3FBU0001F3FCU0001F3FEU0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FE(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:[U0001F468U0001F469][U0001F3FB-U0001F3FF]|U0001F48Bu200D[U0001F468U0001F469][U0001F3FB-U0001F3FF])|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200D[U0001F468U0001F469][U0001F3FB-U0001F3FDU0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FF(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:[U0001F468U0001F469][U0001F3FB-U0001F3FF]|U0001F48Bu200D[U0001F468U0001F469][U0001F3FB-U0001F3FF])|[U0001F33EU0001F373U0001F37CU0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200D[U0001F468U0001F469][U0001F3FB-U0001F3FE]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?)?|U0001F46A|[U0001F46B-U0001F46D][U0001F3FB-U0001F3FF]?|U0001F46E(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F46F(?:u200D[u2640u2642]uFE0F?)?|[U0001F470U0001F471](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F472[U0001F3FB-U0001F3FF]?|U0001F473(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F474-U0001F476][U0001F3FB-U0001F3FF]?|U0001F477(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F478[U0001F3FB-U0001F3FF]?|[U0001F479-U0001F47B]|U0001F47C[U0001F3FB-U0001F3FF]?|[U0001F47D-U0001F480]|[U0001F481U0001F482](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F483[U0001F3FB-U0001F3FF]?|U0001F484|U0001F485[U0001F3FB-U0001F3FF]?|[U0001F486U0001F487](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F488-U0001F48E]|U0001F48F[U0001F3FB-U0001F3FF]?|U0001F490|U0001F491[U0001F3FB-U0001F3FF]?|[U0001F492-U0001F4A9]|U0001F4AA[U0001F3FB-U0001F3FF]?|[U0001F4AB-U0001F4FC]|U0001F4FDuFE0F?|[U0001F4FF-U0001F53D]|[U0001F549U0001F54A]uFE0F?|[U0001F54B-U0001F54EU0001F550-U0001F567]|[U0001F56FU0001F570U0001F573]uFE0F?|U0001F574[uFE0FU0001F3FB-U0001F3FF]?|U0001F575(?:u200D[u2640u2642]uFE0F?|[uFE0FU0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F576-U0001F579]uFE0F?|U0001F57A[U0001F3FB-U0001F3FF]?|[U0001F587U0001F58A-U0001F58D]uFE0F?|U0001F590[uFE0FU0001F3FB-U0001F3FF]?|[U0001F595U0001F596][U0001F3FB-U0001F3FF]?|U0001F5A4|[U0001F5A5U0001F5A8U0001F5B1U0001F5B2U0001F5BCU0001F5C2-U0001F5C4U0001F5D1-U0001F5D3U0001F5DC-U0001F5DEU0001F5E1U0001F5E3U0001F5E8U0001F5EFU0001F5F3U0001F5FA]uFE0F?|[U0001F5FB-U0001F62D]|U0001F62E(?:u200DU0001F4A8)?|[U0001F62F-U0001F634]|U0001F635(?:u200DU0001F4AB)?|U0001F636(?:u200DU0001F32BuFE0F?)?|[U0001F637-U0001F644]|[U0001F645-U0001F647](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F648-U0001F64A]|U0001F64B(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F64C[U0001F3FB-U0001F3FF]?|[U0001F64DU0001F64E](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F64F[U0001F3FB-U0001F3FF]?|[U0001F680-U0001F6A2]|U0001F6A3(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F6A4-U0001F6B3]|[U0001F6B4-U0001F6B6](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F6B7-U0001F6BF]|U0001F6C0[U0001F3FB-U0001F3FF]?|[U0001F6C1-U0001F6C5]|U0001F6CBuFE0F?|U0001F6CC[U0001F3FB-U0001F3FF]?|[U0001F6CD-U0001F6CF]uFE0F?|[U0001F6D0-U0001F6D2U0001F6D5-U0001F6D7U0001F6DD-U0001F6DF]|[U0001F6E0-U0001F6E5U0001F6E9]uFE0F?|[U0001F6EBU0001F6EC]|[U0001F6F0U0001F6F3]uFE0F?|[U0001F6F4-U0001F6FCU0001F7E0-U0001F7EBU0001F7F0]|U0001F90C[U0001F3FB-U0001F3FF]?|[U0001F90DU0001F90E]|U0001F90F[U0001F3FB-U0001F3FF]?|[U0001F910-U0001F917]|[U0001F918-U0001F91F][U0001F3FB-U0001F3FF]?|[U0001F920-U0001F925]|U0001F926(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F927-U0001F92F]|[U0001F930-U0001F934][U0001F3FB-U0001F3FF]?|U0001F935(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F936[U0001F3FB-U0001F3FF]?|[U0001F937-U0001F939](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F93A|U0001F93C(?:u200D[u2640u2642]uFE0F?)?|[U0001F93DU0001F93E](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F93F-U0001F945U0001F947-U0001F976]|U0001F977[U0001F3FB-U0001F3FF]?|[U0001F978-U0001F9B4]|[U0001F9B5U0001F9B6][U0001F3FB-U0001F3FF]?|U0001F9B7|[U0001F9B8U0001F9B9](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F9BA|U0001F9BB[U0001F3FB-U0001F3FF]?|[U0001F9BC-U0001F9CC]|[U0001F9CD-U0001F9CF](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F9D0|U0001F9D1(?:u200D(?:[u2695u2696u2708]uFE0F?|[U0001F33EU0001F373U0001F37CU0001F384U0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F9D1|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD])|U0001F3FB(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D|)U0001F9D1[U0001F3FC-U0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F384U0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F9D1[U0001F3FB-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FC(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D|)U0001F9D1[U0001F3FBU0001F3FD-U0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F384U0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F9D1[U0001F3FB-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FD(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D|)U0001F9D1[U0001F3FBU0001F3FCU0001F3FEU0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F384U0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F9D1[U0001F3FB-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FE(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D|)U0001F9D1[U0001F3FB-U0001F3FDU0001F3FF]|[U0001F33EU0001F373U0001F37CU0001F384U0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F9D1[U0001F3FB-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?|U0001F3FF(?:u200D(?:[u2695u2696u2708]uFE0F?|u2764uFE0F?u200D(?:U0001F48Bu200D|)U0001F9D1[U0001F3FB-U0001F3FE]|[U0001F33EU0001F373U0001F37CU0001F384U0001F393U0001F3A4U0001F3A8U0001F3EBU0001F3EDU0001F4BBU0001F4BCU0001F527U0001F52CU0001F680U0001F692]|U0001F91Du200DU0001F9D1[U0001F3FB-U0001F3FF]|[U0001F9AF-U0001F9B3U0001F9BCU0001F9BD]))?)?|[U0001F9D2U0001F9D3][U0001F3FB-U0001F3FF]?|U0001F9D4(?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|U0001F9D5[U0001F3FB-U0001F3FF]?|[U0001F9D6-U0001F9DD](?:u200D[u2640u2642]uFE0F?|[U0001F3FB-U0001F3FF](?:u200D[u2640u2642]uFE0F?)?)?|[U0001F9DEU0001F9DF](?:u200D[u2640u2642]uFE0F?)?|[U0001F9E0-U0001F9FFU0001FA70-U0001FA74U0001FA78-U0001FA7CU0001FA80-U0001FA86U0001FA90-U0001FAACU0001FAB0-U0001FABAU0001FAC0-U0001FAC2]|[U0001FAC3-U0001FAC5][U0001F3FB-U0001F3FF]?|[U0001FAD0-U0001FAD9U0001FAE0-U0001FAE7]|U0001FAF0[U0001F3FB-U0001F3FF]?|U0001FAF1(?:U0001F3FB(?:u200DU0001FAF2[U0001F3FC-U0001F3FF])?|U0001F3FC(?:u200DU0001FAF2[U0001F3FBU0001F3FD-U0001F3FF])?|U0001F3FD(?:u200DU0001FAF2[U0001F3FBU0001F3FCU0001F3FEU0001F3FF])?|U0001F3FE(?:u200DU0001FAF2[U0001F3FB-U0001F3FDU0001F3FF])?|U0001F3FF(?:u200DU0001FAF2[U0001F3FB-U0001F3FE])?)?|[U0001FAF2-U0001FAF6][U0001F3FB-U0001F3FF]?"
extract_emoji = re.compile(emoji_rx)                   # Match a single emoji
extract_emoji_chunks = re.compile(f'(?:{emoji_rx})+')  # Match one or more emojis
extract_5_emoji_string = re.compile(f'^(?:{emoji_rx}){{5}}$')  # Match string of 5 emojis

请参阅this regex demo匹配正确的表情符号数量(4702)。

这篇关于在推文中查找完整的表情符号,而不是单独的字符的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆