Emoji crashed when uploading to BigQuery

Problem description

Currently, I'm facing an issue uploading emoji data to BigQuery (using Python).

This is the sample data I'm trying to upload to BQ:

 {"emojiCharts":{"emoji_icon":"\ud83d\udc4d","repost": 4, "doc": 4, "engagement": 0, "reach": 0, "impression": 0}}
 {"emojiCharts":{"emoji_icon":"\ud83d\udc49","repost": 4, "doc": 4, "engagement": 43, "reach": 722, "impression": 4816}}
 {"emojiCharts":{"emoji_icon":"\u203c","repost": 4, "doc": 4, "engagement": 0, "reach": 0, "impression": 0}}
 {"emojiCharts":{"emoji_icon":"\ud83c\udf89","repost": 5, "doc": 5, "engagement": 43, "reach": 829, "impression": 5529}}
 {"emojiCharts":{"emoji_icon":"\ud83d\ude34","repost": 5, "doc": 5, "engagement": 222, "reach": 420, "impression": 2805}}
 {"emojiCharts":{"emoji_icon":"\ud83d\ude31","repost": 3, "doc": 3, "engagement": 386, "reach": 2868, "impression": 19122}}
 {"emojiCharts":{"emoji_icon":"\ud83d\udc4d\ud83c\udffb","repost": 5, "doc": 5, "engagement": 43, "reach": 1064, "impression": 7098}}
 {"emojiCharts":{"emoji_icon":"\ud83d\ude3b","repost": 3, "doc": 3, "engagement": 93, "reach": 192, "impression": 1283}}
 {"emojiCharts":{"emoji_icon":"\ud83d\ude2d","repost": 6, "doc": 6, "engagement": 212, "reach": 909, "impression": 6143}}
 {"emojiCharts":{"emoji_icon":"\ud83e\udd84","repost": 8, "doc": 8, "engagement": 313, "reach": 402, "impression": 2681}}
 {"emojiCharts":{"emoji_icon":"\ud83d\ude18","repost": 7, "doc": 7, "engagement": 0, "reach": 8454, "impression": 56366}}
 {"emojiCharts":{"emoji_icon":"\ud83d\ude05","repost": 5, "doc": 5, "engagement": 74, "reach": 1582, "impression": 10550}}
 {"emojiCharts":{"emoji_icon":"\ud83d\ude04","repost": 5, "doc": 5, "engagement": 73, "reach": 3329, "impression": 22206}}

The issue is that BigQuery cannot display any emoji written as a surrogate pair (\ud83d\ude04); it displays only those written in this format (\u203c).

Even though the field is a STRING, it displays two black diamonds. Why can't BQ display the emoji as a string without converting it to the actual emoji?

Questions:

Is there any way to upload emoji to BigQuery so that they load correctly? The data will be used in Google Data Studio.

Should I manually (hard-coded) change all the emoji codes to acceptable ones? If so, what is the acceptable format?

Solution

As user 'numeral' mentions in their comment:

Check out charbase.com/1f618-unicode-face-throwing-a-kiss. What you want is to convert the JavaScript escape characters to actual Unicode data.

In other words, you need to change the encoding of the emoji so that each one is accurately represented as a single character:

SELECT "\U0001f604 \U0001f4b8"
--   , "\ud83d\udcb8"
--   , "\ud83d\ude04"

The 2nd and 3rd lines, if uncommented, fail with an error like Illegal escape sequence: Unicode value \ud83d is invalid at [2:7], but the first line displays correctly in BigQuery and Data Studio.
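
Since the question mentions Python, here is a minimal sketch of one way to do that conversion before uploading. It assumes the input is newline-delimited JSON whose emoji are written as \uXXXX surrogate-pair escapes, as in the sample data; the function name to_utf8_ndjson is made up for illustration and is not part of any BigQuery client library. Python's json module joins each surrogate pair into a single code point when parsing, and ensure_ascii=False writes that code point back out as a literal character.

```python
import json


def to_utf8_ndjson(lines):
    """Re-serialize NDJSON rows so emoji become literal Unicode characters.

    Hypothetical helper for illustration; assumes each non-empty line is one
    JSON object using \\uXXXX escapes for its emoji.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between rows
        # json.loads combines a pair such as \ud83d\ude04 into U+1F604.
        row = json.loads(line)
        # ensure_ascii=False keeps the emoji as a real character instead of
        # escaping it back into a surrogate pair.
        yield json.dumps(row, ensure_ascii=False)


if __name__ == "__main__":
    sample = [
        r'{"emojiCharts":{"emoji_icon":"\ud83d\ude04","repost": 5, "doc": 5}}',
        r'{"emojiCharts":{"emoji_icon":"\u203c","repost": 4, "doc": 4}}',
    ]
    for converted in to_utf8_ndjson(sample):
        print(converted)
```

When writing the converted rows to a file for upload, open it with encoding="utf-8" so the literal characters are stored as UTF-8. Whether the loaded values then render correctly in BigQuery and Data Studio should be checked against your own table, since this sketch only covers the re-encoding step.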

Additional thoughts about this:
