GoLang - 坚持使用ISO-8859-1字符集 [英] GoLang - Persist using ISO-8859-1 charset

查看:964
本文介绍了GoLang - 坚持使用ISO-8859-1字符集的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

我正在开发一个项目,我们需要将信息保留在具有ISO-8859-1表的旧数据库中。所以在写数据库之前,我需要将它从UTF-8转换为ISO-8859-1,并且每次从数据库中检索它时,我都需要将它转换回UTF-8。

I'm developing a project where we need to persist our information in a legacy database that has ISO-8859-1 tables. So before writing something to the database I need to convert it from UTF-8 to ISO-8859-1, and every time I retrieve it from the database, I need to convert it back to UTF-8.

我试图对每个需要保留的文本字段使用库 code.google.com/p/go-charset/ 作为以下内容。

I was trying to use the library code.google.com/p/go-charset/ as the following for each text field that I need to persist.

import (
  "bytes"
  "code.google.com/p/go-charset/charset"
  _ "code.google.com/p/go-charset/data"
  "fmt"
  "io/ioutil"
  "strings"
)

func toISO88591(utf8 string) string {
    buf := new(bytes.Buffer)

    w, err := charset.NewWriter("latin1", buf)
    if err != nil {
        panic(err)
    }
    defer w.Close()

    fmt.Fprintf(w, utf8)
    return buf.String()
}

func fromISO88591(iso88591 string) string {
    r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
    if err != nil {
        panic(err)
    }

    buf, err := ioutil.ReadAll(r)
    if err != nil {
        panic(err)
    }

    return string(buf)
}

即使我使用 toISO88591 功能,数据仍然保留在UTF-8中。我在这种转换中做错了什么?

The problem is that the data is still persisted in UTF-8 even if I use the function toISO88591. I am doing something wrong in this conversion?

我的数据库是MySQL,我使用 github.com/go-sql-driver/mysql

My database is a MySQL, and I'm using the github.com/go-sql-driver/mysql driver with the following connection parameters:

<user>:<password>@tcp(<host>:<port>)/<database>?collation=latin1_general_ci

p>

推荐答案


package charset

import "code.google.com/p/go-charset/charset" 

func NewWriter

func NewWriter(charset string, w io.Writer) (io.WriteCloser, error)

NewWriter 返回一个新的 WriteCloser 写入 w 。它将
的UTF-8文本写入命名为
set的文本的 w 中写入。需要 Close 才能将任何剩余的部分
翻译字符清除到输出中。

NewWriter returns a new WriteCloser writing to w. It converts writes of UTF-8 text into writes on w of text in the named character set. The Close is necessary to flush any remaining partially translated characters to the output.






我会按照指示操​​作: Close 是必要的,以清除任何剩余的部分
翻译字符输出。例如,


I would follow the instructions: "The Close is necessary to flush any remaining partially translated characters to the output." For example,

package main

import (
    "bytes"
    "code.google.com/p/go-charset/charset"
    _ "code.google.com/p/go-charset/data"
    "fmt"
    "io/ioutil"
    "strings"
)

func toISO88591(utf8 string) (string, error) {
    buf := new(bytes.Buffer)
    w, err := charset.NewWriter("latin1", buf)
    if err != nil {
        return "", err
    }
    fmt.Fprintf(w, utf8)
    w.Close()
    return buf.String(), nil
}

func fromISO88591(iso88591 string) (string, error) {
    r, err := charset.NewReader("latin1", strings.NewReader(iso88591))
    if err != nil {
        return "", err
    }
    buf, err := ioutil.ReadAll(r)
    if err != nil {
        return "", err
    }
    return string(buf), nil
}

func main() {
    utfi := "£5 for Peppé"
    fmt.Printf("%q\n", utfi)
    iso, err := toISO88591(utfi)
    if err != nil {
        fmt.Println(err)
    }
    fmt.Printf("%q\n", iso)
    utfo, err := fromISO88591(iso)
    if err != nil {
        fmt.Println(err)
    }
    fmt.Printf("%q\n", utfo)
    fmt.Println(utfi == utfo)
}

输出:

"£5 for Peppé"
"\xa35 for Pepp\xe9"
"£5 for Peppé"
true

这篇关于GoLang - 坚持使用ISO-8859-1字符集的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持IT屋!

查看全文
登录 关闭
扫码关注1秒登录
发送“验证码”获取 | 15天全站免登陆