I have a CSV file:
"one","two","three","four","five","six","seven","eight","nine","ten"
"one","two","three","four","five \" quote \" five2","six","seven","eight","nine","ten"
"one","two","three","four","five \
five2","six","seven","eight","nine","ten"
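The first record is ordinary, the second has a field containing escaped quotes, and the third has a field containing an escaped newline. Because of that newline, the table does not import correctly. Is there another way to do this?
I found one approach, but it does not work correctly: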
CREATE EXTERNAL TABLE IF NOT EXISTS `test-general-log`.`general-log` (
  A string,
  B string,
  C string,
  D string,
  E string
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.serde2.RegexSerDe'
WITH SERDEPROPERTIES (
"input.regex" = '"([^"]*)","([^"]*)"'
) LOCATION 's3://test-general-log/exportedlogs/0baa57384-2c6e-4e016-85f0-bc58b31e6300/'
TBLPROPERTIES ('has_encrypted_data'='false');
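
One possible workaround, sketched here under assumptions rather than offered as a confirmed fix: line-oriented SerDes such as RegexSerDe see each physical line as a separate record, so the escaped newline splits the third record in two before any regex is applied. Rewriting the file so that every record fits on one physical line, before uploading it to S3, sidesteps that. The helper name `flatten_csv` and the choice to replace embedded newlines with a space are hypothetical. The sketch also assumes the file uses a backslash as its escape character, as the sample above suggests.

```python
import csv
import io


def flatten_csv(text: str, newline_token: str = " ") -> str:
    """Re-emit backslash-escaped CSV so each record occupies one physical line.

    Assumption: quotes and newlines inside fields are escaped with a backslash.
    Embedded newlines are replaced by `newline_token` (a space by default).
    """
    # Reader: treat backslash as the escape character; quotes are not doubled.
    reader = csv.reader(
        io.StringIO(text, newline=""),
        quotechar='"',
        escapechar="\\",
        doublequote=False,
    )

    out = io.StringIO(newline="")
    # Writer: quote every field and keep backslash-escaping for embedded quotes.
    writer = csv.writer(
        out,
        quoting=csv.QUOTE_ALL,
        quotechar='"',
        escapechar="\\",
        doublequote=False,
        lineterminator="\n",
    )

    for row in reader:
        # Drop carriage returns and replace line feeds inside each field.
        cleaned = [
            field.replace("\r", "").replace("\n", newline_token) for field in row
        ]
        writer.writerow(cleaned)

    return out.getvalue()
```

The function could be applied to the whole export, for example `flatten_csv(open("export.csv", newline="").read())`, where `export.csv` is a placeholder file name. The flattened output still contains backslash-escaped quotes, so the table would presumably need a SerDe that understands them, such as OpenCSVSerde with its `escapeChar` property set to a backslash, instead of a regex that excludes every quote character.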