Commit Graph

115 Commits

Author SHA1 Message Date
e10defeaba [enhancement](plubin)support json format and other options in logstash doris output plugin (#27318) 2024-02-04 14:28:38 +08:00
9d3a017706 [fix](doriswriter)Fix the problem that specifying multiple loadurls does not take effect #29865 2024-01-16 18:31:27 +08:00
b443db6e3e [fix](doriswriter)Fix multiple concurrent import label duplication issues #28344 2023-12-14 12:39:22 +08:00
691889419f [fix](dbt) fix dbt doris bug that appeared in annotated models and table column comment bug (#24986) 2023-10-10 05:10:46 -05:00
fb6cb88341 [feature-wip](dbt) dbt view columns comment and view rename change (#23917)
1. dbt view columns comment 
2. view rename change,Adjust view override logic.
2023-09-11 11:15:23 +08:00
aad3eb257f update gopkg.in/yaml.v3 v3.0.0-20210107192922-496545a6307b to 3.0.0 (#24056)
There are 1 security vulnerabilities found in gopkg.in/yaml.v3 v3.0.0-20210107192922-496545a6307b

CVE-2022-28948
What did I do?
Upgrade gopkg.in/yaml.v3 from v3.0.0-20210107192922-496545a6307b to 3.0.0 for vulnerability fix

What did you expect to happen?
Ideally, no insecure libs should be used.

How can we automate the detection of these types of issues?
By using the GitHub Actions configurations provided by murphysec, we can conduct automatic code security checks in our CI pipeline.

The specification of the pull request
PR Specification from OSCS
2023-09-09 00:37:39 +08:00
d71b99b88a [fix](dbt) fix dbt doris user non-root user permission for show frintends sql (#22815) 2023-08-17 09:40:53 +08:00
acc24df10a [fix](datax)doris writer url decoder fix #22401
When the user imports data, there are some special characters in the data, which will cause the import to fail
The following error message appears:

2023-07-28 15:15:28.960  INFO 21756 --- [-interval-flush] c.a.d.p.w.d.DorisWriterManager           : Doris interval Sinking triggered: label[datax_doris_writer_7aa415e6-5a9c-4070-a699-70b4a627ae64].
2023-07-28 15:15:29.015  INFO 21756 --- [       Thread-3] c.a.d.p.w.d.DorisStreamLoadObserver      : Start to join batch data: rows[95968] bytes[3815834] label[datax_doris_writer_7aa415e6-5a9c-4070-a699-70b4a627ae64].
2023-07-28 15:15:29.038  INFO 21756 --- [       Thread-3] c.a.d.p.w.d.DorisStreamLoadObserver      : Executing stream load to: 'http://10.38.60.218:8030/api/ods_prod/ods_pexweb_online_product/_stream_load', size: '3911802'
2023-07-28 15:15:31.559  WARN 21756 --- [       Thread-3] c.a.d.p.w.d.DorisStreamLoadObserver      : Request failed with code:500
2023-07-28 15:15:31.561  INFO 21756 --- [       Thread-3] c.a.d.p.w.d.DorisStreamLoadObserver      : StreamLoad response :null
2023-07-28 15:15:31.564  WARN 21756 --- [       Thread-3] c.a.d.p.w.d.DorisWriterManager           : Failed to flush batch data to Doris, retry times = 0

java.io.IOException: Unable to flush data to Doris: unknown result status.
	at com.alibaba.datax.plugin.writer.doriswriter.DorisStreamLoadObserver.streamLoad(DorisStreamLoadObserver.java:66) ~[doriswriter-0.0.1-SNAPSHOT.jar:na]
	at com.alibaba.datax.plugin.writer.doriswriter.DorisWriterManager.asyncFlush(DorisWriterManager.java:163) [doriswriter-0.0.1-SNAPSHOT.jar:na]
	at com.alibaba.datax.plugin.writer.doriswriter.DorisWriterManager.access$000(DorisWriterManager.java:19) [doriswriter-0.0.1-SNAPSHOT.jar:na]
	at com.alibaba.datax.plugin.writer.doriswriter.DorisWriterManager$1.run(DorisWriterManager.java:134) [doriswriter-0.0.1-SNAPSHOT.jar:na]
	at java.lang.Thread.run(Thread.java:748) [na:1.8.0_221]

在fe.log日志中发现下面的错误信息:

ava.lang.IllegalArgumentException: URLDecoder: Illegal hex characters in escape (%) pattern - For input string: " l"
        at java.net.URLDecoder.decode(URLDecoder.java:194) ~[?:1.8.0_221]
        at org.springframework.http.converter.FormHttpMessageConverter.read(FormHttpMessageConverter.java:352) ~[spring-web-5.3.22.jar:5.3.22]
        at org.springframework.web.filter.FormContentFilter.parseIfNecessary(FormContentFilter.java:109) ~[spring-web-5.3.22.jar:5.3.22]
        at org.springframework.web.filter.FormContentFilter.doFilterInternal(FormContentFilter.java:88) ~[spring-web-5.3.22.jar:5.3.22]
        at org.springframework.web.filter.OncePerRequestFilter.doFilter(OncePerRequestFilter.java:117) ~[spring-web-5.3.22.jar:5.3.22]
        at org.eclipse.jetty.servlet.FilterHolder.doFilter(FilterHolder.java:193) ~[jetty-servlet-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.eclipse.jetty.servlet.ServletHandler$Chain.doFilter(ServletHandler.java:1626) ~[jetty-servlet-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.springframework.web.filter.CharacterEncodingFilter.doFilterInternal(CharacterEncodingFilter.java:201) ~[spring-web-5.3.22.jar:5.3.22]
        at org.springframework.web.filter.OncePerRequestFilter.doFilter(OncePerRequestFilter.java:117) ~[spring-web-5.3.22.jar:5.3.22]
        at org.eclipse.jetty.servlet.FilterHolder.doFilter(FilterHolder.java:193) ~[jetty-servlet-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.eclipse.jetty.servlet.ServletHandler$Chain.doFilter(ServletHandler.java:1626) ~[jetty-servlet-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java:552) ~[jetty-servlet-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.java:143) ~[jetty-server-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java:600) ~[jetty-security-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper.java:127) ~[jetty-server-9.4.48.v20220622.jar:9.4.48.v20220622]
        at org.eclipse.jetty.server.handler.ScopedHandler.nextHandle(ScopedHandle
2023-07-31 12:57:10 +08:00
32fce013f7 [feature](docs) add docs dbt-doris adapter (#22067) 2023-07-21 23:34:47 +08:00
c65781d4b8 [feature](dbt) materialization table skip the process of backup (#21993)
1.  materialization table skip the process of backup
2. materialization table to full refresh mode atomically
3. Handle the case where the `rename table` is null
2023-07-20 15:59:55 +08:00
1110ff49f3 [feature-wip](dbt) exchange table temp to target table atomically (#21931)
exchange table temp to target table atomically
2023-07-19 10:20:50 +08:00
21633908bd [feature-wip](dbt) overwrite the materialization for table and view (#21935)
overwrite the materialization for table and view
2023-07-19 10:20:29 +08:00
020f238fbc [feature](dbt) read table columns from model config file (#21831)
1、read table columns (datat_ype) from model config file
2、read table description(comment) from model config file
2023-07-17 15:45:12 +08:00
8272232e21 [fix](dbt) fix _MISSING_TYPE object is not callable bug (#21577) 2023-07-07 10:45:42 +08:00
d3eeb233c8 [fix](dbt) dbt getconfig array or string (#21345)
{{ config(unique_key='id') }}
{{ config(unique_key=['id','name']) }}
Follow the dbt habit, use string for a single column name, and use array for multiple columns
2023-07-05 11:42:38 +08:00
e510e6b0a6 [fix](dbt) dbt-doris match dbt-core==1.5 (#21392)
dbt-doris==0.2 match dbt-core==1.3 or older version

dbt-doris Subsequent version match dbt-core==1.4,1.5
2023-07-05 11:42:19 +08:00
c9c183e498 [fix](dbt) dbt seed config read (#21492) 2023-07-05 11:41:59 +08:00
e3454494b4 [improvement](extension) add new features to Mysql to doris tools (#14243) 2023-06-19 11:08:48 +08:00
e1184bf4dc [fix](dbt) dbt incremental append (#20513) 2023-06-09 01:41:33 +08:00
24dd3f19cd [feature](extension) support beats output to doris (#18448) 2023-04-16 18:17:48 +08:00
45dbd4d872 [fix](dbt)fix dbt incremental #16840
fix dbt incremental :new ideas for no rollback and support incremental data rerun .
add snapshot
use 'mysql-connector-python' mysql driver to replace 'MysqlDb' driver
2023-02-18 20:40:56 +08:00
1f07e54178 [typo](docs)fix logstash readme url (#15573) 2023-01-03 22:57:36 +08:00
bd5882d08a [fix](datax)doris writer write error (#14276)
* doris writer write error
2022-11-18 18:20:13 +08:00
f5761c658f [Fix]Fix the extension mysql_to_doris bug (#13723)
* Fix the extension mysql_to_doris  BUG

e_mysql_to_doris.sh: command error,This error causes script execution errors.  :ERROR 1103 (42000) at line 1: Incorrect table name ''.
 ` ` symbol position error

* Update extension/mysql_to_doris/bin/e_mysql_to_doris.sh


Co-authored-by: Adonis Ling <adonis0147@gmail.com>
2022-10-31 08:45:34 +08:00
6ff6a4f8b2 [fix] The special library name table name problem (#13519) 2022-10-21 22:45:35 +08:00
ed19562cb3 And ali datax unified configuration naming, modify maxBatchSize to batchSize(#13278)
And ali datax unified configuration naming, modify maxBatchSize to batchSize
2022-10-11 14:51:19 +08:00
6ee150755a [refactor](datax)Refactoring doris writer code (#13226)
* Refactoring doris writer code
2022-10-11 08:47:05 +08:00
54e6f12110 [improvement](mysql-to-doris)Fully resolved Mysql external table issues (#13229)
Fully resolved Mysql external table issues
2022-10-10 16:48:52 +08:00
29fc167548 [Bug](Datax)Fix bug that the dataxwriter will drop column when convert map to json (#13042)
* fix bug that when value is null,toJSONString will drop this key value.
2022-09-29 11:37:10 +08:00
f1811e41bc [fix](config)Update user_define_tables.sh #12542 2022-09-16 10:27:28 +08:00
ef37396b63 [fix](dbt)fix dbt incremental bug (#12280) 2022-09-04 16:40:40 +08:00
df51c78593 [fix](dbt)fix dbt run abnormal #12242 2022-09-01 12:10:48 +08:00
e48b691139 Failed to get doris_odbc_name value in mysql_to_doris, replace driver value error (#11965)
Failed to get doris_odbc_name value in mysql_to_doris, replace driver value error
2022-08-29 19:13:54 +08:00
4217b9c1d3 [feature] (dbt) add incremental and init interactive command line (#11870)
add dbt-doris incremental model and init interactive command line
2022-08-25 15:03:28 +08:00
d4749c2652 [extension](mysql-to-doris) add odbc conf and some fix (#11692) 2022-08-20 18:27:48 +08:00
27f652aaff [extension](feature)Mysql database import doris by external tables (#10905) 2022-08-11 10:18:45 +08:00
a4f9628576 [improvement](datax) improvement json import and support csv writing
1.At present, read_json_by_line and fuzzy_parse are used for json format writing, and the performance of streamload writing will decrease. It is modified to strip_outer_array and fuzzy_parse writing, and the speed is increased by about 3 times.

2.Add csv writing, the column separator is set to \x01, and the row separator is set to \x02, the performance is about 5 times higher than before
2022-08-09 11:50:24 +08:00
65dd8eb885 Update init-env.sh (#11111)
This script is missing "!"
2022-07-22 21:55:12 +08:00
468040974e [compile]Update init-env.sh (#10451) 2022-06-30 11:28:06 +08:00
67f341f44e [TLP](step-1) Remove incubator prefix (#10230)
Remove some `incubator-` prefix in source code.
The document is not modified, will be done in next PR.
2022-06-19 19:34:52 +08:00
87e3904cc6 Fix some typos for docs. (#9680) 2022-05-19 20:55:21 +08:00
c1707ca388 [feature][datax]doriswriter support timeZone (#9327) 2022-05-06 18:39:10 +08:00
7af79e1df5 [Feature][dbt] add partition_type support (#9389) 2022-05-06 15:27:34 +08:00
2c81624765 [Features]Add dbt doris adapter (#9299)
* Add dbt doris adapter

* Add licence header to each file

* Fix licence header
2022-04-29 11:40:29 +08:00
3dd6b42781 [fix](datax) Fix the problem of keyword error when importing datax (#8893) 2022-04-08 09:20:54 +08:00
3b159a9820 support doriswriter build in macos (#8330)
support doriswriter build in macos (#8330)
2022-03-07 09:53:16 +08:00
c3b010b277 [refactor] Remove flink/spark connectors (#8004)
As we discussed in dev@doris[1]
Flink/Spark connectors has been moved to new repo: https://github.com/apache/incubator-doris-connectors

[1] https://lists.apache.org/thread/hnb7bf0l6y6rzb9pr6lhxz3jjoo04skl
2022-02-10 15:00:36 +08:00
4ada8e4854 [fix](httpv2) make http v2 and v1 interface compatible (#7848)
http v2 TableSchemaAction adds the return value of aggregation_type,
and modifies the corresponding code of Flink/Spark Connector
2022-01-31 22:12:34 +08:00
4bdeef3b64 [chore][fix][doc](fe-plugin)(mysqldump) fix build auditlog plugin error (#7804)
1. fix problems when build fe_plugins
2. format
3. add docs about dump data using mysql dump
2022-01-26 09:11:23 +08:00
60c6bb4f92 [Feature][flink-connector] support flink delete option (#7457)
* Flink Connector supports delete option on Unique models
Co-authored-by: wudi <wud3@shuhaisc.com>
2022-01-23 20:24:41 +08:00