Commit Graph

55 Commits

Author SHA1 Message Date
takatost
3efaa713da
feat: use xinference client instead of xinference (#1339) 2023-10-13 02:46:09 -05:00
takatost
9822f687f7
fix: max tokens of OpenAI gpt-3.5-turbo-instruct to 4097 (#1338) 2023-10-13 02:07:07 -05:00
Garfield Dai
42a5b3ec17
feat: advanced prompt backend (#1301)
Co-authored-by: takatost <takatost@gmail.com>
2023-10-12 10:13:10 -05:00
takatost
cbf095465c
feat: remove llm client use (#1316) 2023-10-11 14:02:53 -05:00
takatost
2851a9f04e
feat: optimize minimax llm call (#1312) 2023-10-11 07:17:41 -05:00
takatost
c536f85b2e
fix: compatibility issues with the tongyi model. (#1310) 2023-10-11 05:16:26 -05:00
takatost
8480b0197b
fix: prompt for baichuan text generation models (#1299) 2023-10-10 13:01:18 +08:00
takatost
4ab4bcc074
feat: support openllm embedding (#1293) 2023-10-09 23:09:35 -05:00
takatost
1d4f019de4
feat: add baichuan llm support (#1294)
Co-authored-by: zxhlyh <jasonapring2015@outlook.com>
2023-10-09 23:09:26 -05:00
takatost
373e90ee6d
fix: detached model in completion thread (#1269) 2023-10-02 22:27:25 +08:00
takatost
41d4c5b424
fix: count down thread in completion db not commit (#1267) 2023-10-02 10:19:26 +08:00
takatost
8606d80c66
fix: request timeout when openai completion (#1265) 2023-10-01 16:00:23 +08:00
takatost
a31466d34e
fix: db session not commit before long llm call running (#1251) 2023-09-27 21:40:26 +08:00
takatost
d38eac959b
fix: wenxin model name invalid when llm call (#1248) 2023-09-27 16:29:13 +08:00
Garfield Dai
e409895c02
Feat/huggingface embedding support (#1211)
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
2023-09-22 13:59:02 +08:00
takatost
435f804c6f
fix: gpt-3.5-turbo-instruct context size to 8192 (#1196) 2023-09-19 02:10:22 +08:00
takatost
ae3f1ac0a9
feat: support gpt-3.5-turbo-instruct model (#1195) 2023-09-19 02:05:04 +08:00
takatost
827c97f0d3
feat: add zhipuai (#1188) 2023-09-18 17:32:31 +08:00
takatost
c8bd76cd66
fix: inference embedding validate (#1187) 2023-09-16 03:09:36 +08:00
takatost
f9082104ed
feat: add hosted moderation (#1158) 2023-09-12 10:26:12 +08:00
Jyong
642842d61b
Feat:dataset retiever resource (#1123)
Co-authored-by: jyong <jyong@dify.ai>
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
2023-09-10 15:17:43 +08:00
Joel
2d5ad0d208
feat: support optional query content (#1097)
Co-authored-by: Garfield Dai <dai.hai@foxmail.com>
2023-09-10 00:12:34 +08:00
takatost
c4d8bdc3db
fix: hf hosted inference check (#1128) 2023-09-09 00:29:48 +08:00
takatost
a7cdb745c1
feat: support spark v2 validate (#1086) 2023-09-01 20:53:32 +08:00
takatost
2eba98a465
feat: optimize anthropic connection pool (#1066) 2023-08-31 16:18:59 +08:00
takatost
417c19577a
feat: add LocalAI local embedding model support (#1021)
Co-authored-by: StyleZhang <jasonapring2015@outlook.com>
2023-08-29 22:22:02 +08:00
takatost
0796791de5
feat: hf inference endpoint stream support (#1028) 2023-08-26 19:48:34 +08:00
takatost
9ae91a2ec3
feat: optimize xinference request max token key and stop reason (#998) 2023-08-24 18:11:15 +08:00
takatost
2c30d19cbe
feat: add baichuan prompt (#985) 2023-08-24 10:22:36 +08:00
takatost
9b247fccd4
feat: adjust hf max tokens (#979) 2023-08-23 22:24:50 +08:00
takatost
a76fde3d23
feat: optimize hf inference endpoint (#975) 2023-08-23 19:47:50 +08:00
takatost
78d3aa5fcd
fix: embedding init err (#956) 2023-08-22 17:43:59 +08:00
takatost
e0a48c4972
fix: xinference chat support (#939) 2023-08-21 20:44:29 +08:00
takatost
6c832ee328
fix: remove openllm pypi package because of this package too large (#931) 2023-08-21 02:12:28 +08:00
takatost
25264e7852
feat: add xinference embedding model support (#930) 2023-08-20 19:35:07 +08:00
takatost
18dd0d569d
fix: xinference max_tokens alisa error (#929) 2023-08-20 19:12:52 +08:00
takatost
3ea8d7a019
feat: add openllm support (#928) 2023-08-20 19:04:33 +08:00
takatost
da3f10a55e
feat: server xinference support (#927) 2023-08-20 17:46:41 +08:00
takatost
95b179fb39
fix: replicate text generation model validate (#923) 2023-08-19 21:40:42 +08:00
takatost
3a0a9e2d8f
fix: embedding get price definition missing (#922) 2023-08-19 21:31:40 +08:00
takatost
0a0d63457d
feat: record price unit in messages (#919) 2023-08-19 18:51:40 +08:00
takatost
920fb6d0e1
fix: embedding price config (#918) 2023-08-19 16:54:08 +08:00
Krasus.Chen
fd0fc8f4fe
Fix/price calc (#862) 2023-08-19 16:41:35 +08:00
takatost
1c552ff23a
fix: azure embedding model credentials include base_model_name is invalid for openai sdk (#917) 2023-08-19 16:24:18 +08:00
takatost
5163dd38e5
fix: run extra model serval ex not return (#916) 2023-08-19 14:35:16 +08:00
takatost
2a27dad2fb
fix: run model serval ex not return (#915) 2023-08-19 14:16:41 +08:00
takatost
1d9cc5ca05
fix: universal chat when default model invalid (#905) 2023-08-18 16:20:42 +08:00
takatost
9adbeadeec
feat: claude paid optimize (#890) 2023-08-17 16:56:20 +08:00
takatost
f42e7d1a61
feat: add spark v2 support (#885) 2023-08-17 15:08:57 +08:00
takatost
cc52cdc2a9
Feat/add free provider apply (#829) 2023-08-14 12:44:35 +08:00