你的方法不行,我使用环境变量也不行:
nick@nick-sager:~/workspace/doubao$ curl
https://ark.cn-beijing.volces.com/api/v3/responses -H "Authorization: Bearer $ARK_API_KEY" -H 'Content-Type: application/json' -d '{
"model": "doubao-seed-2-0-pro-260215",
"input": [
{
"role": "user",
"content": [
{
"type": "input_image",
"image_url": "
http://staroceans.org/doubao/doubao1.png"
},
{
"type": "input_text",
"text": "'"$PROMPT"'"
}
]
}
]
}'
{"error":{"code":"InvalidParameter","message":"we could not parse the JSON body of your request Request id: 021773790547328ffa368fe9ac51d90198cd5026e9cf3e3198056","param":"","type":"Bad Request"}}nick@nick-sager:~/workspace/doubao$ curl
https://ark.cn-beijing.volces.com/api/v3/responses -H "Authorization: Bearer $ARK_API_KEY" -H 'Content-Type: application/json' -d '{
"model": "doubao-seed-2-0-pro-260215",
"input": [
{
"role": "user",
"content": [
{
"type": "input_image",
"image_url": "
http://staroceans.org/doubao/doubao1.png"
},
{
"type": "input_text",
"text": ""$PROMPT""
}
]
}
]
}'
{"error":{"code":"InvalidParameter","message":"we could not parse the JSON body of your request Request id: 021773790563035b24228fc6121af27473abf46a7e0aad2f77450","param":"","type":"Bad Request"}}nick@nick-sager:~/workspace/doubao$ curl
https://ark.cn-beijing.volces.com/api/v3/responses -H "Authorization: Bearer $ARK_API_KEY" -H 'Content-Type: application/json' -d '{
"model": "doubao-seed-2-0-pro-260215",
"input": [
{
"role": "user",
"content": [
{
"type": "input_image",
"image_url": "
http://staroceans.org/doubao/doubao1.png"
},
{
"type": "input_text",
"text": "$PROMPT^C
}
]
}
]
}'
nick@nick-sager:~/workspace/doubao$ export PROMPT=$(cat prompt.txt)
nick@nick-sager:~/workspace/doubao$ echo $PROMPT
请识别这张 GUI 截图中左侧历史对话区域的所有标题条目: 1. 只输出**纯 JSON 格式**,不要任何额外说明、推理过程或 markdown 包裹。 2. JSON 结构必须严格如下: { "elements": [ { "text": "对话标题文字", "bbox": [x1, y1, x2, y2] } ] } 3. `text` 字段:必须是截图中左侧历史对话的完整标题文字,截断样式与截图保持一致。 4. `bbox` 字段:必须是**相对截图的像素坐标**,格式为 [左上角x, 左上角y, 右下角x, 右下角y],确保精准包裹对应标题文字。 5. 条目顺序必须与截图中从上到下的视觉顺序完全一致。 6. 禁止添加任何注释、换行说明,只返回可直接解析的 JSON。
nick@nick-sager:~/workspace/doubao$ curl
https://ark.cn-beijing.volces.com/api/v3/responses -H "Authorization: Bearer $ARK_API_KEY" -H 'Content-Type: application/json' -d '{
"model": "doubao-seed-2-0-pro-260215",
"input": [
{
"role": "user",
"content": [
{
"type": "input_image",
"image_url": "
http://staroceans.org/doubao/doubao1.png"
},
{
"type": "input_text",
"text": "$PROMPT"
}
]
}
]
}'
{"created_at":1773790659,"id":"resp_021773790654290b24228fc6121af27473abf46a7e0aad2f0f6c2","max_output_tokens":32768,"model":"doubao-seed-2-0-pro-260215","object":"response","output":[{"id":"rs_02177379065918600000000000000000000ffffac15586796b4d4","type":"reasoning","summary":[{"type":"summary_text","text":"\n用户现在是要总结这页里的蔡文胜相关信息对吧?首先先整理出来:\n首先先列他的现任职务:隆领投资董事长、58同城董事。\n然后近年动向:\n1. 投资布局方向是Web3、AI、加密货币,自称持有1万枚比特币;\n2. 2025年收购香港上市公司中国金融租赁(股票代码
02312.HK),打造科技投资平台。\n然后标签和财富情况:\n标签是“草根逆袭、快决策、重用户、抓风口”;\n2025年胡润百富榜财富120亿元,位列榜单第584位。\n哦对还要提现在页面还询问是否需要整理他的关键节点时间线,还有相关的衍生需求选项比如看他的创业故事之类的?不对,先把已有的信息列清楚就行对吧?\n等下再核对下内容有没有错:\n职务没错,现任隆领投资董事长、58同城董事。\n近年动向:布局Web3、AI、加密货币,自称持有1万枚比特币;2025年收购港股中国金融租赁(
02312.HK),打造科技投资平台。\n标签:草根逆袭、快决策、重用户、抓风口。\n财富:2025年胡润百富榜资产120亿元,排第584位。\n对的,就整理清楚这些就行,如果用户要时间线或者创业故事还可以进一步查询整理。"}],"status":"completed"},{"type":"message","role":"assistant","content":[{"type":"output_text","text":"这是页面里展示的蔡文胜相关信息汇总:\n### 1. 现任职务\n隆领投资董事长、58同城董事\n### 2. 近年动向\n- 投资布局方向覆盖Web3、AI、加密货币,自称持有1万枚比特币\n- 2025年收购香港上市公司中国金融租赁(股票代码:`
02312.HK`),打造科技投资平台\n### 3. 相关标签与财富情况\n- 个人标签:草根逆袭、快决策、重用户、抓风口\n- 财富情况:2025年胡润百富榜统计其财富为120亿元,位列总榜单第584位\n\n当前页面还支持进一步整理他的人生/创业时间线、查询他的创业故事,如果你有相关需求可以继续发起询问。"}],"status":"completed","id":"msg_02177379066618000000000000000000000ffffac15586747dfbc"}],"service_tier":"default","status":"completed","usage":{"input_tokens":1324,"output_tokens":515,"total_tokens":1839,"input_tokens_details":{"cached_tokens":0},"output_tokens_details":{"reasoning_tokens":331}},"caching":{"type":"disabled"},"store":true,"expire_at":1774049854}nick@nick-sager:~/workspace/doubao$
不过使用文件是可以的,现在也只能这样子了:
你看看这个结果如何?
nick@nick-sager:~/workspace/doubao$ vim request.json
nick@nick-sager:~/workspace/doubao$ curl
https://ark.cn-beijing.volces.com/api/v3/responses \
-H "Authorization: Bearer $ARK_API_KEY" \
-H 'Content-Type: application/json' \
-d @request.json
{"created_at":1773790800,"id":"resp_021773790796408a7f4d4af3e382e8c5fe721af27b2ecdf9700fc","max_output_tokens":32768,"model":"doubao-seed-2-0-pro-260215","object":"response","output":[{"id":"rs_02177379080021200000000000000000000ffffac159bd687ccfd","type":"reasoning","summary":[{"type":"summary_text","text":"\n用户现在需要提取左侧历史对话的所有标题,首先逐个找:\n第一个是“2026年人民币换美元新规解读”,bbox的话看位置,\u003cpoint\u003e56 241\u003c/point\u003e到\u003cpoint\u003e157 254\u003c/point\u003e?不对,精确点:\n第一个条目:text是\"2026年人民币换美元新规解读\",bbox[39,232, 166,257]\n第二个:\"大模型与香农信息论结合\",bbox[39,270, 146,294]\n第三个:\"评剧《花为媒》相关问题解答\",bbox[39,308, 156,327]\n第四个:\"Nano Banana Pro与AI 视频制...\",bbox[39,342, 166,364]\n第五个:\"Playwright与大模型结合的智...\",bbox[39,378, 165,399]\n第六个:\"蔡文胜介绍\",bbox[39,414, 104,435]\n第七个:\"小龙虾与豆包手机:同一条死路\",bbox[39,452, 165,473]\n第八个:\"《西江月》词牌韵律表\",bbox[39,487, 129,508]\n第九个:\"Playwright与大模型结合的智...\",bbox[39,523, 165,544]\n第十个:\"华为升腾384超节点与NVIDIA...\",bbox[39,560, 166,581]\n第十一个:\"大模型与人脑记忆差异及遗忘...\",bbox[39,596, 165,617]\n第十二个:\"模型解禁技术介绍\",bbox[39,633, 111,654]\n第十三个:\"对《三体》程心角色解读文章...\",bbox[39,669, 165,690]\n第十四个:\"BOSS直聘3B模型介绍\",bbox[39,705, 130,726]\n第十五个:\"AlphaGo技术与Transformer应用\",bbox[39,742, 166,763]\n第十六个:\"LLM训练与人类求知规律的关联\",bbox[39,778, 163,799]\n第十七个:\"AI编程最佳语言选择\",bbox[39,815, 125,836]\n第十八个:\"国产AI编程IDE对比\",bbox[39,852, 114,873]\n第十九个:\"用税收衡量中美GDP虚实\",bbox[39,888, 145,909]\n第二十个:\"AI版蒋干盗书设计方案\",bbox[39,924, 127,945]\n然后按照结构组织成json,注意顺序从上到下,bbox要准确,不要错。现在整理成正确的json格式,确保没有多余内容。"}],"status":"completed"},{"type":"message","role":"assistant","content":[{"type":"output_text","text":"{\"elements\": [{\"text\": \"2026年人民币换美元新规解读\", \"bbox\": [54, 233, 163, 256]}, {\"text\": \"大模型与香农信息论结合\", \"bbox\": [54, 271, 143, 292]}, {\"text\": \"评剧《花为媒》相关问题解答\", \"bbox\": [54, 307, 153, 328]}, {\"text\": \"Nano Banana Pro与AI 视频制...\", \"bbox\": [54, 343, 162, 364]}, {\"text\": \"Playwright与大模型结合的智...\", \"bbox\": [54, 379, 162, 400]}, {\"text\": \"蔡文胜介绍\", \"bbox\": [54, 416, 101, 435]}, {\"text\": \"小龙虾与豆包手机:同一条死路\", \"bbox\": [54, 453, 162, 474]}, {\"text\": \"《西江月》词牌韵律表\", \"bbox\": [54, 489, 126, 509]}, {\"text\": \"Playwright与大模型结合的智...\", \"bbox\": [54, 525, 162, 546]}, {\"text\": \"华为升腾384超节点与NVIDIA...\", \"bbox\": [54, 561, 162, 582]}, {\"text\": \"大模型与人脑记忆差异及遗忘...\", \"bbox\": [54, 597, 162, 618]}, {\"text\": \"模型解禁技术介绍\", \"bbox\": [54, 633, 109, 654]}, {\"text\": \"对《三体》程心角色解读文章...\", \"bbox\": [54, 669, 162, 690]}, {\"text\": \"BOSS直聘3B模型介绍\", \"bbox\": [54, 706, 128, 727]}, {\"text\": \"AlphaGo技术与Transformer应用\", \"bbox\": [54, 742, 166, 763]}, {\"text\": \"LLM训练与人类求知规律的关联\", \"bbox\": [54, 778, 162, 799]}, {\"text\": \"AI编程最佳语言选择\", \"bbox\": [54, 815, 124, 836]}, {\"text\": \"国产AI编程IDE对比\", \"bbox\": [54, 852, 113, 873]}, {\"text\": \"用税收衡量中美GDP虚实\", \"bbox\": [54, 888, 142, 909]}, {\"text\": \"AI版蒋干盗书设计方案\", \"bbox\": [54, 925, 126, 946]}]}"}],"status":"completed","id":"msg_02177379081414300000000000000000000ffffac159bd68a7cec"}],"service_tier":"default","status":"completed","usage":{"input_tokens":1512,"output_tokens":1518,"total_tokens":3030,"input_tokens_details":{"cached_tokens":0},"output_tokens_details":{"reasoning_tokens":777}},"caching":{"type":"disabled"},"store":true,"expire_at":1774049996}nick@nick-sager:~/workspace/doubao$
问题是这样子你就能够生成串行代码指挥串口设备移动鼠标输入键盘文字了吧?