当前位置: 首页 > news >正文

合肥网站制作费用网站制作与网站建设实际报告

合肥网站制作费用,网站制作与网站建设实际报告,百度做网站推广,中国工程建设监理网站背景 在笔者应用大模型的场景中#xff0c;对话模型(即大模型-chat系列)通常具有比较重要的地位#xff0c;我们通常基于与大模型进行对话来获取我们希望理解的知识。然而大模型对话是依据何种数据格式来进行训练的#xff0c;他们的数据为什么这么来进行组织#xff0c;本…背景 在笔者应用大模型的场景中对话模型(即大模型-chat系列)通常具有比较重要的地位我们通常基于与大模型进行对话来获取我们希望理解的知识。然而大模型对话是依据何种数据格式来进行训练的他们的数据为什么这么来进行组织本篇文章将进行总结。 Chat Markup Language Chat Markup Language (CML) 是一种用于描述对话结构的标记语言。它可以帮助大模型和 AI 助手之间的对话更加结构化和清晰。CML 可以描述对话中的各种元素例如对话的开始和结束、用户和 AI 助手的发言、对话中的问题和回答等等。使用 CML 可以使得对话的处理更加方便和高效同时也可以提高对话的可读性和可维护性。 DeepMind的相关研究指出相关研究指出LLM可以通过选取合适的prompt)来转化为对话代理。这些文本提示通常包含一种所谓的“系统”信息来定义 LLM 的角色以及一系列人机对话的示例。 对数话数据格式 一种简单对话数据的构建格式是单纯的把系统信息和角色信息插入到每一个训练样本中然后在对话用序列结尾的token(如)分隔开。如下所示 Below is a dialogue between a human and AI assistant ...Human: Is it possible to imagine a society without law? Assistant: It is difficult to imagine ... Human: It seems like you ... Assistant: You are correct ... Human: Yeah, but laws are complicated .. EOS这种简单对话数据构建方法可能会导致对话推理过程中生成不必要的对话轮次因此需要进行改进。一种更好的结构化方法是ChatML它对每个对话轮次进行包装并使用预定义的特殊Token来表示询问或回答的角色。这种方法可以更好地区分对话中不同角色的发言并且可以更准确地捕捉对话的语境和上下文。相比于简单的插入系统信息和角色信息的方法ChatML更加灵活和可扩展可以适应不同类型的对话场景和任务。 ChatGPT的ChatML 根据OpenAI ChatML V0将每个角色(system,user,assistant)的文本按照如下进行拼接 |im_start|system You are ChatGPT, a large language model trained by OpenAI. Answer as concisely as possible. Knowledge cutoff: 2021-09-01 Current date: 2023-03-01|im_end| |im_start|user How are you|im_end| |im_start|assistant I am doing well!|im_end| |im_start|user How are you now?|im_end|其中|im_start|、|im_end|是special token进行tokenizer encode的时候他们会被编码成对应的单个token id他们专门用来表示一个角色说话的start和end。 千问-chat版本也是用的这种数据构建方法。https://arxiv.org/pdf/2309.16609.pdf 另一种的ChatML 我们使用一些特殊的标记来表示不同的信息类型 |system|表示系统信息的开始用于描述聊天机器人的身份和角色。|user|表示用户说出的话语。|assistant|表示机器人说出的话语。|end|表示话语或系统信息的结尾。 案例 对话数据 sample{messages: [{content: Is it possible to imagine a society without law?,role: user,},{content: It is difficult to imagine a society that is able to be maintained without any semblance of Law. Laws exists for the purpose of maintaining society and past human behavior suggests that there would people that have no interest in promoting social cohesion and norms so its unlikely that complex social and economic systems could develop without a legal system.,role: assistant,},{content: It seems like you consider the absence of law equal to the absence of anything that could guide the behaviour of the individual, however there are many other such things, like religion, conscience, tradition to mention few important patterns which are known to have crucial influence on society even today, but also we could imagine more, like universal love if everyone suddenly became awaken in some spiritual sense, or simply just having an omnipotent leader who maintains a certain order without ever codifying the principles in law. Is it still difficult to imagine?,role: user,},{content: You are correct that there are other factors that can guide behavior in a society and play a role in shaping individuals behavior and interactions with each other. However, even in societies where these factors are present, laws still serve an important role in maintaining social order and resolving conflicts. \n\nFor example, religious teachings may prohibit theft and murder, but laws provide a system of justice for those who violate these prohibitions. Personal morality and conscience may guide individuals to act in a certain way, but laws provide a framework for resolving disputes when conflicting moral beliefs arise. Furthermore, the other factors you mentioned, such as universal love or an omnipotent leader, are not necessarily stable or reliable guides for behavior. Universal love, while a noble ideal, may not be achievable in practice, and an omnipotent leader may be prone to corruption or abuse of power.\n\nLaws provide a crucial foundation for maintaining social order and ensuring justice. It is possible to imagine alternative systems of governance, but it is unlikely that a society without any sort of legal framework would be stable or functional in the long term.,role: assistant,},{content: Yeah, but laws are complicated. Most people cant understand them in depth. Some would argue it is almost a self-serving system which put energy into growing itself(eg.: patent trolling). I think there must be a less complex system which keeps up order in society.,role: user,},] }chatGPT chatGLM封装 im_start |im_start| im_end |im_end|def prepare_dialogue(example):system_msg Below is a dialogue between a human and an AI assistant called StarChat.prompt im_start system\n system_msg im_end \nfor message in example[messages]:if message[role] user:prompt im_start user\n message[content] im_end \nelse:prompt im_start assistant\n message[content] im_end \nreturn promptprint(prepare_dialogue(sample))另一种 chatGLM封装 system_token |assistant| user_token |user| assistant_token |assistant| end_token |end|def prepare_dialogue(example):system_msg Below is a dialogue between a human and an AI assistant called StarChat.prompt system_token \n system_msg end_token \nfor message in example[messages]:if message[role] user:prompt user_token \n message[content] end_token \nelse:prompt assistant_token \n message[content] end_token \nreturn promptprint(prepare_dialogue(sample))将特殊字符添加到tokenizer中 openAI版本 tokenizer.add_special_tokens({additional_special_tokens: [|im_start|, |im_end|]}) print(tokenizer.additional_special_tokens) tokenizer(|im_start|)另一种版本 tokenizer.add_special_tokens({additional_special_tokens: [|system|, |assistant|, |user|, |end|]})可见特殊字符被封装到单独一个 token 的 ID中 构建标签 我们可以遮盖掉来自用户话语部分的损失函数值。因为我们的模型是基于用户的话语进行训练的只被训练去预测 AI 助手说话的部分在模型推理时只需要根据用户的话回答用户。下面是一个简单的函数用于遮盖掉用户部分的标签并将所有用户部分的令牌转换为-100接下来-100将被损失函数忽略。 tokenizer.add_special_tokens({additional_special_tokens: [|system|, |assistant|, |user|, |end|]})def mask_user_labels(tokenizer, labels):user_token_id tokenizer.convert_tokens_to_ids(user_token)assistant_token_id tokenizer.convert_tokens_to_ids(assistant_token)for idx, label_id in enumerate(labels):if label_id user_token_id:current_idx idxwhile labels[current_idx]! assistant_token_id and current_idx len(labels):labels[current_idx] -100 # Ignored by the losscurrent_idx 1dialogue |user|\nHello, can you help me?|end|\n|assistant|\nSure, what can I do for you?|end|\n input_ids tokenizer(dialogue).input_ids labels input_ids.copy() mask_user_labels(tokenizer, labels) labelstokenizer.add_special_tokens({additional_special_tokens: [|im_start|, |im_end|]})def mask_user_labels(tokenizer, labels):im_start_id tokenizer.convert_tokens_to_ids(|im_start|)im_end_id tokenizer.convert_tokens_to_ids(|im_end|)user_id tokenizer.convert_tokens_to_ids(user)assitant_id tokenizer.convert_tokens_to_ids(assitant)for idx, label_id in enumerate(labels):if label_id im_start_id:if idx len(labels):if labels[idx1] user_id:current_idx idxwhile labels[current_idx1]! im_start_id:labels[current_idx] -100 # Ignored by the losslabels[current_idx1] -100 # Ignored by the losscurrent_idx 1dialogue |im_start|user\nHello, can you help me?|im_end|\n|im_start|assistant\nSure, what can I do for you?|im_end|\n input_ids tokenizer(dialogue).input_ids labels input_ids.copy() mask_user_labels(tokenizer, labels) labels所有用户输入的ID都被遮蔽了。在微调阶段这些特殊的标记将学习到它们自己特定的嵌入。
http://www.dnsts.com.cn/news/4638.html

相关文章:

  • 网站建设服安徽建设工程信息网监理查询
  • 网站教人做核能灯旧手机服务器wordpress
  • 精美的微网站能够做冶金工程毕业设计的网站
  • 网站实现功能网页游戏代理平台
  • qq空间认证的网站后台根目录中小企业erp软件排名
  • 天津网站建设索王道下拉房产中介如何找客源
  • 可以看所有网站的浏览器佛山关键词网站排名
  • 代码做网站图片怎么插深圳博大建设公司
  • 响应式 网站 开发价格低英语翻译
  • vue做网站前台网站建设技术可行性
  • 电子书网站用dz还是wordpresswordpress是啥东西
  • 四川建设厅招投标官方网站wordpress新闻墙插件
  • 企业网站网页布局中国网络营销传播网
  • phpcms 网站访问统计哪个网站做废旧好
  • 网站做的不满意网站的设计流程有哪些步骤
  • 重庆建网站推广价格官网优化公司
  • 网站排名优化软件小程序商店制作教程
  • 开题报告风景区网站开发微信公众号人工服务电话
  • 中国小康建设网 官方网站设计师常备设计网站大全
  • 张家港市规划建设网站美食网站设计欣赏
  • 秦皇岛工程建设信息网站网易云wordpress代码
  • 巩义网站建设微商分销系统开发
  • 天津网站建设是什么企业推广软件有哪些
  • 做微信封面的网站巴彦淖尔市 网站建设
  • 网站推广主要方法请人做网站要多少
  • 企业网站需要在电信做哪些备案清溪网站建设公司
  • 肥城网站网站建设无代码建站
  • 网站推广初期目标营销网络地图
  • 豆角网是哪个网站开发的自己弄个网站
  • php网站建设网站wordpress5.2.2