Inference Sign

About 50 results

Open links in new tab

Any time

zhihu.com
https://www.zhihu.com › question
为什么 2024 年以后 MMDiT 模块成为了大规模文生视频或者文生图片 …
也可能是我的偏见。但是似乎SD3 paper发表以后很多开源工作/技术报告都不约而同的使用了这个架构，抛弃了…
zhihu.com
https://www.zhihu.com › question
机器学习中Inference 和predict的区别是什么? - 知乎
Inference: You want to understand how ozone levels are influenced by temperature, solar radiation, and wind. Since you assume that the residuals are normally distributed, you use a linear regression model.
zhihu.com
https://www.zhihu.com › question
LLM的pad策略，为啥训练时是right，预测是left？ - 知乎
Dec 10, 2024 · 上面这俩在训练时是等效的。关键还是 padding 方向和 ignore_label 的设置方式要匹配。 position_ids 的影响也不大，目前像 Hugging Face 这种库可以自行处理。如下例中是 batch size 为 2 …
zhihu.com
https://www.zhihu.com › question
PyTorch如何量化模型（int8）并使用GPU（训练/Inference）？
或者是否可以通过将PyTorch模型转化成TensorRT进行int8的GPU Inference?
zhihu.com
https://www.zhihu.com › question
如何看待DeepSeek发布的新模型DeepSeek-Math-V2？ - 知乎
论文中最heavy的模式（能拿金牌的模式）是64证明——64* 64验证——16迭代，假设每一步是10k token，这样一道题就要消耗大约10亿的inference token，在DSA下成本大概是一千多块钱。如果不 …
zhihu.com
https://www.zhihu.com › question
求助！大家有没有因果发现，因果推断网课推荐？ - 知乎
刚好最近写了个因果推断系列文章，以下是我觉得比较好学习资料： Brady Neal的课程： Brady Neal《因果推理导论》中英字幕_哔哩哔哩_bilibili ，英文教学，但语速很慢。 2. 清华大学丁鹏教授：《 …
zhihu.com
https://www.zhihu.com › question
计算机视觉入门书？ - 知乎
虽然可能答偏题，但是就入门来说，还是想提一下这个神奇的网站 Annotated Computer Vision Bibliography: Table of Contents Keith Price老爷子从1994年开始做了这个索引，涵盖了所有计算机视 …
zhihu.com
https://www.zhihu.com › question › answers › updated
机器学习中Inference 和predict的区别是什么? - 知乎
Inference in deep learning: More specifically, the trained neural network is put to work out in the digital world using what it has learned — to recognize images, spoken words, a blood disease, predict the …
zhihu.com
https://www.zhihu.com › question
GPT模型单次inference输入生成下一个token，为什么会产生kv-cache？ …
GPT模型单次inference输入生成下一个token，为什么会产生kv-cache？我听说GPT模型单次inference输入生成下一个token时，可能需要进行多次inference，并形成kv-cache，这会造成芯片负担比较大。 …
zhihu.com
https://www.zhihu.com › question
请解释下variational inference？ - 知乎
进一步地，operator variational inference (OPVI) [19] 则重新审视了这个优化目标的设计问题，提出了一个更加general的框架，把KL纳入其中。总的来说，相比前两类问题，这个问题的工作较少，毕竟我 …

Pagination
- 1
- 2
- 3
- Next