Fix figures in design doc (#18612)
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
This commit is contained in:
@@ -142,8 +142,6 @@ title: vLLM Paged Attention
|
|||||||
|
|
||||||
<figure markdown="span">
|
<figure markdown="span">
|
||||||
{ align="center" alt="query" width="70%" }
|
{ align="center" alt="query" width="70%" }
|
||||||
<figcaption>
|
|
||||||
</figcaption>
|
|
||||||
</figure>
|
</figure>
|
||||||
|
|
||||||
- Each thread defines its own `q_ptr` which points to the assigned
|
- Each thread defines its own `q_ptr` which points to the assigned
|
||||||
@@ -153,8 +151,6 @@ title: vLLM Paged Attention
|
|||||||
|
|
||||||
<figure markdown="span">
|
<figure markdown="span">
|
||||||
{ align="center" alt="q_vecs" width="70%" }
|
{ align="center" alt="q_vecs" width="70%" }
|
||||||
<figcaption>
|
|
||||||
</figcaption>
|
|
||||||
</figure>
|
</figure>
|
||||||
|
|
||||||
```cpp
|
```cpp
|
||||||
@@ -194,8 +190,6 @@ title: vLLM Paged Attention
|
|||||||
|
|
||||||
<figure markdown="span">
|
<figure markdown="span">
|
||||||
{ align="center" alt="key" width="70%" }
|
{ align="center" alt="key" width="70%" }
|
||||||
<figcaption>
|
|
||||||
</figcaption>
|
|
||||||
</figure>
|
</figure>
|
||||||
|
|
||||||
- The diagram above illustrates the memory layout for key data. It
|
- The diagram above illustrates the memory layout for key data. It
|
||||||
@@ -211,8 +205,6 @@ title: vLLM Paged Attention
|
|||||||
|
|
||||||
<figure markdown="span">
|
<figure markdown="span">
|
||||||
{ align="center" alt="k_vecs" width="70%" }
|
{ align="center" alt="k_vecs" width="70%" }
|
||||||
<figcaption>
|
|
||||||
</figcaption>
|
|
||||||
</figure>
|
</figure>
|
||||||
|
|
||||||
```cpp
|
```cpp
|
||||||
@@ -372,20 +364,14 @@ title: vLLM Paged Attention
|
|||||||
|
|
||||||
<figure markdown="span">
|
<figure markdown="span">
|
||||||
{ align="center" alt="value" width="70%" }
|
{ align="center" alt="value" width="70%" }
|
||||||
<figcaption>
|
|
||||||
</figcaption>
|
|
||||||
</figure>
|
</figure>
|
||||||
|
|
||||||
<figure markdown="span">
|
<figure markdown="span">
|
||||||
{ align="center" alt="logits_vec" width="50%" }
|
{ align="center" alt="logits_vec" width="50%" }
|
||||||
<figcaption>
|
|
||||||
</figcaption>
|
|
||||||
</figure>
|
</figure>
|
||||||
|
|
||||||
<figure markdown="span">
|
<figure markdown="span">
|
||||||
{ align="center" alt="v_vec" width="70%" }
|
{ align="center" alt="v_vec" width="70%" }
|
||||||
<figcaption>
|
|
||||||
</figcaption>
|
|
||||||
</figure>
|
</figure>
|
||||||
|
|
||||||
- Now we need to retrieve the value data and perform dot multiplication
|
- Now we need to retrieve the value data and perform dot multiplication
|
||||||
|
|||||||
Reference in New Issue
Block a user