Posts: 36
Comments: 8
Joined: 2 yr. ago

Posts in Technology @lemmy.world:

  • One Law to Rule Them All: The Iron Law of Software Performance
  • Why This Python Performance Trick Doesn’t Matter Anymore
  • Python Performance: Why 'if not list' is 2x Faster Than Using len()
  • Hardware-Aware Coding: CPU Architecture Concepts Every Developer Should Know
  • Context Switching and Performance: What Every Developer Should Know
  • How Unix Spell Ran in 64kB RAM
  • Linux Context Switching Internals: Part 1 - Process State and Memory
  • The CAP Theorem of Clustering: Why Every Algorithm Must Sacrifice Something
  • Disillusioning the Magic of the fork System Call
  • An Unreachable Hidden XKCD Easter Egg inside CPython
  • CPython's Garbage Collector and its Impact on Application Performance
  • The Pythonic Emptiness
  • A Selective Survey of Efficient Speculative Decoding Techniques for LLM Inference
  • CPython Runtime Internals: Key Data Structures & Runtime Bootstrapping
  • The Design & Implementation of the CPython Virtual Machine
  • Are Function Calls Still Slow in Python? An Analysis of Recent Optimizations in CPython
  • Two Threads, One Core: How Simultaneous Multithreading Works Under the Hood
  • CPython Garbage Collection: The Internal Mechanics and Algorithms
  • How Python Compares Floats and Ints: When Equals Isn’t Really Equal
  • A Deep Dive into the Underlying Architecture of Groq's LPU

Comments:

  • Interesting. I'm just thinking aloud to understand this.

    In this case, the models look at a short sequence of bytes in their context and can predict the next byte(s) with good accuracy, which allows efficient encoding (see the first sketch after these comments). Most of our memories are associative, i.e. we associate them with some concept/name/idea. So do you mean our brain uses the concept to predict a token, which gets decoded in the form of a memory?

  • Yes. They also mention that using such large models for compression isn't practical, because the model's size dwarfs any amount of data you might want to compress. But the result gives a good picture of how well such large models generalize, and how accurately they can predict the next tokens for image/audio data.

  • Do you mean the number of tokens in the LLM's tokenizer, or the dictionary size of the compression algorithm?

    The vocab size of the pretrained models isn't mentioned anywhere in the paper. They did, however, run an experiment measuring compression performance with tokenizers of different vocabulary sizes.

    If you meant the dictionary size of the compression algorithm, then there was no dictionary: the compression was done purely with arithmetic coding, which doesn't use one (see the second sketch below).
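
A minimal sketch (not from the paper) of why good next-byte prediction means efficient encoding: an ideal entropy coder such as arithmetic coding spends roughly -log2(p) bits on a symbol the model assigned probability p, so confident, correct predictions need far fewer than 8 bits per byte.

```python
import math

def ideal_code_length_bits(p: float) -> float:
    """Bits an ideal entropy coder (e.g. arithmetic coding) spends on a
    symbol that the predictive model assigned probability p."""
    return -math.log2(p)

# Confident, correct next-byte prediction -> far fewer than 8 bits.
print(ideal_code_length_bits(0.95))     # ~0.07 bits
# Uniform guess over 256 possible byte values -> no compression.
print(ideal_code_length_bits(1 / 256))  # 8.0 bits
```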
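
And a toy arithmetic-coding sketch showing why no dictionary is involved: the coder only narrows an interval in [0, 1) using whatever per-step probability distribution the model supplies. The fixed two-symbol `toy_model` here is a hypothetical stand-in for an LLM's next-token distribution, which would really condition on the prefix; this is an illustration, not the paper's implementation.

```python
from fractions import Fraction

def arithmetic_encode(symbols, model):
    """Narrow an interval in [0, 1) one symbol at a time, using the
    per-step probabilities from model(prefix). No dictionary is built;
    the coder only consumes the model's predictive distribution."""
    low, high = Fraction(0), Fraction(1)
    for i, sym in enumerate(symbols):
        probs = model(symbols[:i])        # predictive distribution for this step
        width = high - low
        cum = Fraction(0)
        for s, p in probs.items():        # dicts preserve insertion order
            if s == sym:
                low, high = low + width * cum, low + width * (cum + p)
                break
            cum += p
    # Any number in [low, high) identifies the whole message to a decoder
    # that shares the same model.
    return low, high

def toy_model(prefix):
    # Hypothetical stand-in for an LLM's next-token probabilities;
    # a real model would condition on the prefix.
    return {"a": Fraction(3, 4), "b": Fraction(1, 4)}

print(arithmetic_encode("aab", toy_model))  # (Fraction(27, 64), Fraction(9, 16))
```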