TIP: Click on subject to list as thread! ANSI
echo: fidosoft.husky
to: Wilfred van Velzen
from: mark lewis
date: 2016-10-31 22:33:26
subject: DUPES!

* Originally in bama
* Crossposted in fidosoft.husky

31 Oct 16 21:21, you wrote to me:

 WV>>> Dupes have by definition the same dupe-checksum as the original, so
 WV>>> storing them is useless!

 ml>> maybe to you but to another developer who may be doing other things
 ml>> than /just/ dupes, it may be needed or desired... don't be so square
 ml>> that your mind is closed to other possibilities and thought
 ml>> processes... perhaps someone is keeping count of duplicate MSGIDs and
 ml>> needing also to store the CRC/hash for each of them...

 WV> If the CRC/hash is the same, because it's a dupe, you don't have to
 WV> store it again.

like i said before, that depends on the database format...

 WV> If you want to keep count of duplicate MSGID's you store those, or
 WV> just a count of them. That's got little to do with dupe hash's...

they can easily be stored and counted in the same database... who are you
or i to say that that is wrong? ;)

 ml>> example: yesterday i saw two messages with exactly the same MSGID,
 ml>> header, time stamps, and what looks to be the exact same message
 ml>> body... the seenbys and paths were different yet HPT with its
 ml>> HASH+MSGID dupe checking missed seeing the second one as a dupe...
 ml>> both were back-to-back in my message base so it was easy to flip back
 ml>> and forth between them to try to see any differences... golded+'s
 ml>> [I]nfo showed that they were virtually identical but there looked to
 ml>> be one space character immediately after the MSGID that was in one
 ml>> post and not the other... now, get this, both MSGIDs, even though
 ml>> they are the same, are in the dupe database but with different
 ml>> hashes...

 WV> I fail to see what's your point here. They weren't dupes according to
 WV> hpt, so the different crc/hash's were stored.

the point is that they ARE dupes and should have been detected as dupes...
someone's formula is incorrect... it reminds me of a mailer/tosser combo
that takes the header plus the first 40 bytes to calculate their hash
with... software developers need to know about this so they can take steps
to put the data in the proper place so this particular software will see
the data and catch dupes as needed... this means to place the MSGID at the
top or just after the AREA tag... but see? this is the type of knowledge
that is gained after years of working with FTN software and being aware of
how things were done in the past... not assuming that everything is done
the same everywhere... especially since many developers are now gone and
their knowlegede is no longer first or second hand... even with new
developers having taken over older software and the details of the
operation not having been transmitted along with the actual code base...
the new maintainers have to figure it out for themselves or be told by
someone that knows these details... reading the code is one thing...
understanding it and why it was done a certain way is another...

)\/(ark

Always Mount a Scratch Monkey
Do you manage your own servers? If you are not running an IDS/IPS yer doin'
it wrong...
... I'm not an asshole. I'm a hemorrhoid. I irritate assholes.
---
* Origin: (1:3634/12.73)
SEEN-BY: 3/50 16/101 34/999 90/1 116/18 123/500 128/187 140/1 218/700 222/2
SEEN-BY: 230/150 240/1120 249/303 250/1 261/38 100 266/404 267/155 280/464
SEEN-BY: 280/1027 282/1031 1056 292/907 908 320/219 340/400 393/68 396/45
SEEN-BY: 633/267 280 640/384 712/550 848 770/1 801/161 189 2320/100
@PATH: 3634/12 123/500 261/38 712/848 633/267

SOURCE: echomail via fidonet.ozzmosis.com

Email questions or comments to sysop@ipingthereforeiam.com
All parts of this website painstakingly hand-crafted in the U.S.A.!
IPTIA BBS/MUD/Terminal/Game Server List, © 2025 IPTIA Consulting™.