Ideas for a federated anime tracker

asudox@lemmy.world · 1 year ago

Ideas for a federated anime tracker

gomp@lemmy.ml · 1 year ago

but this most likely is against the ToS of every anime tracking website

AFAIK scraping publicly accessible websites is fine in most countries (IANAL, look into it)

asudox@lemmy.world · 1 year ago

Ok cool, so even when that is out of the way, are the images copyrighted? Because I’d like to download the images and host them on my servers.

pedroapero@lemmy.ml · edit-2 1 year ago

I had a try ad Bookwyrm, which seems similar to what you intend to build. I was disappointed because the database itself is federated, meaning it’s full of duplicated entries.

asudox@lemmy.world · edit-2 1 year ago

Correct. I thought there was some way to prevent those duplicates because of the federation, but apparently there’s no such check being done so it’s all duplicates. For that reason I won’t put any effort into making the database federated. Only threads, reviews and comments will be federated. This might change as I make it though. There are also clubs in MAL which I might copy and implement in this project and they could be federated as well.

typhoon@lemmy.world · 1 year ago

Decentralizing the database in a federated structure for anime tracking is a very good idea. Right now I’m using Anitrend that is open source but is only a interface to Anilist.

Like you pointed I think the major challenge will be to establish a solid new shows database entries police(s). Not sure how we could manage that effectively. Governance will be key but you also don’t want to be a hostage of this project for your life.

One aspect to be taken in consideration is privacy. I think a lot of people would appreciate to have access to the new tracker without having to share anything about them.

asudox@lemmy.world · edit-2 1 year ago

Oh well, no. The database won’t be decentralized. That just invites chaos. Bookwyrm did that and now there are lots of duplicates in their library, which I definitely want to avoid. The things that will be federated are animes’ forums (as communities in Lemmy for example), and those communities will have threads that will be either a forum thread or a review which others can comment on from any federated platform. Some other things might get federated in the future.

I decided to just scrape either MAL alone or multiple sources.

If you have any ideas on how the database can be decentralized while efficiently avoiding duplicates and spam, please do say it.

Rimu@piefed.social · 1 year ago

You might want to discuss this in one of the communities at https://ani.social/.

muhyb@programming.dev · 1 year ago

Never go for user-submitted / approved by moderator system. I’ve been in one project like that years ago and it’s not gonna work.

I’ll make another comment once I get up.

asudox@lemmy.world · 1 year ago

Is there a good reason why? Even MAL takes user submissions.

muhyb@programming.dev · 1 year ago

It’s a slow system. MAL takes user submissions because they already have a big database and those submissions help filling the cracks. If you don’t have a database to begin with it slows things down, especially if these submissions will go by trust-based. Doing this by randoms is also very risky.

However, I’m not completely oppose to that idea because it can be helpful for some areas. My suggestion is start with a basic dog tag system, where the anime name, alternative names, status (airing, completed), season and start date, studios (also licensors and producers), age rating etc. These information needs to be scraped for the fastest way to form a quick database, they are publicly available (even on Wikipedia) so it should be fine to scrape. You can even go full Wikipedia after got only the names. User submissions could be useful for the introduction / summary parts of the titles at this stage. For only names (and basic tags), you can scrape AniDB from this list. It’s just a search query so shouldn’t be against their ToS.

You can also check Kitsu for ideas, I like their DB request system. Pretty basic but can be done differently with the power of ActivityPub.