|
|
Welcome to the Invelos forums. Please read the forum
rules before posting.
Read access to our public forums is open to everyone. To post messages, a free
registration is required.
If you have an Invelos account, sign in to post.
|
|
|
|
Invelos Forums->General: General Discussion |
Page:
1 2 3 4 5 Previous Next
|
Creating my own movie database with titles/cast/crew and everything |
|
|
|
Author |
Message |
Registered: March 20, 2007 | Reputation: | Posts: 2,853 |
| Posted: | | | | Quoting movie_madness: Quote: Quoting scotthm:
Quote: Really? It includes mine. But you had to enter all the cast and crew and all other movie details yourself, didn't you? Even if you create a custom media type for VHS, and even though DVDP's database already has the movie (for the DVD and BD editions), but if you enter a VHS for that movie, you still have to enter all cast and crew and everything else yourself. You can't "carry over" movie details from one medium to another. Sure you can "clone" a DVD in the database into a custom VHS profile in your database. No problem. Quote: By importing IMDb's data, I finished cataloguing thousands of my movies in a matter of days. Now tell me how long you spent on yours. You can easily import IMDB data into DVD Profiler too as long as you keep all the data local and don't contribute it to Invelos' online database. --------------- |
| Registered: August 23, 2008 | Reputation: | Posts: 1,656 |
| Posted: | | | | Quoting movie_madness: Quote: DVD Profiler is nice and all but if you're like me, you also have laserdiscs, VHS, Video CDs, DVR recordings, and online movie downloads, and DVD Profiler can't include them. I have a considerable amount of movies not on either DVD or Blu-ray. And I have tons of TV shows that will never ever be on any home video format at all (Oscar shows, rare TCM movies, and the likes). In my case, DVD Profiler just seems woefully inadequate to catalog my collection. So I decided to take matters into my own hands. It had to involve IMDb, since it is the only possible place for me get info on any title conceivable.
If you have programming skills then you must have thought of doing something like this, or may have already done it. First, I installed the Python programming language. Then I installed IMDbPY, a set of Python scripts that can retrieve data from IMDb. I modified the scripts to suit my particular needs. I created a text file of all my movie titles in my collection. I imported the file to a Python script, and got a list of IMDb movie IDs (e.g. 0157059) in return. I used the movie IDs and ran another script, and got a list of cast and crew of all those movies. I transferred all the IMDb data I got to my choice of database program (in my case, Microsoft Access). Lastly, I wrote a few vbscript web pages that connects to the Access database. The webpages let me search for names and return the info I want.
I had thought this project would take me ages, but it didn't! I have 5000+ movies and TV shows in my collection and it only took me a week and half to get the necessary IMDb data for about 4000 shows into my database. And the process was not entirely automatic either - I had to look up IMDb movie ID one by one. But IMDbPY performed pretty speedily. It took only 1-2 seconds to look up an ID. It took just less than an hour to retrieve the complete cast and crew of a THOUSAND movies.
Of course, I had looked for third-party software that can import IMDb data but none worked as I had wanted, mostly because they retrieved incomplete data. With IMDbPY, I was able to retrieve COMPLETE cast and crew, complete everything, just like you see them on the website.
Has anyone here actually done this?? Now I'm able to say that I have. Nice work! Sounds pretty sweet. | | | Reviewer, HorrorTalk.com
"I also refuse to document CLT results and I pay my bills to avoid going to court." - Sam, keeping it real, yo. |
| Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | Quoting SpikyCactus: Quote: Quoting TheMadMartian:
Quote: Quoting movie_madness:
Quote: DVD Profiler is nice and all but if you're like me, you also have laserdiscs, VHS, Video CDs, DVR recordings, and online movie downloads, and DVD Profiler can't include them. Actually, DVD Profiler can, it's called custom media types. Using those, you can track anything you want...even wine.
I've only got about 15 bottles of wine and I tend to drink them anyway, so I'm not sure it's worth bothering.
Can you use it for cacti? I only have three cacti that are all descendents of the same cactus so I don't have a collection for them either. But I do actually profile my books and my comics with DVDP. It's not perfect but I can enter authors, pencillers, genres, now even colour (mangas are usually b/w) and I have cover scans. It suits my needs and that's all that counts. | | | Karsten DVD Collectors Online
|
| Registered: March 14, 2007 | Posts: 2,337 |
| Posted: | | | | Much easier and better way would have been using Custom Media type and plugin that imports Cast & Crew data from IMDB. IMDB on the other hand doesn't have episode level Cast & Crew info for TV-series'. Only disadvantage would have been that if you have several copies of the same film, editing Cast & Crew woud require some manual copying. |
| Registered: March 31, 2007 | Posts: 662 |
| Posted: | | | | Quoting Kulju: Quote: IMDB on the other hand doesn't have episode level Cast & Crew info for TV-series'. Do you mean these data: Married with Children - S01E01? | | | |
| Registered: August 7, 2007 | Posts: 102 |
| Posted: | | | | IMDb does have episode-level cast & crew, but it doesn't have *season*-level cast & crew, which I actually need for my project. |
| Registered: March 14, 2007 | Reputation: | Posts: 6,747 |
| Posted: | | | | Quoting movie_madness: Quote: IMDb does have episode-level cast & crew, but it doesn't have *season*-level cast & crew, which I actually need for my project. Simple: Get all the episode infos from IMDb and check them. If a person has appeared in (let's say) >= 90% of all episodes, s/he is a season cast. Everyone else is episode cast. | | | Karsten DVD Collectors Online
|
| Registered: August 7, 2007 | Posts: 102 |
| Posted: | | | | Quoting DJ Doena: Quote: Quoting movie_madness:
Quote: IMDb does have episode-level cast & crew, but it doesn't have *season*-level cast & crew, which I actually need for my project.
Simple: Get all the episode infos from IMDb and check them. If a person has appeared in (let's say) >= 90% of all episodes, s/he is a season cast. Everyone else is episode cast. You don't want to make assumptions like that. Some actors may have appeared in early seasons and left the show later; some may have joined the show later. Not to mention, TV shows with no regulars: anthology series (Twilight Zone), documentary series, and the likes. You just have to do it the hard way and create one big table with complete season, episode, cast info: Season Episode Cast 1 1 Actor 1 1 1 Actor 2 1 1 Actor 3 1 2 Actor 1 1 2 Actor 3 1 2 Actor 4 2 1 Actor 1 2 1 Actor 2 2 1 Actor 4 2 2 Actor 1 2 2 Actor 4 etc. Then create a season-level summation: Season Cast 1 Actor 1 (2 episodes) 1 Actor 2 (1 episode) 1 Actor 3 (2 episodes) 1 Actor 4 (1 episode) 2 Actor 1 (2 episodes) 2 Actor 2 (1 episode) 2 Actor 4 (2 episode) |
| Registered: August 7, 2007 | Posts: 102 |
| Posted: | | | | Adding to what I already said about why I'm doing what I'm doing:
DVDP's database has no info whatsoever on the *bonus features* of the discs. Some bonus features are notable documentaries, featurettes, interviews, etc., and they have cast & crew too, and they certainly deserve to be catalogued. Some of these items are actually listed on IMDb, which is further motivation for me to import data from it. In my own database, I have the main movie and its bonus documentaries and featurettes listed separately, making them all searchable. Furthermore, I import cast & crew data for all these bonus featurettes. Thus, I'm able to find someone's appearances in not just the main movies, but also bonus features.
DVDP's database does not catalog the participants of audio commentaries, or any audio-based bonus features. Wouldn't it be nice to be able to search what audio supplements someone has done? This is info not even IMDb has, and in fact, it exists nowhere as far as I know. This is something a *disc* database like DVDP should take on.
For those who suggested I use plugins to import IMDb data into DVDP, I'm going to give that a try. But the fundamental problem remains: DVDP does not have a normalized movie database; it is a disc database containing redundant movie data, which would lead to greater time spent on keying in and maintaining the movie data. |
| Registered: August 7, 2007 | Posts: 102 |
| Posted: | | | | Quoting Winston Smith: Quote: As Kino said you are not the first or the last to clone IMDb. The single most inaccurate database in the World, for the reasons Kino alluded to. They have or had no standards for data entry. This is supposedly the selling point of DVDP but I'm not so sure about that. Yes, DVDP's database is handled by people who are more serious about managing a database than IMDb users. But there are FAR FAR fewer people doing this for DVDP than the people who do it for IMDb. If there are 10,000 people posting entries to a movie on IMDb, the chances are VERY good that we end up with very few errors -- even if 1000 or 3000 of those 10000 people are idiots who post entries that are way off. This is because the sheer number of times this movie is being looked at and worked on would make it highly likely that a very high percentage of errors would be corrected when it's said and done. I think this is the reason why I rarely see errors in the IMDb's cast and crew. I may see *incomplete* cast and crew. But I rarely see outright errors. One outright error I notice is the name of the actress who plays Josef K's cousin in Orson Welles' "The Trial". IMDb has her name as Naydra Shore, when on-screen credits show "Maydra Shore" as the name. I have a copy of the movie script and it also shows "Maydra Shore". This shows how obscure an entry has to be to produce an outright error. |
| Registered: March 14, 2007 | Posts: 2,337 |
| Posted: | | | | Quoting StaNDarD: Quote: Quoting Kulju:
Quote: IMDB on the other hand doesn't have episode level Cast & Crew info for TV-series'. Do you mean these data: Married with Children - S01E01? I stand corrected, thanks. |
| Registered: August 7, 2007 | Posts: 102 |
| Posted: | | | | Just want to digress a little bit on how a very high number of half-reliable data would often excel over a much lower number of more reliable data...
People used to say speech recognition and handwriting recognition couldn't be done. They said if we humans couldn't even recognize somebody's handwriting or speech half the time ourselves, then how could a computer? It turns out a computer has one capability humans don't have: the ability to accept and process a ginormous amount of data. Let's say we want a computer to recognize the handwriting of the letter "L". What do we do? We feed thousands and thousands of handwriting samples of the "L" into the computer. Some people write the "L" with a loop, some L's are slanted left and some right, some L's are just a vertical line, some are printed, etc. etc., plus every other conceivable way of writing an L. We feed all these into the computer, and with a good algorithm, it CAN determine if ANY piece of handwriting is an L or not with a VERY HIGH degree of certainly. I think anyone who has used a smart device's handwriting feature would notice this. Plus, the growing speed of computer processors (CPUs) over the years have helped to make this goal a reality. So there is why the SHEER number of samples, such as those possible on IMDb, *CAN* often transcend themselves into something previous thought not possible. To look at it in extreme closeup: let's say IMDb user #1 posts in the quote section that Bogart says "Play it again, Sam" in the movie "Casablanca." User #2 amends it with "Play it again." User #3 thinks, no, he does say "Play it again, Sam". User #4 posts "Play it, Sam." And so on and so forth. Now what is the probablity that by the 10,000 poster, that we would have the correct quote posted? Very high. |
| Registered: March 14, 2007 | Posts: 5,734 |
| Posted: | | | | Quoting movie_madness: Quote: DVD Profiler is nice and all I agree. | | | Don't confuse while the film is playing with when the film is played. [Ken Cole, DVD Profiler Architect] |
| Registered: March 13, 2007 | Reputation: | Posts: 13,203 |
| Posted: | | | | Quoting movie_madness: Quote: DVDP's database has no info whatsoever on the *bonus features* of the discs. Some bonus features are notable documentaries, featurettes, interviews, etc., and they have cast & crew too, and they certainly deserve to be catalogued. Quote: DVDP's database does not catalog the participants of audio commentaries, or any audio-based bonus features. Wouldn't it be nice to be able to search what audio supplements someone has done? Whether or not something deserves to be catalogued or is 'nice' is in the eye of the beholder. Speaking for myself, I can live without either of those...especially audio commentaries as I never listen to them. | | | No dictator, no invader can hold an imprisoned population by force of arms forever. There is no greater power in the universe than the need for freedom. Against this power, governments and tyrants and armies cannot stand. The Centauri learned this lesson once. We will teach it to them again. Though it take a thousand years, we will be free. - Citizen G'Kar |
| Registered: August 7, 2007 | Posts: 102 |
| Posted: | | | | Quoting TheMadMartian: Quote: Quoting movie_madness:
Quote: DVDP's database has no info whatsoever on the *bonus features* of the discs. Some bonus features are notable documentaries, featurettes, interviews, etc., and they have cast & crew too, and they certainly deserve to be catalogued.
Quote: DVDP's database does not catalog the participants of audio commentaries, or any audio-based bonus features. Wouldn't it be nice to be able to search what audio supplements someone has done? Whether or not something deserves to be catalogued or is 'nice' is in the eye of the beholder. Speaking for myself, I can live without either of those...especially audio commentaries as I never listen to them. Some discs have very good supplements so you're missing out of them if you never look or listen to them, especially when you've already paid for them. E.g., the "Gone with the Wind" Blu-ray has one of the best making-of documentaries ever made, the 2-hour 1988 film "The Making of a Legend: Gone with the Wind", which has a long list of credits because many people, some of whom have since passed on, appear in it. You don't think it's worthy to catalog this? Let's say you search your DVDP for something Susan Hayward appears in. You likely won't be able to find out that she actually appears in "The Making of a Legend: Gone with the Wind". There is an audition footage of her in it. Wouldn't you be interested in knowing that?? I bet some people probably own "Gone with the Wind" blu-ray and don't even know it, especially when there's a huge amount of credits involved. THAT'S the kind of the situation I don't want to be in. I want to KNOW as much about my collection as I can. This is not something in the "eye of the beholder". I would think this is something we ALL want to do since we already PAID for the content we own. |
| Registered: March 13, 2007 | Reputation: | Posts: 3,321 |
| Posted: | | | | First of all, nicely done. I wouldn't do it to my database, but we're not talking about mine. So well done there. Second of all, as long as you don't try to contribute this data, you're just fine. It's YOUR database. You should set it up in a manner that works best for you. If IMDB data is what you want, then you're in great shape. Third, next time you should start here. Your Python solution sounds very clever and was probably the best route in this particular case. But we do have some great user-created tools that you could possibly use to your advantage next time around. We can also assist with other things too that may save you time if you're not as familiar with the program (though it sounds like you have a solid grasp at this point!) Finally, IMDB is on-par with Satan for most users here. Trying to convince them that IMDB data doesn't come from the pits of Hell is a pointless exercise. But hey, if you like banging your head against the wall, go for it! | | | Get the CSVExport and Database Query plug-ins here. Create fake parent profiles to organize your collection. |
|
|
Invelos Forums->General: General Discussion |
Page:
1 2 3 4 5 Previous Next
|
|
|
|
|
|
|
|
|