![]() |
The Testing is half empty
or half finished, depending on whether you are a pessimist or an optimist, i suppose. Looking at the wiki and other sources of discussion on the Extras-Testing validation process, there are a few things that stand out as missing and in need of implementation.
Before i cover those, though, let's review the purpose of -testing:
In short, it should be a way for apps to get thoroughly tested for basic requirements by others before allowing an app in Extras. Here are the needed changes to accomplish this in a way that is less onerous to developers and testers alike: First, a way to have two tiers of testers and criteria. The description was of a pool of "official" testers required to give some percentage of the total thumbs-up. The importance of this is twofold: it allows for thorough testing of all QA criteria, and it allows for "casual" or fan testers, with a smaller mandate, to do the majority of the tests, thus expediting testing. All that is needed for this is a process for admitting "official" testers, a mechanism in the promoter script to recognize and identify those in the test reporting page, and a revised list of QA criteria for the informal testers. The official testers could use the existing comment mechanism to report what they had checked in-depth to verify all points of the QA checklist were covered. Second, a reduced burden of testing and quarantine for updates. The present requirement for ten karma and ten days does not encourage "release early, release often". Note that deputizing ordinary users as testers via the mechanism in the first point would help alleviate this update testing problem, but ten days and ten full testing karma is too much. The current testing QA list is too confusing for ordinary users who want to help, which leaves them frustrated at the delay in releases and with no way to help. |
Re: The Testing is half empty
Yeap, the testing queue is stuck. The queue is now a popularity contest as I was afraid in fall. It is hard to say what people have actually tested, and I think only few have used QA checklist. Plus the website needs some work. Also I think that testing queue shouldn't be a place where software is brought to find out what bugs there is, software should be release quality and testing queue is for acceptance testing only. If a feature does not work, disable it and release when it is ready.
As I understand the medicine is to: 1) Retain discussion history for a package 2) For new versions there should be changelog 3) Automatically test as many things as possible on the checklist 2) Those not automatically tested should be tested atleast by one tester 4) Functionality should be tested by more than one person. I would still keep 10 people as acceptance level for now 5) Thumb down should be accompanied by a bug report When this system would work, then would be a time to fine-tune: - How to make bugfix releases go through the system faster? - How to clean up the queue from stuff which is not going through? - How many testers is right amount to make certain that functionality is ok? |
Re: The Testing is half empty
You should only make promotion form Extras-Testing into Extras harder if there are any real problems with Extras packages. As there are no big problems with Extras packages that I know of at the moment, making promotion harder will not bring any advantage. It will simply prevent packages from reaching Extras, spoiling it for everybody.
|
Re: The Testing is half empty
I agree with VRe. Also, I think more exposure is a good thing. Few people know this is possible or that it helps or know where to vote and what to test.
I'm thinking mandatory exposure to community, say, reserved forum section/threads? Complete with links? Also, a checklist template could be used to tag votes, maybe we can see later what was tested and wasn't? Maybe by whom and how so we can find and ask later how it worked? |
Re: The Testing is half empty
Hang on a moment guys. I don't really care if extra features are added but i think that is Brainstorm material.
All i am requesting up front is that two things already planned to be part of the process--official testers and reduced requirements for updates--are finished. The system as it exists in a half-finished state is not accomplishing its purpose and at the same time getting in the way. If i knew better where to make this request, i would, but posting a plea in community seemed like the best bet. |
Re: The Testing is half empty
Quote:
Actually these three problems were not difficult to find, and they were even commented during the QA process. Perhaps what happens is that the goal is difficult to achieve in practice (one person going through 10 blocker criteria, with the skills it implies). Which leads to even the more riguroous evaluators being soft here and there. I think the key feature that would make testing more accessible to power users and even fun is to split ratings by blocker criteria, as discussed some time ago. Some users will be happy checking whether the features advertized are in place. Others will follow the steps to check whether the app is optified. If at the end the remaining items are system performance and power managent, normal users will be able to assess at least if they noticed serious/noticeable problems. Of course much better if someone can run the right tools and do proper tests, but in their absence happy testers have chances to lead to happy users. |
Re: The Testing is half empty
Honestly, my experience from Diablo is that I can leave a package in Extras-devel for 3 months and get almost no feedback, and then I promote it to Extras and I get three bug reports in a day.
I suspect the same thing is happening in Fremantle. I can't be sure, my apps are stuck in -testing with sufficient karma but I guess not enough quarantine time? The ten days quarantine thing is the part I don't understand. What exactly are they waiting for? If you get the votes, your package should move onwards. |
Re: The Testing is half empty
Quote:
That is another problem. Packages were pulled only when there were serious problems floating and after contacting personally the maintainers. The Load applet for instance is still there, even if the bug is evident. Still, the responsibility of pull it off (or even ask for it) is not defined and there you have still the app getting more downloads and confusing more users. And don't get me wrong, I don't blame the developers! Bugs happen, problems happen. This is not about being harder in order to try to prevent problem, but about being more flexible so the problems can be dealt easily when they come. |
Re: The Testing is half empty
Testing attributes need to be built into the Maemo infrastructure IMO. Apps should have the right metadata wrapped around them throughout their development/deployment lifecycle, and there needs to be a Maemo-managed web-based system in place to interact with that metadata. Once that's in place, making sure testing efforts are meaningful becomes simply a matter of exposing and updating the proper App attributes at various points along the lifecycle. Right now there does not appear to be enough detail.
related to this Brainstorm: http://talk.maemo.org/showthread.php?t=38014 (sorry, Flandry, I see this was outside your intended scope) |
Re: The Testing is half empty
Quote:
Since current voting system seems to lend itself more towards a popularity contest as others have said, perhaps the quarantine period could be used as final acceptance testing by 'official' testers? I agree that a well defined process for removing a package from extras would be beneficial, but I'm not sure that alone is enough to reduce the quarantine time. |
Re: The Testing is half empty
Quote:
Quote:
So, as you might have guessed, I am very skeptical on any "improvements" anyone is going to bring to the Extras vetting process. If anything, we need less enforcement, not more. |
Re: The Testing is half empty
Who is in charge of this and what is necessary to get changes made?
Let me point it out that at no point in my initial post am i advocating a system less secure than what is in place lest the impression be given that it was a request that needs to be debated. Again, it's simply a request that the process be completely implemented as planned. Since we're kinda off in la-la hypothetical land with the discussion, let me throw at you a scenario: it is discovered that a popular app in Extras has a timebomb built into it that will do something really heinous in one more day. What do we do? First of all, we do need a demote button and it does need to be accessible to official testers. They need to be able to demote something from Extras found to be in violation of QA criteria, and able to throw testing versions back to -devel when blockers are discovered (to avoid wasting tester's time). Second of all, the system needs a built-in facility to inject a stub package into Extras that displays a customized message when installed or executed. Bonus points if app manager recognizes it as an "urgent update" and gives unusual notice of its existence. |
Re: The Testing is half empty
Quote:
Quote:
Quote:
Quote:
|
Re: The Testing is half empty
Quote:
Quote:
|
Re: The Testing is half empty
There's nothing wrong with my wallpapers. They are fair use and legal and I have not been told otherwise. They were taken down because of a misunderstanding and I'm not bothering to put them back up because, quite frankly, I'm tired of hearing this kind of crap.
It's a big problem when I use a small portion of a bigger work for my wallpapers (fair use), but it's perfectly okay for certain other members of the community to grab 100% of a copyright image and use it in their package. Awesome. I'm so done with all this. |
Re: The Testing is half empty
Quote:
Quote:
In general, I would strongly suggest against implementing "solutions" against hypothetical situations. Nothing good has ever come from it. Remember this next time you fly somewhere. |
Re: The Testing is half empty
Good, so we are agreed that this doesn't need more debate and just needs to be finished as planned. Wonderful.
BTW there's no need to rely on individuals for every bit of infrastructure. That's the whole point of open source. Would linux exist without Linus? Not per se. Would it have evolved into what it is today without him? Probably. I'm sure i don't speak for only myself when i say there are volunteers to help get this implemented as planned, i didn't come stir things up without willingness to participate where possible. |
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
Quote:
But I fear providing facilities to demote from testing->devel will just end up causing more of the 'Power User' demographic to go enable the devel repository. |
Re: The Testing is half empty
I'm responsible for the coordination of these improvements, but unfortunately Neils(X-Fade) that will implement the improvements is busy with things that have more priority at the moment. I think he will accept help if someone want to jump in and contribute some code.
|
Re: The Testing is half empty
Quote:
Quote:
Quote:
|
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
Quote:
I have a proposal: make -devel the snake lair. Regular users will be discouraged from using it, but those with the know will be urged to try it more, to compensate. The main objective would be to make sure the app doesn't kill the device or make it go crazy. Once the app is declared sane, it can go to -testing. Since there are (thanks to the above paragraph) now a lot more users, things get tested quicker and more thoroughly (law of large numbers says faults come out better on a larger testing base). And the normal extras gets the high-quality apps that should be showcased. That's not to say they can't have any bugs, but those should be minor and more like annoyances Also, +1 for official testers |
Re: The Testing is half empty
Ok would you please point out where this is at? Gitorious?
|
Re: The Testing is half empty
Re: The quarantine issue:
I think that the 10 day quarantine just slows down the whole system. It certaintly slows down me. I promoted a package to testing that had a rather serious bug in it: one of the dialogs in the settings interface would not open. It was a rather small issue, because I'm not sure that hat many people would have a need to open that dialog, but for those that did try, it would be very confusing. After 3 or 4 days into the testing process someone raised it to my attention and it was a simple bug, one line needed changing. I did this and uploaded a fixed version to -devel. Problem was, this was during a builder outage and the builder would not build it. Finally three or four days later I got the package built and went to promote it to -testing. I saw the version with the bug in it that had been sitting in -testing had 11 karma. The 10 day quarantine was going to be up soon, but should I promote it to Extras and then queue the new version into -testing? With the bug? I decided against it and threw away 9 days of quarantine and 11 positive votes for a one line change. This fixed version got the required karma in a short amount of time, but am still waiting for the quarantine to be up... (which I think is completely useless). It's bad enough you have to get 10 positive votes even for a 1 line bug fix, why delay it any further? Additionally, it says on the wiki: Quote:
So, how are any bugs going to be found in +10 karma apps if people aren't even testing them? |
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
Quote:
Whatever enhances the collaborative process between users and developers should be encouraged. Whatever discourages the collaborative process should be abandoned. Regulations should be minimized, and carefully chosen.The ultimate aim is NOT protection. The ultimate aim is fun or useful programs. |
Re: The Testing is half empty
Quote:
I started the thread to request that portions of the process that were discussed to be part of it actually be added. We don't really need ten testers; we need a thorough test of the app and 10 seemed like an arbitrarily good way to ensure that. We do't need 10 days of quarantine... " " " So, let's deputize the really active testers as "official", give them pet criteria to cover if they want, make the new requirement 3 or 2 or 1 official tester who will verify all points as assigned, and leave the other testing to power users during the quarantine period. And let's not make new versions go through the same process as first-time apps. Edit: as for the individual vs community example, it's the difference between relying on one person (xfade in this case) vs a team or community to get things done. He's "busy with other, more important things" which says to me right there that there is too much hanging over that one person's head. |
Re: The Testing is half empty
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
|
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
my personal frustrations with the process come because i promote to extras tesing to get broader feedback since most people are told not to go near devel.
but that doesn't mean i'm proposing that version to promote up. i found that the first couple of times I promoted to extras-testing people gave thums up others gave feedback. i worked on andifxed issues, made more improvements, promote the next version...lose all vote history, each time i go round the cycle the program gets better but it recieves fewer votes/interest. there is no distinction between 'try and give feedback' versus 'i think it's ready for extras please vote' |
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
i said nothing about making the thumbs down red. It is unfortunate that you can't give up the misconceptions you got from jumping in the thread late, but forunately for you we're actually both interested in the same outcome.
It does bring the thread (finally) back to the initial question, but now it appears there is even stranger discrepancies between the agreed process and the actual one. Why hasn't something as simple and useful as reducing the promotion karma to 5 been done? Changing one number in the code... |
Re: The Testing is half empty
Quote:
|
Re: The Testing is half empty
Quote:
Just fixed the link above, now points directly to the package interface code. |
Re: The Testing is half empty
Quote:
If no longer actually part of the testing plan as discussed at said IRC meeting (which transcripts would be nice to see), that section should be removed by someone who is privvy to the information. It seems the most sensible way to make sure every app is given at least a good testing (as opposed to the present system where it can be resolved entirely by popular vote), but that would mean giving those creepy weird careful tester people some specific tester status. Nobody has given a sensible counter to that including you, who just seem to be bothered by those who will apply literally a set of guidelines. |
Re: The Testing is half empty
Here is a summary of the possible improvements: http://wiki.maemo.org/Extras-testing...A_Improvements
This was not discussed yet with the maemo.org tema, and is far from final, feel free to suggest some improvements. |
Re: The Testing is half empty
Can we look at the Debian project to get some ideas?
In Maemo, there is one thing that bothers me sometimes: the overreaction of people when someone points to a link in extras-testing or extras-devel. The you-will-be-eaten-by-dragons stuff. I use Debian testing as my stable desktop. Of course there are hip-cups sometimes, but that's part of the game that I'm willing to play. What if instead of all that overreaction, people could be asked some simple question: "Do you think you can troubleshoot, re-flash your device and restore your backup if anything goes wrong?" 1) "Yes"->extras-devel 2) "Maybe, if I follow a re-flash guide"->extra-testing 3) "I don't know what you are talking about" -> stay on extras only maybe more people would show up, but they would be warned in a positive and constructive way. |
All times are GMT. The time now is 19:32. |
vBulletin® Version 3.8.8