Texas Will Use Computers To Grade Written Answers On This Year’s STAAR Tests

Keaton Peters reports via the Texas Tribune: Students sitting for their STAAR exams this week will be part of a new method of evaluating Texas schools: Their written answers on the state’s standardized tests will be graded automatically by computers. The Texas Education Agency is rolling out an “automated scoring engine” for open-ended questions on the State of Texas Assessment of Academic Readiness for reading, writing, science and social studies. The technology, which uses natural language processing technology like artificial intelligence chatbots such as GPT-4, will save the state agency about $15-20 million per year that it would otherwise have spent on hiring human scorers through a third-party contractor.

The change comes after the STAAR test, which measures students’ understanding of state-mandated core curriculum, was redesigned in 2023. The test now includes fewer multiple choice questions and more open-ended questions — known as constructed response items. After the redesign, there are six to seven times more constructed response items. “We wanted to keep as many constructed open ended responses as we can, but they take an incredible amount of time to score,” said Jose Rios, director of student assessment at the Texas Education Agency. In 2023, Rios said TEA hired about 6,000 temporary scorers, but this year, it will need under 2,000.

To develop the scoring system, the TEA gathered 3,000 responses that went through two rounds of human scoring. From this field sample, the automated scoring engine learns the characteristics of responses, and it is programmed to assign the same scores a human would have given. This spring, as students complete their tests, the computer will first grade all the constructed responses. Then, a quarter of the responses will be rescored by humans. When the computer has “low confidence” in the score it assigned, those responses will be automatically reassigned to a human. The same thing will happen when the computer encounters a type of response that its programming does not recognize, such as one using lots of slang or words in a language other than English. “In addition to ‘low confidence’ scores and responses that do not fit in the computer’s programming, a random sample of responses will also be automatically handed off to humans to check the computer’s work,” notes Peters. While similar to ChatGPT, TEA officials have resisted the suggestion that the scoring engine is artificial intelligence. They note that the process doesn’t “learn” from the responses and always defers to its original programming set up by the state.

Read more of this story at Slashdot.

Warner Bros. Issues DMCA’s After ‘Suicide Squad’ Game Cracked to Allow Playing as Unreleased Characters

“It appears the live-service shooter Suicide Squad: Kill The Justice League is, once again, suffering from a hacker problem,” reports Kotaku:

Instead of doing absolutely absurd amounts of damage, this time hackers have figured out how to gain access to unreleased characters and skins. And publisher WB Games is reportedly issuing DMCA takedown notices against any assets that have found their way online.

As reported by IGN, one hacker discovered how to play as Deathstroke, one of the four characters developer Rocksteady Studios teased for an upcoming Suicide Squad season… There were also unreleased skins for The Joker and King Shark that folks have somehow accessed, all of which began circulating on Reddit and X/Twitter on April 4.

Not long after, the assets were removed, with folks believing WB Games was behind the strikes. YouTuber TrixRidiculous, who primarily covers DC- and Marvel-related RPGs, had their posts on X/Twitter swiftly taken down by a DMCA strike.”I posted three pics to Twitter,” TrixRidiculous told Kotaku over email. “Within probably 30 minutes, I received a DMCA strike from WB Games [Kotaku saw a screenshot of this notice]. Please just bring attention to the fact that the leaderboard is riddled with hackers/cheaters that have gone unbanned since launch, as that’s all I was trying to do anyway.”

This sentiment is shared across the game’s official subreddit, with folks posting about “losing interest” in Suicide Squad due to hackers flooding the leaderboards.

Read more of this story at Slashdot.

US Energy Department Announces ‘Blueprint’ for Slashing Emissions From Buildings and Reducing Energy Use

This week America’s Department of Energy announced “a comprehensive plan to reduce greenhouse-gas emissions from buildings by 65% by 2035 and 90% by 2050.”

The U.S. Department of Energy (DOE) led the Blueprint’s development in collaboration with the Department of Housing and Urban Development, the Environmental Protection Agency, and other federal agencies. The Blueprint is the first sector-wide strategy for building decarbonization developed by the federal government… “America’s building sector accounts for more than a third of the harmful emissions jeopardizing our air and health…” said U.S. Secretary of Energy Jennifer M. Granholm. “As part of a whole-of-government approach, the Department of Energy is outlining for the first time ever a comprehensive federal plan to reduce energy in our homes, schools, and workplaces — lowering utility bills and creating healthier communities while combating the climate crisis.”

Buildings account for more than one third of domestic climate pollution and $370 billion in annual energy costs… The Blueprint projects reductions of 90% of total greenhouse gas emissions from the buildings sector, which will save consumers more than $100 billion in annual energy costs and avoid $17 billion in annual health costs.
Just for example, the Department of Energy’s Affordable Home Energy Shot program “aims to reduce the upfront cost of upgrading a home by at least 50% and reduce energy bills by 20% within a decade.” (Meanwhile, the federal government’s role in making more change happen faster includes financing, funding R&D on lower-cost technologies, expanding markets, and “supporting the development and implementation of emissions-reducing building codes and appliance standards.”)

Besides the national blueprint, the Department also announced an expansion of its Better Buildings Commercial Building Heat Pump Accelerator initiative. In this program, “manufacturers will produce higher efficiency and life cycle cost-effective heat pump rooftop units and commercial organizations will evaluate and adopt next-generation heat pump technology.”
U.S. Secretary of Energy Jennifer M. Granholm said the program “builds on more than a decade of public-private partnerships to get cutting edge clean technologies from lab to market, helping to slash harmful carbon emissions throughout our economy.”

On average, between 20% and 30% of the nation’s energy is wasted, presenting a significant opportunity to increase energy efficiency. Through the Better Buildings Initiative, DOE partners with public and private sector stakeholders to pursue ambitious portfolio-wide energy, waste, water, and/or emissions reduction goals and publicly share solutions. By improving building design, materials, equipment, and operations, energy efficiency gains can be achieved across broad segments of the nation’s economy.

The Accelerator initiative was developed with commercial end users like Amazon, IKEA, and Target, and already includes manufacturers AAON, Carrier Global Corp., Lennox International, Rheem Manufacturing Co., Trane Technologies, and York International Corp. The Accelerator aims to bring more efficient, affordable next-generation heat pump rooftop units to market as soon as 2027 — which will slash both emissions and energy costs in half compared to natural gas-fueled heat pumps. If deployed at scale, they could save American businesses and commercial entities $5 billion on utility bills every year.

Read more of this story at Slashdot.

CNN Investigates ‘Space Shuttle Columbia: The Final Flight’

CNN revisits 2003’s disastrous landing of the Space Shuttle Columbia tonight with two “immersive” specials co-produced by BBC and Mindhouse Productions “featuring exclusive interviews and revealing never-before-broadcast footage,” according to an announcement — with two more specials airing next week.

You can watch a trailer here.

Across four episodes, the story of the ticking-clock of Columbia’s final mission is told in dramatic detail, beginning months before the troubled launch, unfolding across the sixteen days in orbit, and concluding with the investigation into the tragic loss of the seven astronauts’ lives. Weaving together intimate footage shot by the astronauts themselves inside the orbiter, exclusive first-hand testimony from family members of the Shuttle’s crew, key players at NASA — some of whom have never spoken before — and journalists who covered the story on the ground, the series paints an intimate portrait of the women and men onboard and uncovers in forensic detail the trail of events and missed opportunities that ultimately led to disaster.

CNN says the first two episodes will livestream tonight at 9 p.m. EST (time-delayed on the west coast until 9 p.m.PST) — and then be available on-demand starting Monday — “for pay TV subscribers via CNN.com, CNN connected TV and mobile apps.” CNN’s web site offers a “preview” of its live TV offerings here.

They’re promising “the inside story of one America’s most iconic institutions, uncovering how financial pressures and a culture of complacency may have contributed to the events of February 1, 2003. The series also reflects on the legacy of the Space Shuttle era, serving as a timely exploration of the challenges and inherent dangers that remain relevant to space travel today.”

On its web site CNN has also published two companion articles — one by Rice history professor Douglas Brinkley arguing that NASA “was America’s crown jewel. After the Columbia disaster it was never quite the same.”

Because other shuttle missions had returned safely with “shredded” surface tiles — and because the stalwart Columbia had brought astronauts home from 27 previous flights — many NASA officials were lulled into complacency. They went so far as to assure the pilot and commander via email that “there is no concern … We have seen the same phenomenon on several other flights and there is absolutely no concern for entry.”

NASA officials also decided against enlisting spy satellite photography to examine the shuttle damage more thoroughly. If they had, it’s possible that the astronauts could have repaired the spaceplane or at least abandoned it for refuge on the International Space Station…

As the Columbia Accident Investigation Board (CAIB) noted in its final report, “the NASA organizational culture had as much to do with this accident as the foam.” All of NASA’s launches were suspended for two years. While the shuttles eventually flew again, post-Columbia, the program was stunted and curtailed.
The article notes that since then SpaceX, Blue Origin, and the United Launch Alliance (Lockheed Martin and Boeing) “are thriving today in the space industry,” along with Virgin Galactic and Axiom Space. “NASA, far from feeling threatened, has encouraged many of the private companies with massive contracts. The agency already had a long history of dealing with sub-contractors, using its pocketbook to steer aerospace development; that tradition has adjusted seamlessly to the current space economy.”

In the other article CNN Space & Science writer Jackie Wattles notes that when America later retired its Space Shuttle program in 2011, “no U.S. astronaut would travel to space on an American-made rocket for nearly a decade.”

Read more of this story at Slashdot.

How the European Space Agency Celebrated April Fool’s Day

The European Space Agency has a Planetary Defence Office, which includes its Near-Earth Object Coordination Centre. “It has come to our attention,” they wrote in the April edition of their monthly newsletter, “that a recent trend among journalists has been to come up with creative comparisons to convey the size of an asteroid to the public.”

So then, as explained by RockDoctor (Slashdot reader #15,477) “they propose a number of standardised units of comparison for journalists describing ‘death from the skies'”.

An excerpt from that April 1 newsletter:
In the absence of a handy skyscraper, animals commonly used have included giraffes, corgis and an entire colony of penguins. But how do these comparisons stack up? Let’s look at some of our favourite unusual suspects:
– Corgi: At around 30 cm tall, a space rock the size of a corgi wouldn’t pose much of
a threat.
– Half a giraffe: An adult giraffe can reach up to 5.5 metres in height, so half a giraffe
would be about 2.75 metres. While not as impressive as a full skyscraper, an
asteroid that size could certainly destroy a building or two…
– Elephants: An adult African elephant can reach 7 metres at the shoulder. Ninety
elephants stacked on top of each other would form a staggering pile over 630
metres high, creating a devastating but probably not planet-ending event.

As this menagerie of animals can cause a lot of confusion, we at the NEOCC
recommend the use of a Standardised Giraffe Unit (SGU, 1 SGU = 5 penguins) for ease
of comparison.

RockDoctor shares this additional thought in his original submission about the newly proposed standardized unit.

“The world may be turtles all the way down, but it’s giraffes all the way up.”

Read more of this story at Slashdot.

Four Baseball Teams Now Let Ticket-Holders Enter Using AI-Powered ‘Facial Authentication’

“The San Francisco Giants are one of four teams in Major League Baseball this season offering fans a free shortcut through the gates into the ballpark,” writes SFGate.

“The cost? Signing up for the league’s ‘facial authentication’ software through its ticketing app.”

The Giants are using MLB’s new Go-Ahead Entry program, which intends to cut down on wait times for fans entering games. The pitch is simple: Take a selfie through the MLB Ballpark app (which already has your tickets on it), upload the selfie and, once you’re approved, breeze through the ticketing lines and into the ballpark. Fans will barely have to slow down at the entrance gate on their way to their seats…

The Philadelphia Phillies were MLB’s test team for the technology in 2023. They’re joined by the Giants, Nationals and Astros in 2024…

[Major League Baseball] says it won’t be saving or storing pictures of faces in a database — and it clearly would really like you to not call this technology facial recognition. “This is not the type of facial recognition that’s scanning a crowd and specifically looking for certain kinds of people,” Karri Zaremba, a senior vice president at MLB, told ESPN. “It’s facial authentication. … That’s the only way in which it’s being utilized.”

Privacy advocates “have pointed out that the creep of facial recognition technology may be something to be wary of,” the article acknowledges. But it adds that using the technology is still completely optional.

And they also spoke to the San Francisco Giants’ senior vice president of ticket sales, who gushed about the possibility of app users “walking into the ballpark without taking your phone out, or all four of us taking our phones out.”

Read more of this story at Slashdot.