How is Tesla expected to remotely control its robotaxis, and what are its limitations?

Reuters

4 hours ago

June 20 (Reuters) - Tesla (TSLA.O) is expected to tiptoe into its long-awaited robotaxi service in Austin, Texas, as soon as Sunday with about 10 of its Model Y SUVs that will operate within strict limits. CEO Elon Musk has said the company is being "super paranoid" about safety and that humans will remotely monitor the fleet.
Remote access and control - known in the industry as "teleoperation" - is used in varying degrees by the handful of robotaxi startups operating around the globe. The technology has clear advantages and important limitations.
Here are some details of how it works:
Teleoperation is the control of machines by humans in a different location, usually over a wireless network.
It is used to train robots to operate autonomously, monitor their autonomous activity, and take over when required.
The global robotaxi industry is still in test mode, as companies deploy the vehicles in limited geographic areas and continually adjust the artificial intelligence software that controls them. Teleoperation is often used to intervene when a vehicle is unsure of what to do.
Alphabet's (GOOGL.O) Waymo, for example, has a team of human "fleet response" agents who respond to questions from the Waymo Driver - its bot.
"Much like phone-a-friend, when the Waymo vehicle encounters a particular situation on the road, the autonomous driver can reach out to a human fleet response agent for additional information," Waymo said in a blog post last year.
Former Waymo CEO John Krafcik told Reuters, "the cars aren't being actively monitored," adding that the software is "the ultimate decision-maker."
A Waymo video shows a car asking a remote operator whether a street with emergency response vehicles is open to traffic. When the human says yes, the vehicle proceeds.
In contrast, other companies, such as Baidu's Apollo Go in China, have used fully remote backup drivers who can step in to virtually drive the vehicles. Baidu declined to comment.
Driving vehicles remotely on public roads has a major potential problem: it relies on cellular data connections that can drop or operate with a lag, disconnecting the vehicle from the remote driver in dangerous situations.
Philip Koopman, a Carnegie Mellon University engineering professor and autonomous-vehicle safety expert, said that approach could work for a small test deployment of 10 vehicles, such as Tesla's initial effort in Austin, but he called teleoperation "inherently unreliable technology."
"Eventually you will lose connection at exactly the worst time," he said. "If they've done their homework, this won't ever happen for 10 cars. With a million cars, it's going to happen every day."
Former Waymo CEO Krafcik agreed, adding that the time delay in cell signal makes remote driving "very risky."
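Koopman's scaling argument can be made concrete with a back-of-envelope calculation. The sketch below is illustrative only: the per-vehicle daily probability of a badly timed connection loss (`P_LOSS_PER_VEHICLE_DAY`) is a made-up number, not a figure from Tesla, Waymo, or Koopman, and vehicles are assumed to fail independently of one another.

```python
# Hypothetical figure: chance that one vehicle loses its link at a critical
# moment on a given day. Not from the article; chosen only for illustration.
P_LOSS_PER_VEHICLE_DAY = 1e-6


def expected_daily_losses(fleet_size: int, p: float = P_LOSS_PER_VEHICLE_DAY) -> float:
    """Expected number of badly timed connection losses per day."""
    return fleet_size * p


def prob_at_least_one_loss(fleet_size: int, days: int = 1,
                           p: float = P_LOSS_PER_VEHICLE_DAY) -> float:
    """Probability of at least one such loss across the fleet over `days` days,
    assuming independent vehicles and independent days."""
    return 1.0 - (1.0 - p) ** (fleet_size * days)


# Compare a small pilot fleet with a hypothetical million-vehicle fleet.
for fleet in (10, 1_000_000):
    print(f"{fleet:>9} vehicles: "
          f"{expected_daily_losses(fleet):.5f} expected losses/day, "
          f"P(at least one today) = {prob_at_least_one_loss(fleet):.5f}")
```

Under these assumed numbers, a 10-car fleet would almost never see such an event, while a million-car fleet would expect about one per day. The real probability is not public, so only the scaling, not the absolute values, should be read from this.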
On the other hand, relying on the vehicle to reach out for help and allowing the vehicle to be the decision-maker are risky as well, Koopman said, as that approach does not guarantee the vehicle will make the right decision.
Waymo declined to comment on the limitations of its approach.
Koopman also noted there are limits to how many vehicles one person can safely monitor.
A group of Democratic Texas lawmakers asked Tesla on Wednesday to delay its robotaxi launch until September, when a new autonomous-driving law is scheduled to take effect. The Austin-area lawmakers said in a letter that delaying the launch "is in the best interest of both public safety and building public trust in Tesla's operations."
Musk for years has promised, without delivering, that Tesla's Full Self-Driving (Supervised) advanced driver assistance software would graduate to completely self-driving and control robotaxis. This year, he said Tesla would roll out a paid service in Austin underpinned by an "unsupervised" version of the software.
"Teslas will be in the wild, with no one in them, in June, in Austin," Musk told analysts and investors in January. In May, he told CNBC that the robotaxi would only operate in parts of Austin that are safe for it, would avoid difficult intersections, and would use humans to monitor the vehicles.
What those teleoperators will do is not clear.
For years inside Tesla, company executives have expected to use teleoperators who could take over in case of trouble, said one person familiar with the matter. For instance, if a robotaxi were stuck in a crowded pedestrian area and confused about what to do next, a human teleoperator could take over and guide it, the source said.
Tesla advertised for teleoperation positions, saying the company needs the ability to "access and control" autonomous vehicles and humanoid robots remotely. Such employees can "remotely perform complex and intricate tasks," it said in the advertisements.
Tesla did not respond to a request for comment.
"We are being super paranoid about safety, so the date could shift," Musk said in a post on X last week while providing a tentative launch date of June 22.


Aflac finds suspicious activity on US network that may impact Social Security numbers, other data

The Independent

20 minutes ago


Aflac says that it has identified suspicious activity on its network in the U.S. that may impact Social Security numbers and other personal information, calling the incident part of a cybercrime campaign against the insurance industry.
The company said Friday that the intrusion was stopped within hours.
'We continue to serve our customers as we respond to this incident and can underwrite policies, review claims, and otherwise service our customers as usual,' Aflac said in a statement.
The company said that it's in the early stages of a review of the incident, and so far is unable to determine the total number of affected individuals.
Aflac Inc. said potentially impacted files contain claims information, health information, Social Security numbers, and other personal information related to customers, beneficiaries, employees, agents, and other individuals in its U.S. business.
The Columbus, Georgia, company said that it will offer free credit monitoring, identity theft protection and Medical Shield for 24 months to anyone who calls its call center.
Cyberattacks against companies have been rampant for years, but a string of attacks on retail companies has raised awareness of the issue because the breaches can impact customers.
United Natural Foods, a wholesale distributor that supplies Whole Foods and other grocers, said earlier this month that a breach of its systems was disrupting its ability to fulfill orders, leaving many stores without certain items.
In the U.K., consumers could not order from the website of Marks & Spencer for more than six weeks, and found fewer in-store options, after hackers targeted the British clothing, home goods and food retailer. A cyberattack on Co-op, a U.K. grocery chain, also led to empty shelves in some stores.
A security breach detected by Victoria's Secret last month led the popular lingerie seller to shut down its U.S. shopping site for nearly four days, as well as to halt some in-store services. Victoria's Secret later disclosed that its corporate systems were also affected, causing the company to delay the release of its first quarter earnings.
The North Face said that it discovered a 'small-scale credential stuffing attack' on its website in April. The company reported that no credit card data was compromised and said the incident, which impacted 1,500 consumers, was 'quickly contained.'
Adidas disclosed last month that an 'unauthorized external party' obtained some data, which was mostly contact information, through a third-party customer service provider.

Professional Quality Voice Cloning : Open Source vs ElevenLabs

Geeky Gadgets

43 minutes ago


What if you could replicate a voice so convincingly that even the closest of listeners couldn't tell the difference? The rise of professional-quality voice cloning has made this a reality, transforming industries from entertainment to customer service. But as this technology becomes more accessible, a pivotal question emerges: should you opt for the polished convenience of a commercial platform like ElevenLabs, or embrace the flexibility and cost-efficiency of open source solutions? The answer isn't as straightforward as it seems. While ElevenLabs promises quick results with minimal effort, open source tools offer a deeper level of customization, if you're willing to invest the time and expertise. This tension between convenience and control lies at the heart of the debate.
In this article, Trelis Research explores the key differences between open source voice cloning models and ElevenLabs, diving into their strengths, limitations, and use cases. From the meticulous process of preparing high-quality audio data to the technical nuances of fine-tuning models like CSM1B and Orpheus, you'll uncover what it takes to achieve truly lifelike voice replication. Along the way, we'll also examine the ethical considerations and potential risks that come with wielding such powerful technology. Whether you're a curious enthusiast or a professional seeking tailored solutions, this exploration will challenge your assumptions and help you make an informed choice. After all, the voice you clone may be more than just a tool; it could be a reflection of your values and priorities.

What Is Voice Cloning?
Voice cloning involves training a model to replicate a specific voice for text-to-speech (TTS) applications. This process requires high-quality audio data and advanced modeling techniques to produce results that are both realistic and expressive. Commercial platforms like ElevenLabs provide fast and efficient solutions, but open source models offer a cost-effective alternative for those willing to invest time in training and customization. By using these tools, you can create highly personalized voice outputs tailored to your specific needs.

Data Preparation: The Foundation of Accurate Voice Cloning
High-quality data is the cornerstone of successful voice cloning. To train a model effectively, you'll need at least three hours of clean, high-resolution audio recordings. The preparation process involves several critical steps that ensure the dataset captures the unique characteristics of a voice:
Audio Cleaning: Remove background noise and normalize volume levels to ensure clarity and consistency.
Audio Chunking: Divide recordings into 30-second segments, maintaining sentence boundaries to preserve coherence and context.
Audio Transcription: Use tools like Whisper to align text with audio, creating precise and synchronized training data.
These steps are essential for capturing the nuances of a voice, including its tone, pitch, and emotional expression, which are critical for producing realistic outputs.

Open Source Models: Exploring the Alternatives
Open source voice cloning models provide powerful alternatives to commercial platforms, offering flexibility and customization. Two notable models, CSM1B (Sesame) and Orpheus, stand out for their unique features and capabilities:
CSM1B (Sesame): This model employs a hierarchical token-based architecture to represent audio. It supports fine-tuning with LoRA (Low-Rank Adaptation), making it efficient for training on limited hardware while delivering high-quality results.
Orpheus: With 3 billion parameters, Orpheus uses a multi-token approach for detailed audio representation. While it produces highly realistic outputs, its size can lead to slower inference times and increased complexity during tokenization and decoding.
When fine-tuned with sufficient data, these models can rival or even surpass the quality of commercial solutions like ElevenLabs, offering a customizable and cost-effective option for professionals.

Fine-Tuning: Customizing Open Source Models
Fine-tuning is a critical step in adapting pre-trained models to replicate specific voices. By applying techniques like LoRA, you can customize models without requiring extensive computational resources. During this process, it's important to monitor metrics such as training loss and validation loss to ensure the model is learning effectively. Comparing the outputs of fine-tuned models with real recordings helps validate their performance and identify areas for improvement. This iterative approach ensures that the final model delivers accurate and expressive results.

Open Source vs. ElevenLabs: Key Differences
ElevenLabs offers a streamlined voice cloning solution, delivering high-quality results with minimal input data. Its quick cloning feature allows you to replicate voices using small audio samples, making it an attractive option for users seeking convenience. However, this approach often lacks the precision and customization offered by open source models trained on larger datasets. Open source solutions like CSM1B and Orpheus, when fine-tuned, can match or even exceed the quality of ElevenLabs, providing a more flexible and cost-effective alternative for users with specific requirements.

Generating Audio: Bringing Text to Life
The final step in voice cloning is generating audio from text. Fine-tuned models can produce highly realistic outputs, especially when paired with reference audio samples to enhance voice similarity. However, deploying these models for high-load inference can present challenges due to limited library support and hardware constraints. Careful planning and optimization are essential to ensure smooth deployment and consistent performance, particularly for applications requiring real-time or large-scale audio generation.

Technical Foundations of Voice Cloning
The success of voice cloning relies on advanced technical architectures that enable models to produce realistic and expressive outputs. Key elements include:
Token-Based Architecture: Audio is broken into tokens, capturing features such as pitch, tone, and rhythm for detailed representation.
Hierarchical Representations: These allow models to understand complex audio features, enhancing expressiveness and naturalness in the generated outputs.
Decoding Strategies: Differences in decoding methods between models like CSM1B and Orpheus influence both the speed and quality of the generated audio.
Understanding these technical aspects can help you select the right model and optimize it for your specific use case.

Ethical Considerations in Voice Cloning
Voice cloning technology raises important ethical concerns, particularly regarding potential misuse. The ability to create deepfake audio poses risks to privacy, security, and trust. As a user, it's your responsibility to ensure that your applications adhere to ethical guidelines. Prioritize transparency, verify the authenticity of cloned voices, and use the technology responsibly to avoid contributing to misuse or harm.

Best Practices for Achieving Professional Results
To achieve professional-quality voice cloning, follow these best practices:
Use clean, high-quality audio recordings for training to ensure accuracy and clarity.
Combine fine-tuning with cloning techniques to enhance voice similarity and expressiveness.
Evaluate models on unseen data to test their generalization and reliability before deployment.
These practices will help you maximize the potential of your voice cloning projects while maintaining ethical standards.

Tools and Resources for Voice Cloning
Several tools and platforms can support your voice cloning efforts, streamlining the process and improving results:
Transcription Tools: Whisper is a reliable option for aligning text with audio during data preparation.
Libraries and Datasets: Platforms like Hugging Face and Unsloth provide extensive resources for training and fine-tuning models.
Training Environments: Services like Google Colab, RunPod, and Vast AI offer cost-effective solutions for model training and experimentation.
By using these resources, you can simplify your workflow and achieve high-quality results in your voice cloning projects.

Media Credit: Trelis Research
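The audio chunking step described in the voice cloning article (segments of up to 30 seconds that respect sentence boundaries) can be sketched roughly as follows. This is a minimal illustration, not the pipeline used by Trelis Research: it assumes sentence-level timestamps are already available, for example from a transcription tool, and the names `Sentence` and `chunk_sentences` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Sentence:
    start: float  # seconds from the beginning of the recording
    end: float
    text: str


MAX_CHUNK_SECONDS = 30.0  # segment length mentioned in the article


def chunk_sentences(sentences, max_seconds=MAX_CHUNK_SECONDS):
    """Greedily pack consecutive sentences into chunks no longer than
    max_seconds, never splitting a sentence. A single sentence longer than
    max_seconds becomes its own (oversized) chunk."""
    chunks, current = [], []
    for s in sentences:
        # Close the current chunk if adding this sentence would exceed the limit.
        if current and s.end - current[0].start > max_seconds:
            chunks.append(current)
            current = []
        current.append(s)
    if current:
        chunks.append(current)
    return [
        {
            "start": c[0].start,
            "end": c[-1].end,
            "text": " ".join(s.text for s in c),
        }
        for c in chunks
    ]


# Hypothetical timestamps, for illustration only.
example = [
    Sentence(0.0, 12.0, "First sentence."),
    Sentence(12.5, 24.0, "Second sentence."),
    Sentence(24.5, 41.0, "Third sentence."),
    Sentence(41.5, 50.0, "Fourth sentence."),
]
for chunk in chunk_sentences(example):
    print(chunk)
```

Each returned dictionary gives the time span to cut from the source audio and the matching transcript text. The greedy strategy is one simple choice; a real pipeline might also enforce a minimum chunk length or trim silence at the edges.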

Microsoft's Copilot Turns Notepad Into Your Ultimate Writing Assistant

Geeky Gadgets

43 minutes ago


What if the humble Notepad, a tool synonymous with simplicity, suddenly became your most powerful writing assistant? With the integration of Copilot, Microsoft has transformed this classic text editor into an innovative productivity powerhouse. Imagine drafting a report, only to have Notepad suggest clearer phrasing, summarize your key points, or even format your ideas into a polished layout, all in real time. This isn't just an upgrade; it's a reimagining of what Notepad can be, blending its lightweight charm with the intelligence of AI. For a tool that's been a staple for decades, this leap forward is nothing short of innovative.
In this piece, Aldo James explores how Copilot's AI-powered tools are reshaping the way users interact with Notepad. From rewriting and summarizing text to tailoring tone and structure, these features promise to save time and elevate the quality of your work. But what makes this update truly remarkable is its accessibility, offering advanced capabilities without overwhelming the simplicity that users love. Whether you're a professional juggling deadlines or a student organizing notes, this evolution of Notepad is designed to meet your needs in ways you might not expect. It's a shift that raises an intriguing question: could this be the future of everyday writing?

AI-Powered Tools Embedded in Notepad
The addition of Copilot brings a robust suite of AI-driven tools directly into Notepad, allowing users to perform a variety of tasks with efficiency and precision. These tools include rewriting text for improved clarity, summarizing lengthy content into concise points, and formatting text for enhanced readability. Whether you're drafting quick notes, preparing detailed documents, or brainstorming ideas, Copilot offers intelligent suggestions tailored to your specific needs. For instance, you can refine a lengthy paragraph into a more concise version or convert a block of text into a structured bulleted list. These capabilities are designed to save time while improving the overall quality of your writing. By integrating these features into Notepad, Microsoft has made advanced text editing accessible to users without requiring them to switch to more complex software.

Customizable Text Tailored to Your Needs
One of the standout features of Copilot in Notepad is its ability to adapt to your personal preferences. You can customize the tone of your writing to suit different contexts, whether you need a formal tone for professional documents, a casual tone for informal communication, or even a marketing-oriented style for promotional content. Additionally, Copilot allows you to adjust the length of your text, making it suitable for tasks ranging from drafting brief emails to creating comprehensive reports. The tool also offers flexible formatting options, allowing you to structure your text into lists, paragraphs, or other layouts. This adaptability ensures that the output aligns with your specific goals, making Copilot a valuable resource for a wide range of writing tasks. Whether you're a student, a professional, or a casual user, these features provide the flexibility needed to meet diverse requirements.

Interactive Editing for Enhanced Precision
Copilot's interactive editing capabilities make the process of refining text more dynamic and user-friendly. When you highlight a section of text, the tool generates multiple rewriting suggestions, giving you the opportunity to evaluate and select the best option. If none of the suggestions meet your expectations, you can use the 'try again' feature to request alternative edits. This iterative approach enables you to refine your content until it meets your standards, ensuring a polished and professional final result. This feature is particularly useful for users who want to experiment with different writing styles or improve the clarity of their text. By providing real-time suggestions and allowing for multiple iterations, Copilot makes the editing process more efficient and less time-consuming. This interactive functionality sets it apart from traditional text editors, offering a level of precision that enhances the overall writing experience.

Streamlined Productivity with Quick Shortcuts
Efficiency is a core focus of the Copilot integration in Notepad. The tool includes quick shortcuts for common tasks such as summarizing, rewriting, or formatting text. These shortcuts are designed to integrate seamlessly into your workflow, allowing you to perform complex actions with minimal effort. For example, you can instantly summarize a long document into key points or reformat text for better readability with just a few clicks. By embedding these advanced capabilities into a lightweight application like Notepad, Microsoft ensures that users can harness the power of AI without the need for resource-intensive software. This approach not only enhances productivity but also preserves the simplicity and accessibility that have made Notepad a trusted tool for decades.

Seamless Integration with the M365 Ecosystem
To access Copilot in Notepad, users need an M365 subscription, which is available in personal, family, or business plans. The integration is designed to be intuitive, with a Copilot icon conveniently located in the top-right corner of the application. This ensures that the advanced features are easily accessible without disrupting the familiar Notepad interface. By linking Copilot to the broader M365 ecosystem, Microsoft provides a cohesive experience across its suite of productivity tools. This seamless integration allows users to transition effortlessly between applications, enhancing overall efficiency. Whether you're working on a document in Word, creating a presentation in PowerPoint, or drafting notes in Notepad, the consistent functionality of Copilot ensures a smooth and productive workflow.

Notepad Transformed for the Modern User
The integration of Copilot into Notepad represents a significant evolution for this classic application. By combining the simplicity of Notepad with the advanced capabilities of AI, Microsoft has created a tool that caters to a wide range of users, from casual note-takers to professionals managing complex projects. With features like customizable text editing, interactive suggestions, and seamless integration with the M365 ecosystem, Copilot in Notepad redefines what a text editor can achieve. Whether you're looking to streamline your workflow, enhance the quality of your writing, or organize your ideas more effectively, this update ensures that Notepad remains a relevant and powerful tool in an increasingly AI-driven world.

Media Credit: Aldo James
