Blog
Think beyond the bot: 3 proven strategies for digital-first business success
Everyone’s talking about the importance of digital-first customer experience, especially these days—but you have to think bigger than that. It’s not enough to just have a great mobile app or self-service chatbot. Market-leading success relies on having a digital-first culture that drives your business and meets your customers where they are.
In simplest terms, ‘digital-first’ means making your operations as digital and mobile-friendly as possible, including enabling seamless communication across multiple responsive, persistent and asynchronous digital channels. Particularly in customer service, digital-first operations and the notion of engaging with your customers in the same place that they talk to their friends and family is a game-changer.
I’m not talking about technology replacing people, but about using the latest digital capabilities to make your organization radically more efficient and productive, and improving the customer experience at the same time. It’s all about empowering employees to do their best work, and meeting customer needs faster and better, wherever they are and whenever they need you.
Where should you focus first? I think a digital-first culture for customer service relies on a few fundamentals for meeting today’s demands. Let’s look at three proven strategies that are powering market leading companies.
1—Be where customers are—service on the go
Customer service is all about building relationships with your customers. In a digital-first world, those conversations are now across many channels from online support portals and webchat, to email, in-app, standard text messaging, and social media. Among all the options for digital engagement, asynchronous messaging has emerged as a clear winner with consumers. In fact, messaging has been the dominant way in which people communicate since over a decade ago. It makes sense that they want to engage with brands they love in the same place, given the familiarity and convenience.
For digital-first customer service, providing messaging capabilities is now the essential way to be there for customers in the moment. It gives customers a fast, convenient way to reach out, and can make it easier to resolve service issues cost-efficiently on the first contact.
Big tech leaders paved the way with technologies like Apple Business Chat and Google Business Messaging, enabling companies to integrate messaging at multiple touch points. New AI-powered solutions are now taking it a step further.
I like to call this the ‘asynchronous revolution.’ Digital-first leadership now demands multimodal customer service that supports ongoing conversations. The “start and stop” flexibility that people love so much in their personal messaging is now a business-critical interaction model that today’s digital consumers expect.
2—Make conversations seamless across channels
Being there for customers in digital messaging is key—but what if they need to pause the conversation or switch channels in the middle of an issue? Typically it means starting all over again, which is a frustrating experience. When people have to explain their problem to multiple agents, customer satisfaction drops off a cliff.
Delivering true omni-channel support is the next essential in a digital-first business, and it is finally here.
Whether consumers reach out via messaging, webchat, or voice, all of these channels must be intelligently integrated to ensure customer service agents really know each person. Employees need to be equipped with the right interaction history and procedural knowledge to quickly move toward resolution, no matter where or when that agent stepped into the conversation.
Many companies are finding that’s the most critical customer experience problem to solve, and the biggest source of frustration for consumers who need help. That’s why I’m so excited when I see new technologies seamlessly unify and thread conversation histories across channels. Innovative solutions are finally bringing a holistic approach to customer service.
Someone can request assistance in an app, then switch to a webchat, mobile messaging, or even voice, and any agent will know all the relevant details. It ensures the customer has an easy, cohesive experience, and enables agents to work more efficiently.
In this way, AI helps fuel a digital-first business and empower support and sales teams to treat every customer interaction as a moment that matters.

AI amplifies the effectiveness of digital-first channels and new techniques help to accelerate adoption. Powering digital and mobile touch points with data-driven intelligence also radically increases productivity at a lower cost—while delivering responsive, personalized customer experiences that win brand loyalty.
Michael Lawder
3—Empower employees with predictive capabilities
Once you’ve got digital messaging and omni-channel support in place, it’s time to kick it up a notch by augmenting customer service agents and sales teams with predictive and highly contextualized knowledge, powered by self-learning AI models. Most companies have vast volumes of data that can be harnessed to make support and marketing efforts substantially more productive.
Here’s where cutting-edge AI solutions really stand out. They can transcribe and analyze digital and voice conversations in real time, and integrate those insights with other transactional and historical data. The system can then proactively guide agents and sales reps with the most accurate and relevant information for a given customer need or situation, ensuring the best answer and outcome every time.
Those ‘conversational analytics’ translate into predictive insights that deliver powerful benefits:
- Dramatically improves productivity
- when your workforce knows exactly what to say and do through every interaction, without having to dig for details.
- Captures the knowledge
- of your best agents and sales reps. Sometimes that’s the best ‘data’ you’ve got, and it’s a lot more valuable when everyone can access it.
- Delivers data-driven intelligence
- to sales and marketing teams, making it easier to improve your operation and tailor customer experiences based on predictive insights instead of best guesses.
Ultimately, I believe the power of AI and predictive technologies like ASAPP are defining the future of digital-first business. This is the first time in the history of the customer service industry that we can simultaneously meet customers where they are, drive revenue growth, deliver a better customer and employee experience—and do it all at lower costs. And it’s about time.
A silver bullet to end the conflict between lower cost and better customer service
Modern CX Teams: What do they have that you don’t?
Long before the pandemic, consumers made the digital shift—but business has been slow to catch up. For over a decade, texting and digital messaging have been the most popular ways to communicate. Yet, the average Fortune 500 company still spends 80% or more of their contact center budget on voice calls—where the experience is worse and the cost is higher. Customers are handled more like anonymous callers, and the levers to drive efficiency and quality are limited.
For brands to stay competitive, customer service needs to modernize, using new technologies to really know customers and be there for them where and how they want to be supported. Call centers will still be part of the mix, but they need to deliver more value, and be integrated into a larger, holistic omni-channel support strategy. By embracing innovative capabilities, companies can deliver cost-efficient human-centric digital experiences that help win and keep customers.
Rethinking digital communications
When customers have a question or a problem, they want quick, convenient resolution on their preferred communication channel. Today that most often means digital. Customers may want to use self-service tools, reach out on social media, and chat, text, or message in a variety of ways. The best brands will make it an easy, personalized experience that nurtures loyalty and increases lifetime value, regardless of channel. There is real power in engaging with your customers in the same place they talk to their friends and family.
Many budget-minded executives see digital communications solely as a way to minimize costly engagement with customers. They measure success in deflection and containment. But that’s not a sustainable (or logical) approach. What if, instead, your digital capabilities made every interaction substantially better?

Reimagining CX with AI at the core dramatically changes the paradigm. Companies can deliver great experiences AND drive new levels of efficiency.
Michael Lawder
AI innovations are making that possible. Today, businesses can not only deliver smarter digital channels, but empower better customer service communications all around.
One of the biggest wins is meeting customer needs on the first contact. When an AI-powered solution integrates all your digital communication channels, both agents and customers have a greater opportunity to get it right the first time. If a customer does need to follow up on something, a smart system provides all the relevant details so any agent can quickly move things forward instead of starting all over.
That’s one part of a better, modernized customer experience. Taking it further, AI-driven solutions also make agents significantly more productive through every interaction. Machine learning augments CX teams with the right knowledge at the right time for faster, easier resolution. I love that some of the most advanced machine learning technologies are being used to help make people (in this case, agents) better.By dramatically increasing efficiency, customer care agents can deliver a more personalized and contextual interaction — turning potentially negative experiences into real loyalty moments.
Fueling conversation-powered operations
Conversations are the heart of customer service, and digital-first technologies can make those communications operationally more efficient. In real time, it means faster resolution for shorter and more concurrent interactions, at lower costs. In the long view, it provides a wealth of data for machine learning insights, providing ‘conversational analytics’ to fuel strategies and actions that help the business.
Think about ‘voice of the customer’ programs, where feedback mostly comes through surveys. What if you could easily uncover insights from conversations across many touch points?
After decades leading contact center organizations, I think the future of workforce management is in conversation-powered operations. Companies have a goldmine of data that isn’t being tapped. With the best of modern technology, machine learning can extract valuable customer sentiment from 100% of contacts with zero manual work instead of the old way of listening to calls and reading transcripts.
It can rapidly analyze thousands of agent conversations for internal quality assurance to improve compliance, soft skills, and process optimization. This modernization provides a significantly greater ability to understand the quality of service agents are delivering, and opens a window of insights into the enterprise. It’s a faster, smarter way to identify how and where to make both transactional and strategic improvements that are better for the business and the customer experience.
Humanizing your agents
Circling back to empowering agents, modern CX teams recognize customer service agents are the voice of their brand and should be brand ambassadors. We all know the adage about ‘making every agent as good as your best agents’—but you can only get there if you support them well. Instead of juggling multiple tools and inefficient processes, agents need to be able to focus on ensuring each customer feels known, valued, and supported.
That’s the kind of experience that powers a brand. And that’s where ASAPP really shines.
ASAPP modernizes CX with a streamlined, unified platform to easily support customers across channels. With AI-driven predictive knowledge, agents know what to say and do to serve each customer better and faster. It uses the power of technology not to replace the human touch or hide from customers, but to make contact more personalized. Ultimately, that’s the path to greatness for a brand…empowering both sides of the conversation with the right balance of digital efficiency and an emotional connection. That builds relationships with your customers and makes great customer experience a core part of the value proposition of your brand.
If you're missing the signals you'll likely miss the sale
ASAPP tops ASR leaderboard with E‑Branchformer
ASR technology has been beneficial for businesses and their customers for many years. ASR, or Automatic Speech Recognition, is the software that translates human speech into text. With continual advancements in research and AI modeling, accuracy has improved immensely over time. Developing the most accurate ASR possible has become a high priority for many top tech companies because of how much it benefits businesses when it’s done correctly.
ASR’s primary goal is to maintain high recognition accuracy. There are various units of evaluating recognition rates or error rates, such as phonemes, characters, words, or sentences. The most commonly used method to determine the accuracy of ASR is Word Error Rate (WER).
Healthy competition
To fairly compare AI speech recognition research studies across the industry, we evaluate WER on public datasets. Librispeech, one of the most widely used datasets, consists of about 1000 hours of English reading speech with transcription and extra text corpus. Researchers worldwide have been competing for years to substantiate their methods’ superiority using the Librispeech dataset and WER.
Recently, the speech community has been trending towards end-to-end (E2E) modeling for ASR. Instead of having separate acoustic and language models, as in conventional ASR methods, E2E modeling has achieved great success in both efficiency and accuracy by simultaneously training a single integrated model.
Although several E2E model structures, such as Transducer and Attention-based Encoder-Decoder (AED) have been explored, most of them share a common encoder, the module that extracts meaningful representative information from the input speech.
Speech scientists, looking to create a more powerful encoder, are actively studying novel training objectives, acoustic feature engineering, data augmentation methods, and self-supervised learning using untranscribed speech.
But these research areas don’t address a fundamental question, “What is the optimal neural network architecture for constructing the encoder?”
To address this question, ASAPP researchers recently developed the E-Branchformer model, a highly accurate neural network. Other similar models include Transformer, Conformer, and Branchformer; however, the E-Branchformer surpasses these models in accuracy. Here’s a quick overview of the different models ASAPP used to develop E-Branchformer.
Transformer
The Transformer has shown promising performance in several sequence modeling tasks for ASR and NLU (natural language understanding). This potential is due to the strength of multi-headed self-attention, which can effectively extract meaningful information from the input sequence, while considering the global context.
Conformer
To improve the Transformer, many methods have been introduced and utilized to create synergy by applying convolution, which has advantages in modeling the local context.
In particular, Conformer was introduced and is widely considered as the state-of-the-art accuracy in Librispeech ASR tasks.
By evaluating with an external Language Model (LM) trained using Librispeech text corpus, Conformer achieves 1.9% and 3.9% WER on test-clean and test-others, respectively. Although Conformer demonstrates that stacking convolution sequentially after self-attention is a better method than using them in parallel, other research studies, like Branchformer, have applied these two neural networks in parallel, and found performance to be noticeable.
Branchformer
Branchformer was introduced with three main components:
- Local-context branch using MLP with convolutional gating (cgMLP)
- Global-context branch using multi-headed self-attention
- Merging the module with a linear projection layer
Each branch is computed in parallel before results are merged. Through intensive experiments, Branchformer showed comparable performance with Conformer. Other experiments stacked different combinations by mixing Branchformer and Conformer, but didn’t achieve better results.
E-Branchformer
Inspired by the Branchformer studies, ASAPP researched how convolution and self-attention can be combined more effectively and efficiently.

This resulted in the highest performing model, E-Branchformer, setting the new state-of-the-art WER at 1.81% and 3.65% on Librispeech test-clean and test-other with an external LM.
Kwangyoun Kim
To develop E-Branchformer, we made two primary contributions to Branchformer that significantly improved performance.
- We enhanced the merging module, which combines the output of the global and local branches, by introducing additional convolutions. This change has the effect of combining self-attention with convolution sequentially and in parallel. Through extensive experiments on several types of merge modules, we proved that adding a single depth-wise convolution can significantly improve accuracy with negligible computational increase.
- We revisited the point-wise Feed-Forward Network (FFN). Transformer and its variants commonly stack FFN with self-attention in an interleaving pattern. We experimentally demonstrated that even in Branchformer, stacking FFN together is more effective in improving the model’s capacity. For example, we found that a stack of 17 Branchformers and 17 FFNs in an interleaving pattern has a similar model size to the 25 Branchformers, but is much more advantageous in accuracy.
ASAPP has topped the leaderboard of WER in Librispeech ASR tasks by using the newly proposed E-Branchformer. We are confident that this new model structure can be applied to other tasks and achieve impressive results.
We’re sharing our findings with the community so that everyone can benefit from them. You’ll be able to find all of the detailed methods and experimental results in our upcoming paper, which has been accepted and will be presented at SLT 2022. We’ll also release more information about how we implemented E-Branchformer. Our models’ recipes will be available through ESPnet, so anyone who wants to can reproduce our results. If you’d like to talk about E-Branchformer in person, please reach out to us during the SLT 2022 conference.
It's time to rethink customer service
It’s time to rethink customer service. For decades, I’ve seen how contact centers are pressured to focus on costs and end up sacrificing quality. It’s easy to understand why: Customer service is one of the largest operating expenses at most consumer companies. But, when the drive for efficiency comes at the expense of customer experience—and ultimately loyalty—companies often lose as much as they gain.
A no-win situation for companies and the consumers they serve
Here’s a typical scenario: To save money companies hire thousands of contact center workers in low cost locations who have never used their product or service, give them 2-3 weeks of training, and unleash them to interact with customers. They invest in bots to keep customers away. And they may try some chat channels, with hope of having agents help more than one customer at a time. None of these works particularly well to serve the customer. So, they may try one and then another—and because each channel is in a separate system—they have to start all over with every attempt.
The critical need for a new strategy
Meeting customer needs and controlling your brand have never been more challenging—nor more important. Consumers now have a powerful voice to sway opinion, their expectations can change with a tweet, and brand loyalty is fleeting. People in this mobile-first, digital-first world expect fast, anytime, anywhere service, but they also want companies to know and value them. Customer service needs to deliver all of that, yet too often it misses the mark because frontline workers aren’t set up to be successful.

Instead of using technology to keep customers away from your agents, use it to empower those agents to deliver great customer experiences. It will pay off in both greater productivity and increased customer and agent satisfaction.
Michael Lawder
Customer care agents are the voice of your brand. But what happens if they have to struggle to solve problems with insufficient knowledge and an array of inefficient tools? It’s no wonder many businesses have an agent annual attrition rate that often reaches 100% or higher—while customer satisfaction is going down.
After two decades working every level of customer service at top brands like Apple and Samsung, I know there’s a better way.
Empower every agent to be their best
Companies don’t have to choose between reducing cost and providing great customer experience. We just need to leave old thinking behind, and take advantage of emerging technologies that enable contact center employees to do their best work.
The key is in supporting agents, helping them to be more productive—and more engaged and satisfied with their jobs. When I think about the agent and contact center of the future, a few essentials rise to the top:
- Make it easy to focus on customers.
- Typically, agents spend most of their time and attention hunting for information across multiple systems, and manually working through processes. Meanwhile the customer waits and gets frustrated. Giving agents a streamlined, integrated system they can learn in hours instead of days will both increase operational efficiency and drive higher customer satisfaction.
- Give every agent instant access to the best knowledge available.
- It’s time we put the information agents need at their fingertips using the power of AI. No more relying on shoulder-tapping a coworker for answers or putting customers on long holds while culling through virtual libraries. Innovative technologies can predictively deliver the right knowledge and procedures, so agents know exactly what to say and do to best serve customers and solve problems faster. Augmenting workers with AI and machine learning means you don’t eliminate the human touch but make it dramatically more productive.
- Support customers seamlessly anywhere they are.
- Digital-first is now the name of the game, and that includes making it easy for customers to solve a problem or meet a need using any channel they prefer. Suppose someone starts on a call but needs to drop, and wants to finish resolving their issue later via chat or texting. Contact centers need to provide that continuity across channels, so agents can instantly pick up where things left off and customers have an effortless experience and feel that you know them. The more convenience and simplicity you provide, the more likely you will earn loyalty and build trust, driving lifetime value along the way.
The new demands of customer experience require a new approach for customer service. That’s why I’m so passionate about working at ASAPP. For the first time in decades, I see the promise of artificial intelligence in action. With a unique technology solution, ASAPP solves two primary challenges that are typically in conflict. Empowering agents to be more productive reduces costs and improves the bottom line, and at the same time creates a simpler and effortless customer experience that drives loyalty and retention for your brand.
It’s customer service for the future—right now, when consumers and businesses need it most.
The brittleness of RPA is failing you
Every day contact center agents help millions of customers. To assist in each one of those contacts, an agent must first listen to the customer, diagnose the issue, and apply problem-solving to determine the correct sequence of actions to address customers’ needs. In each step of this process, agents must fetch, read, and update information from multiple back-office applications that, more often than not, are complex systems optimized to support business operations, not for providing a great user experience to agents.
A high learning curve
In practice, agents must know the purpose of each of those back-office systems to determine in which one they may find the information they seek. In addition, they must know the specific navigation flow that leads to the information within each system. Enabling this level of knowledge is expensive because it requires significant agent training, documentation and learning environments for trainees. And while training is a good start, experience is also a key. But, deep experience is rare in an industry where attrition averages 30-45% and can range to more than 100% annually. As a result, agents spend an appreciable amount of time wandering in applications looking for information. This leads to longer waiting times, incomplete answers, and more frustration for customers.
The limitations of RPA
In legacy systems, Robotic Process Automation (RPA) is a standard way of automating tasks in User Interfaces (UIs). However, RPA is highly resource-intensive since it requires manual scripting of each sequence of actions to be performed across many applications with potentially hundreds of navigation flows each. Hence, RPA simply doesn’t scale efficiently or effectively.

Where RPA is brittle and resource-intensive to scale, a machine learning system creates navigation flows automatically—and readily adapts when there are changes in the UI or agents’ behavior.
Nicolás D'Ippolito, PhD
Adding to the challenge, to automate tasks in RPA, developers must provide a list of actions to be performed. The definition of action in RPA requires a detailed description that allows the robot to identify the specific object in the UI to which it has to interact. As a result RPA scripts are highly coupled with the UI structure, making them very fragile to subtle changes in the UI. Although backend systems tend to change slowly, frontend systems often change frequently. Since the trend in the industry is to adopt web-based systems both internal and SaaS, fragility of RPA tools is an increasingly large problem.
Meeting the challenge with AI
In contrast, ASAPP AI-powered features can automatically determine the back-office system and the navigation flow that gets the agent to the required information. Our models evaluate the conversation context and identify potential navigation suggestions. When a recommendation is found, the agent is presented with a compact description of the system and flow leading to the required information. If the agent takes the recommendation, the ASAPP platform leads the agent to the information.

To implement these UI augmentation features we combine machine learning with stochastic analysis to generate behaviour models that abstract the potential interactions with the UI. The process to train this system is based on analysis of historical user interaction data. These models allow for efficient recall of high probability navigation paths towards a navigation goal from the current system state. This capability gives us the power to automatically create a robust navigation tool for any given application. We then combine our navigation tools with NLP and classification models to determine for each conversation context which navigation tool and flow to use.
In addition to the automation benefits, our UI augmentation greatly increases the resiliency to changes in the UI. Since we have a behavior model of every system we can detect deviations from standard usage patterns due to changes in navigation flows in the UI. We can also detect new states in the system that we didn’t observe before due to changes in the UI or agents’ usage patterns. In both cases, these deviations are considered during the automatic retraining cycles. Our models adapt to the changes in the UI, and navigation tools are re-generated, ensuring agent access to needed information is always current.
UI augmentation enables agents to spend less time navigating back-office systems and more time helping customers. This leads to faster issue resolution, and therefore, happier customers. In addition, reducing cognitive load for agents opens the possibility for digital agents to engage with more than one customer at a time. So more customers can be served with the same amount of agents, further increasing agent productivity while also improving customer satisfaction as their issues are addressed more quickly and accurately. With the power of machine learning, we can train UI augmentation features on any enterprise’s infrastructure, enabling those companies to quickly get the benefits of agent augmentation.