SlideShare ist ein Scribd-Unternehmen logo
1 von 20
VLIW PROCESSORS
Department of E &TC, MITCOE, Pu
Introduction
o Very long instruction word or VLIW refers to a
processor architecture designed to take advantage of
instruction level parallelism
o Instruction of a VLIW processor consists of multiple
independent operations grouped together.
o There are Multiple Independent Functional Units in
VLIW processor architecture.
o Each operation in the instruction is aligned to a
functional unit.
o All functional units share the use of a common large
register file.
o This type of processor architecture is intended to allow
higher performance without the inherent complexity of
Different Approaches
Other approaches to improving performance in processor architectures
:
o Pipelining
Breaking up instructions into sub-steps so that instructions can be
executed partially at the same time
o Superscalar architectures
Dispatching individual instructions to be executed completely
independently in different parts of the processor
o Out-of-order execution
Executing instructions in an order different from the program
Instruction Level Parallelism (ILP )
o Instruction-level parallelism (ILP) is a measure of how
many of the operations in a computer program can be
performed simultaneously.
o The overlap among instructions is called instruction level
parallelism.
o Ordinary programs are typically written under a sequential
execution model where instructions execute one after the
other and in the order specified by the programmer.
o Goal of compiler and processor designers implementing ILP
is to identify and take advantage of as much ILP as possible.
What is ILP? (Example)
Consider the following program:
op 1 e = a + b
op2 f = c + d
op3 m = e * f
o Operation 3 depends on the results of operations 1 and 2, so
it cannot be calculated until both of them are completed
o However, operations 1 and 2 do not depend on any other
operation, so they can be calculated simultaneously
o If we assume that each operation can be completed in one
unit of time then these three instructions can be completed
in a total of two units of time
o giving an ILP of 3/2.
VLIW Compiler
o Compiler is responsible for static scheduling of instructions in
VLIW processor.
o Compiler finds out which operations can be executed in parallel
in the program.
o It groups together these operations in single instruction which is
the very large instruction word.
o Compiler ensures that an operation is not issued before its
operands are ready.
VLIW Instruction
o One VLIW instruction word encodes multiple operations which
allows them to be initiated in a single clock cycle.
o The operands and the operation to be performed by the various
functional units are specified in the instruction itself.
o One instruction encodes at least one operation for each execution
unit of the device.
o So length of the instruction increases with the number of execution
units
o To accommodate these operation fields, VLIW instructions are
usually at least 64 bits wide, and on some architectures are much
wider up to 1024 bits.
VLIW Instruction
Add r1,r2,r3; Sub r4,r5,r6; Ld r7,data; St r8,data
REGESTER FILES
ALU ALU
LOAD
/STORE
LOAD
/STORE
ILP in VLIW
o Consider the computation of y = a1x1 + a2x2 + a3x3
On a sequential processor On the VLIW processor with
2 load/store units, 1 multiply unit
and 1 add unit
cycle 1: load a1
cycle 2: load x1
cycle 3: load a2
cycle 4: load x2
cycle 5: multiply z1 a1 x1
cycle 6: multiply z2 a2 x2
cycle 7: add y z1 z2
cycle 8: load a3
cycle 9: load x3
cycle 10: multiply z1 a3 x3
cycle 11: add y y z2
cycle 1: load a1
load x1
cycle 2: load a2
load x2
Multiply z1 a1 x1
cycle 3: load a3
load x3
Multiply z2 a2 x2
cycle 4: multiply z3 a3 x3
add y z1 z2
cycle 5: add y y z3
requires 11 cycles. requires 5 cycles.
Block Diagram
Diagram (Conceptual Instruction
Execution)
Working
o Long instruction words are fetched from the memory
o A common multi-ported register file for fetching the operands
and storing the results.
o Parallel random access to the register file is possible through
the read/write cross bar.
o Execution in the functional units is carried out concurrently
with the load/store operation of data between RAM and the
register file.
o One or multiple register files for FX and FP data.
o Rely on compiler to find parallelism and schedule dependency
free program code.
Difference Between VLIW &
Superscalar Architecture
VLIW vs. Superscalar
Architecture
o Instruction formulation
 Superscalar:
⁻ Receive conventional instructions conceived for sequential
processors.
 VLIW:
⁻ Receive long instruction words, each comprising a field (or
opcode) for each execution unit.
⁻ Instruction word length depends number of execution units and
code length to control each unit (such as opcode length,
registers).
⁻ Typical word length is 64 – 1024 bits, much longer than
conventional machine word length.
VLIW vs. Superscalar
Architecture
o Instruction scheduling
 Superscalar:
⁻ Done dynamically at run-time by the hardware.
⁻ Data dependency is checked and resolved in
hardware.
⁻ Need a look ahead hardware window for instruction
fetch.
 VLIW:
⁻ Done statically at compile time by compiler.
⁻ Data dependency is checked by compiler.
⁻ In case of un-filled opcodes in a VLIW, memory
space and instruction bandwidth are wasted.
Comparison: CISC, RISC,
VLIW
ARCHITECTURE
CHARACTERIST
C
CISC RISC VLIW
Instruction Size Varies One size, usually 32 bits One size
Instruction
Semantics
Varies from simple to
complex; possibly
many
dependent operations
per instruction
Almost always one
simple
operation
Many simple,
independent
operations
Registers Few, sometimes
special
Many, general-purpose Many, general-
purpose
Hardware Design Exploit microcode
implementations
Exploit implementations
with one pipeline and &
no microcode
Exploit
implementations with
multiple pipelines, no
microcode &
no complex dispatch
logic
Advantages of VLIW
o Dependencies are determined by compiler and used to
schedule according to function unit latencies .
o Function units are assigned by compiler and correspond
to the position within the instruction packet.
o Reduces hardware complexity.
• Tasks such as decoding, data dependency detection,
instruction issues etc. becoming simple.
• Ensures potentially higher Clock Rate.
• Ensures Low power consumption
Disadvantages of VLIW
o Higher complexity of the compiler
o Compatibility across implementations : Compiler optimization
needs to consider technology dependent parameters such as
latencies and load-use time of cache.
o Unscheduled events (e.g. cache miss) stall entire processor .
o Code density: In case of un-filled opcodes in a VLIW, memory
space and instruction bandwidth are wasted i.e. low slot utilization.
o Code expansion: Causes high power consumption
Applications
o VLIW architecture is suitable for Digital Signal Processing
applications.
o Processing of media data like compression/decompression
of Image and speech data.
Examples of VLIW processor
o VLIW Mini supercomputers:
Multiflow TRACE 7/300, 14/300, 28/300
Multiflow TRACE /500
Cydrome Cydra 5
IBM Yorktown VLIW Computer
o Single-Chip VLIW Processors:
Intel iWarp, Philip’s LIFE Chips
o Single-Chip VLIW Media (through-put) Processors:
Trimedia, Chromatic, Micro-Unity
o DSP Processors (TI TMS320C6x )

Weitere ähnliche Inhalte

Was ist angesagt?

SOC System Design Approach
SOC System Design ApproachSOC System Design Approach
SOC System Design ApproachA B Shinde
 
Destination Sequenced Distance Vector Routing (DSDV)
Destination Sequenced Distance Vector Routing (DSDV)Destination Sequenced Distance Vector Routing (DSDV)
Destination Sequenced Distance Vector Routing (DSDV)ArunChokkalingam
 
Superscalar & superpipeline processor
Superscalar & superpipeline processorSuperscalar & superpipeline processor
Superscalar & superpipeline processorMuhammad Ishaq
 
Pipelining and vector processing
Pipelining and vector processingPipelining and vector processing
Pipelining and vector processingKamal Acharya
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture Haris456
 
Superscalar and VLIW architectures
Superscalar and VLIW architecturesSuperscalar and VLIW architectures
Superscalar and VLIW architecturesAmit Kumar Rathi
 
Real Time Kernels
Real Time KernelsReal Time Kernels
Real Time KernelsArnav Soni
 
INSTRUCTION LEVEL PARALLALISM
INSTRUCTION LEVEL PARALLALISMINSTRUCTION LEVEL PARALLALISM
INSTRUCTION LEVEL PARALLALISMKamran Ashraf
 
Wsn unit-1-ppt
Wsn unit-1-pptWsn unit-1-ppt
Wsn unit-1-pptSwathi Ch
 
Processor Organization and Architecture
Processor Organization and ArchitectureProcessor Organization and Architecture
Processor Organization and ArchitectureVinit Raut
 
Io t system management with
Io t system management withIo t system management with
Io t system management withxyxz
 
Classification of routing protocols
Classification of routing protocolsClassification of routing protocols
Classification of routing protocolsMenaga Selvaraj
 
Multiprocessor Systems
Multiprocessor SystemsMultiprocessor Systems
Multiprocessor Systemsvampugani
 
hierarchical bus system
 hierarchical bus system hierarchical bus system
hierarchical bus systemElvis Jonyo
 
Computer architecture multi processor
Computer architecture multi processorComputer architecture multi processor
Computer architecture multi processorMazin Alwaaly
 
Instruction Level Parallelism and Superscalar Processors
Instruction Level Parallelism and Superscalar ProcessorsInstruction Level Parallelism and Superscalar Processors
Instruction Level Parallelism and Superscalar ProcessorsSyed Zaid Irshad
 

Was ist angesagt? (20)

Parallel processing
Parallel processingParallel processing
Parallel processing
 
SOC System Design Approach
SOC System Design ApproachSOC System Design Approach
SOC System Design Approach
 
Superscalar Processor
Superscalar ProcessorSuperscalar Processor
Superscalar Processor
 
Destination Sequenced Distance Vector Routing (DSDV)
Destination Sequenced Distance Vector Routing (DSDV)Destination Sequenced Distance Vector Routing (DSDV)
Destination Sequenced Distance Vector Routing (DSDV)
 
Superscalar & superpipeline processor
Superscalar & superpipeline processorSuperscalar & superpipeline processor
Superscalar & superpipeline processor
 
Pipelining and vector processing
Pipelining and vector processingPipelining and vector processing
Pipelining and vector processing
 
Multithreading computer architecture
 Multithreading computer architecture  Multithreading computer architecture
Multithreading computer architecture
 
ARM Processors
ARM ProcessorsARM Processors
ARM Processors
 
Superscalar and VLIW architectures
Superscalar and VLIW architecturesSuperscalar and VLIW architectures
Superscalar and VLIW architectures
 
Parallel processing
Parallel processingParallel processing
Parallel processing
 
Real Time Kernels
Real Time KernelsReal Time Kernels
Real Time Kernels
 
INSTRUCTION LEVEL PARALLALISM
INSTRUCTION LEVEL PARALLALISMINSTRUCTION LEVEL PARALLALISM
INSTRUCTION LEVEL PARALLALISM
 
Wsn unit-1-ppt
Wsn unit-1-pptWsn unit-1-ppt
Wsn unit-1-ppt
 
Processor Organization and Architecture
Processor Organization and ArchitectureProcessor Organization and Architecture
Processor Organization and Architecture
 
Io t system management with
Io t system management withIo t system management with
Io t system management with
 
Classification of routing protocols
Classification of routing protocolsClassification of routing protocols
Classification of routing protocols
 
Multiprocessor Systems
Multiprocessor SystemsMultiprocessor Systems
Multiprocessor Systems
 
hierarchical bus system
 hierarchical bus system hierarchical bus system
hierarchical bus system
 
Computer architecture multi processor
Computer architecture multi processorComputer architecture multi processor
Computer architecture multi processor
 
Instruction Level Parallelism and Superscalar Processors
Instruction Level Parallelism and Superscalar ProcessorsInstruction Level Parallelism and Superscalar Processors
Instruction Level Parallelism and Superscalar Processors
 

Ähnlich wie VLIW Processors

Vliw and superscaler
Vliw and superscalerVliw and superscaler
Vliw and superscalerRafi Dar
 
Shown below is a VLIW system in which each long instruction word gen.pdf
Shown below is a VLIW system in which each long instruction word gen.pdfShown below is a VLIW system in which each long instruction word gen.pdf
Shown below is a VLIW system in which each long instruction word gen.pdfARCHANASTOREKOTA
 
Advanced Techniques for Exploiting ILP
Advanced Techniques for Exploiting ILPAdvanced Techniques for Exploiting ILP
Advanced Techniques for Exploiting ILPA B Shinde
 
Advanced processor principles
Advanced processor principlesAdvanced processor principles
Advanced processor principlesDhaval Bagal
 
Parallel Computing
Parallel ComputingParallel Computing
Parallel ComputingMohsin Bhat
 
VLIW(Very Long Instruction Word)
VLIW(Very Long Instruction Word)VLIW(Very Long Instruction Word)
VLIW(Very Long Instruction Word)Pragnya Dash
 
Crussoe proc
Crussoe procCrussoe proc
Crussoe proctyadi
 
Processor Verification Using Open Source Tools and the GCC Regression Test Suite
Processor Verification Using Open Source Tools and the GCC Regression Test SuiteProcessor Verification Using Open Source Tools and the GCC Regression Test Suite
Processor Verification Using Open Source Tools and the GCC Regression Test SuiteDVClub
 
Fpga based 128 bit customised vliw processor for executing dual scalarvector ...
Fpga based 128 bit customised vliw processor for executing dual scalarvector ...Fpga based 128 bit customised vliw processor for executing dual scalarvector ...
Fpga based 128 bit customised vliw processor for executing dual scalarvector ...eSAT Publishing House
 
Pipeline Mechanism
Pipeline MechanismPipeline Mechanism
Pipeline MechanismAshik Iqbal
 
Alley vsu functional_coverage_1f
Alley vsu functional_coverage_1fAlley vsu functional_coverage_1f
Alley vsu functional_coverage_1fObsidian Software
 
Algoritmi e Calcolo Parallelo 2012/2013 - OpenMP
Algoritmi e Calcolo Parallelo 2012/2013 - OpenMPAlgoritmi e Calcolo Parallelo 2012/2013 - OpenMP
Algoritmi e Calcolo Parallelo 2012/2013 - OpenMPPier Luca Lanzi
 
Unit 5 Advanced Computer Architecture
Unit 5 Advanced Computer ArchitectureUnit 5 Advanced Computer Architecture
Unit 5 Advanced Computer ArchitectureBalaji Vignesh
 

Ähnlich wie VLIW Processors (20)

Vliw and superscaler
Vliw and superscalerVliw and superscaler
Vliw and superscaler
 
Lec1 final
Lec1 finalLec1 final
Lec1 final
 
Shown below is a VLIW system in which each long instruction word gen.pdf
Shown below is a VLIW system in which each long instruction word gen.pdfShown below is a VLIW system in which each long instruction word gen.pdf
Shown below is a VLIW system in which each long instruction word gen.pdf
 
1.My Presentation.pptx
1.My Presentation.pptx1.My Presentation.pptx
1.My Presentation.pptx
 
Advanced Techniques for Exploiting ILP
Advanced Techniques for Exploiting ILPAdvanced Techniques for Exploiting ILP
Advanced Techniques for Exploiting ILP
 
Advanced processor principles
Advanced processor principlesAdvanced processor principles
Advanced processor principles
 
Parallel Computing
Parallel ComputingParallel Computing
Parallel Computing
 
VLIW(Very Long Instruction Word)
VLIW(Very Long Instruction Word)VLIW(Very Long Instruction Word)
VLIW(Very Long Instruction Word)
 
Assembly p1
Assembly p1Assembly p1
Assembly p1
 
Cisc(a022& a023)
Cisc(a022& a023)Cisc(a022& a023)
Cisc(a022& a023)
 
Crussoe proc
Crussoe procCrussoe proc
Crussoe proc
 
Difficulties in Pipelining
Difficulties in PipeliningDifficulties in Pipelining
Difficulties in Pipelining
 
Processor Verification Using Open Source Tools and the GCC Regression Test Suite
Processor Verification Using Open Source Tools and the GCC Regression Test SuiteProcessor Verification Using Open Source Tools and the GCC Regression Test Suite
Processor Verification Using Open Source Tools and the GCC Regression Test Suite
 
Fpga based 128 bit customised vliw processor for executing dual scalarvector ...
Fpga based 128 bit customised vliw processor for executing dual scalarvector ...Fpga based 128 bit customised vliw processor for executing dual scalarvector ...
Fpga based 128 bit customised vliw processor for executing dual scalarvector ...
 
Pipeline Mechanism
Pipeline MechanismPipeline Mechanism
Pipeline Mechanism
 
Alley vsu functional_coverage_1f
Alley vsu functional_coverage_1fAlley vsu functional_coverage_1f
Alley vsu functional_coverage_1f
 
Algoritmi e Calcolo Parallelo 2012/2013 - OpenMP
Algoritmi e Calcolo Parallelo 2012/2013 - OpenMPAlgoritmi e Calcolo Parallelo 2012/2013 - OpenMP
Algoritmi e Calcolo Parallelo 2012/2013 - OpenMP
 
Unit 5 Advanced Computer Architecture
Unit 5 Advanced Computer ArchitectureUnit 5 Advanced Computer Architecture
Unit 5 Advanced Computer Architecture
 
Vliw or epic
Vliw or epicVliw or epic
Vliw or epic
 
W04505116121
W04505116121W04505116121
W04505116121
 

Mehr von Sudhanshu Janwadkar

Keypad Interfacing with 8051 Microcontroller
Keypad Interfacing with 8051 MicrocontrollerKeypad Interfacing with 8051 Microcontroller
Keypad Interfacing with 8051 MicrocontrollerSudhanshu Janwadkar
 
ASIC design Flow (Digital Design)
ASIC design Flow (Digital Design)ASIC design Flow (Digital Design)
ASIC design Flow (Digital Design)Sudhanshu Janwadkar
 
Fpga architectures and applications
Fpga architectures and applicationsFpga architectures and applications
Fpga architectures and applicationsSudhanshu Janwadkar
 
Introduction to 8051 Timer/Counter
Introduction to 8051 Timer/CounterIntroduction to 8051 Timer/Counter
Introduction to 8051 Timer/CounterSudhanshu Janwadkar
 
Architecture of the Intel 8051 Microcontroller
Architecture of the Intel 8051 MicrocontrollerArchitecture of the Intel 8051 Microcontroller
Architecture of the Intel 8051 MicrocontrollerSudhanshu Janwadkar
 
Introduction to Embedded Systems
Introduction to Embedded SystemsIntroduction to Embedded Systems
Introduction to Embedded SystemsSudhanshu Janwadkar
 
Interconnects in Reconfigurable Architectures
Interconnects in Reconfigurable ArchitecturesInterconnects in Reconfigurable Architectures
Interconnects in Reconfigurable ArchitecturesSudhanshu Janwadkar
 
Design and Implementation of a GPS based Personal Tracking System
Design and Implementation of a GPS based Personal Tracking SystemDesign and Implementation of a GPS based Personal Tracking System
Design and Implementation of a GPS based Personal Tracking SystemSudhanshu Janwadkar
 
Embedded Logic Flip-Flops: A Conceptual Review
Embedded Logic Flip-Flops: A Conceptual ReviewEmbedded Logic Flip-Flops: A Conceptual Review
Embedded Logic Flip-Flops: A Conceptual ReviewSudhanshu Janwadkar
 

Mehr von Sudhanshu Janwadkar (20)

DSP Processors versus ASICs
DSP Processors versus ASICsDSP Processors versus ASICs
DSP Processors versus ASICs
 
Keypad Interfacing with 8051 Microcontroller
Keypad Interfacing with 8051 MicrocontrollerKeypad Interfacing with 8051 Microcontroller
Keypad Interfacing with 8051 Microcontroller
 
ASIC design Flow (Digital Design)
ASIC design Flow (Digital Design)ASIC design Flow (Digital Design)
ASIC design Flow (Digital Design)
 
Fpga architectures and applications
Fpga architectures and applicationsFpga architectures and applications
Fpga architectures and applications
 
LCD Interacing with 8051
LCD Interacing with 8051LCD Interacing with 8051
LCD Interacing with 8051
 
Interrupts in 8051
Interrupts in 8051Interrupts in 8051
Interrupts in 8051
 
Serial Communication in 8051
Serial Communication in 8051Serial Communication in 8051
Serial Communication in 8051
 
SPI Bus Protocol
SPI Bus ProtocolSPI Bus Protocol
SPI Bus Protocol
 
I2C Protocol
I2C ProtocolI2C Protocol
I2C Protocol
 
Introduction to 8051 Timer/Counter
Introduction to 8051 Timer/CounterIntroduction to 8051 Timer/Counter
Introduction to 8051 Timer/Counter
 
Intel 8051 Programming in C
Intel 8051 Programming in CIntel 8051 Programming in C
Intel 8051 Programming in C
 
Hardware View of Intel 8051
Hardware View of Intel 8051Hardware View of Intel 8051
Hardware View of Intel 8051
 
Architecture of the Intel 8051 Microcontroller
Architecture of the Intel 8051 MicrocontrollerArchitecture of the Intel 8051 Microcontroller
Architecture of the Intel 8051 Microcontroller
 
Introduction to Embedded Systems
Introduction to Embedded SystemsIntroduction to Embedded Systems
Introduction to Embedded Systems
 
CMOS Logic
CMOS LogicCMOS Logic
CMOS Logic
 
Interconnects in Reconfigurable Architectures
Interconnects in Reconfigurable ArchitecturesInterconnects in Reconfigurable Architectures
Interconnects in Reconfigurable Architectures
 
Introduction to FPGAs
Introduction to FPGAsIntroduction to FPGAs
Introduction to FPGAs
 
Design and Implementation of a GPS based Personal Tracking System
Design and Implementation of a GPS based Personal Tracking SystemDesign and Implementation of a GPS based Personal Tracking System
Design and Implementation of a GPS based Personal Tracking System
 
Embedded Logic Flip-Flops: A Conceptual Review
Embedded Logic Flip-Flops: A Conceptual ReviewEmbedded Logic Flip-Flops: A Conceptual Review
Embedded Logic Flip-Flops: A Conceptual Review
 
Pass Transistor Logic
Pass Transistor LogicPass Transistor Logic
Pass Transistor Logic
 

Kürzlich hochgeladen

"Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ..."Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ...Erbil Polytechnic University
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solidnamansinghjarodiya
 
Autonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.pptAutonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.pptbibisarnayak0
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONjhunlian
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...121011101441
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
home automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasadhome automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasadaditya806802
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the weldingMuhammadUzairLiaqat
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfChristianCDAM
 
Research Methodology for Engineering pdf
Research Methodology for Engineering pdfResearch Methodology for Engineering pdf
Research Methodology for Engineering pdfCaalaaAbdulkerim
 
BSNL Internship Training presentation.pptx
BSNL Internship Training presentation.pptxBSNL Internship Training presentation.pptx
BSNL Internship Training presentation.pptxNiranjanYadav41
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfRajuKanojiya4
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdfHafizMudaserAhmad
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...VICTOR MAESTRE RAMIREZ
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating SystemRashmi Bhat
 

Kürzlich hochgeladen (20)

"Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ..."Exploring the Essential Functions and Design Considerations of Spillways in ...
"Exploring the Essential Functions and Design Considerations of Spillways in ...
 
Engineering Drawing section of solid
Engineering Drawing     section of solidEngineering Drawing     section of solid
Engineering Drawing section of solid
 
Autonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.pptAutonomous emergency braking system (aeb) ppt.ppt
Autonomous emergency braking system (aeb) ppt.ppt
 
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTIONTHE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
THE SENDAI FRAMEWORK FOR DISASTER RISK REDUCTION
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...Instrumentation, measurement and control of bio process parameters ( Temperat...
Instrumentation, measurement and control of bio process parameters ( Temperat...
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
home automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasadhome automation using Arduino by Aditya Prasad
home automation using Arduino by Aditya Prasad
 
welding defects observed during the welding
welding defects observed during the weldingwelding defects observed during the welding
welding defects observed during the welding
 
Ch10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdfCh10-Global Supply Chain - Cadena de Suministro.pdf
Ch10-Global Supply Chain - Cadena de Suministro.pdf
 
Research Methodology for Engineering pdf
Research Methodology for Engineering pdfResearch Methodology for Engineering pdf
Research Methodology for Engineering pdf
 
Designing pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptxDesigning pile caps according to ACI 318-19.pptx
Designing pile caps according to ACI 318-19.pptx
 
BSNL Internship Training presentation.pptx
BSNL Internship Training presentation.pptxBSNL Internship Training presentation.pptx
BSNL Internship Training presentation.pptx
 
National Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdfNational Level Hackathon Participation Certificate.pdf
National Level Hackathon Participation Certificate.pdf
 
11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf11. Properties of Liquid Fuels in Energy Engineering.pdf
11. Properties of Liquid Fuels in Energy Engineering.pdf
 
Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...Software and Systems Engineering Standards: Verification and Validation of Sy...
Software and Systems Engineering Standards: Verification and Validation of Sy...
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating System
 

VLIW Processors

  • 1. VLIW PROCESSORS Department of E &TC, MITCOE, Pu
  • 2. Introduction o Very long instruction word or VLIW refers to a processor architecture designed to take advantage of instruction level parallelism o Instruction of a VLIW processor consists of multiple independent operations grouped together. o There are Multiple Independent Functional Units in VLIW processor architecture. o Each operation in the instruction is aligned to a functional unit. o All functional units share the use of a common large register file. o This type of processor architecture is intended to allow higher performance without the inherent complexity of
  • 3. Different Approaches Other approaches to improving performance in processor architectures : o Pipelining Breaking up instructions into sub-steps so that instructions can be executed partially at the same time o Superscalar architectures Dispatching individual instructions to be executed completely independently in different parts of the processor o Out-of-order execution Executing instructions in an order different from the program
  • 4. Instruction Level Parallelism (ILP ) o Instruction-level parallelism (ILP) is a measure of how many of the operations in a computer program can be performed simultaneously. o The overlap among instructions is called instruction level parallelism. o Ordinary programs are typically written under a sequential execution model where instructions execute one after the other and in the order specified by the programmer. o Goal of compiler and processor designers implementing ILP is to identify and take advantage of as much ILP as possible.
  • 5. What is ILP? (Example) Consider the following program: op 1 e = a + b op2 f = c + d op3 m = e * f o Operation 3 depends on the results of operations 1 and 2, so it cannot be calculated until both of them are completed o However, operations 1 and 2 do not depend on any other operation, so they can be calculated simultaneously o If we assume that each operation can be completed in one unit of time then these three instructions can be completed in a total of two units of time o giving an ILP of 3/2.
  • 6. VLIW Compiler o Compiler is responsible for static scheduling of instructions in VLIW processor. o Compiler finds out which operations can be executed in parallel in the program. o It groups together these operations in single instruction which is the very large instruction word. o Compiler ensures that an operation is not issued before its operands are ready.
  • 7. VLIW Instruction o One VLIW instruction word encodes multiple operations which allows them to be initiated in a single clock cycle. o The operands and the operation to be performed by the various functional units are specified in the instruction itself. o One instruction encodes at least one operation for each execution unit of the device. o So length of the instruction increases with the number of execution units o To accommodate these operation fields, VLIW instructions are usually at least 64 bits wide, and on some architectures are much wider up to 1024 bits.
  • 8. VLIW Instruction Add r1,r2,r3; Sub r4,r5,r6; Ld r7,data; St r8,data REGESTER FILES ALU ALU LOAD /STORE LOAD /STORE
  • 9. ILP in VLIW o Consider the computation of y = a1x1 + a2x2 + a3x3 On a sequential processor On the VLIW processor with 2 load/store units, 1 multiply unit and 1 add unit cycle 1: load a1 cycle 2: load x1 cycle 3: load a2 cycle 4: load x2 cycle 5: multiply z1 a1 x1 cycle 6: multiply z2 a2 x2 cycle 7: add y z1 z2 cycle 8: load a3 cycle 9: load x3 cycle 10: multiply z1 a3 x3 cycle 11: add y y z2 cycle 1: load a1 load x1 cycle 2: load a2 load x2 Multiply z1 a1 x1 cycle 3: load a3 load x3 Multiply z2 a2 x2 cycle 4: multiply z3 a3 x3 add y z1 z2 cycle 5: add y y z3 requires 11 cycles. requires 5 cycles.
  • 12. Working o Long instruction words are fetched from the memory o A common multi-ported register file for fetching the operands and storing the results. o Parallel random access to the register file is possible through the read/write cross bar. o Execution in the functional units is carried out concurrently with the load/store operation of data between RAM and the register file. o One or multiple register files for FX and FP data. o Rely on compiler to find parallelism and schedule dependency free program code.
  • 13. Difference Between VLIW & Superscalar Architecture
  • 14. VLIW vs. Superscalar Architecture o Instruction formulation  Superscalar: ⁻ Receive conventional instructions conceived for sequential processors.  VLIW: ⁻ Receive long instruction words, each comprising a field (or opcode) for each execution unit. ⁻ Instruction word length depends number of execution units and code length to control each unit (such as opcode length, registers). ⁻ Typical word length is 64 – 1024 bits, much longer than conventional machine word length.
  • 15. VLIW vs. Superscalar Architecture o Instruction scheduling  Superscalar: ⁻ Done dynamically at run-time by the hardware. ⁻ Data dependency is checked and resolved in hardware. ⁻ Need a look ahead hardware window for instruction fetch.  VLIW: ⁻ Done statically at compile time by compiler. ⁻ Data dependency is checked by compiler. ⁻ In case of un-filled opcodes in a VLIW, memory space and instruction bandwidth are wasted.
  • 16. Comparison: CISC, RISC, VLIW ARCHITECTURE CHARACTERIST C CISC RISC VLIW Instruction Size Varies One size, usually 32 bits One size Instruction Semantics Varies from simple to complex; possibly many dependent operations per instruction Almost always one simple operation Many simple, independent operations Registers Few, sometimes special Many, general-purpose Many, general- purpose Hardware Design Exploit microcode implementations Exploit implementations with one pipeline and & no microcode Exploit implementations with multiple pipelines, no microcode & no complex dispatch logic
  • 17. Advantages of VLIW o Dependencies are determined by compiler and used to schedule according to function unit latencies . o Function units are assigned by compiler and correspond to the position within the instruction packet. o Reduces hardware complexity. • Tasks such as decoding, data dependency detection, instruction issues etc. becoming simple. • Ensures potentially higher Clock Rate. • Ensures Low power consumption
  • 18. Disadvantages of VLIW o Higher complexity of the compiler o Compatibility across implementations : Compiler optimization needs to consider technology dependent parameters such as latencies and load-use time of cache. o Unscheduled events (e.g. cache miss) stall entire processor . o Code density: In case of un-filled opcodes in a VLIW, memory space and instruction bandwidth are wasted i.e. low slot utilization. o Code expansion: Causes high power consumption
  • 19. Applications o VLIW architecture is suitable for Digital Signal Processing applications. o Processing of media data like compression/decompression of Image and speech data.
  • 20. Examples of VLIW processor o VLIW Mini supercomputers: Multiflow TRACE 7/300, 14/300, 28/300 Multiflow TRACE /500 Cydrome Cydra 5 IBM Yorktown VLIW Computer o Single-Chip VLIW Processors: Intel iWarp, Philip’s LIFE Chips o Single-Chip VLIW Media (through-put) Processors: Trimedia, Chromatic, Micro-Unity o DSP Processors (TI TMS320C6x )